[2023-10-07 23:50:50,107][50642] Saving configuration to ./train_atari/atari_amidar_APPO/config.json... [2023-10-07 23:50:50,473][50642] Rollout worker 0 uses device cpu [2023-10-07 23:50:50,474][50642] Rollout worker 1 uses device cpu [2023-10-07 23:50:50,474][50642] Rollout worker 2 uses device cpu [2023-10-07 23:50:50,475][50642] Rollout worker 3 uses device cpu [2023-10-07 23:50:50,475][50642] Rollout worker 4 uses device cpu [2023-10-07 23:50:50,476][50642] Rollout worker 5 uses device cpu [2023-10-07 23:50:50,476][50642] Rollout worker 6 uses device cpu [2023-10-07 23:50:50,477][50642] Rollout worker 7 uses device cpu [2023-10-07 23:50:50,477][50642] Rollout worker 8 uses device cpu [2023-10-07 23:50:50,478][50642] Rollout worker 9 uses device cpu [2023-10-07 23:50:50,478][50642] Rollout worker 10 uses device cpu [2023-10-07 23:50:50,478][50642] Rollout worker 11 uses device cpu [2023-10-07 23:50:50,479][50642] Rollout worker 12 uses device cpu [2023-10-07 23:50:50,479][50642] Rollout worker 13 uses device cpu [2023-10-07 23:50:50,480][50642] Rollout worker 14 uses device cpu [2023-10-07 23:50:50,480][50642] Rollout worker 15 uses device cpu [2023-10-07 23:50:50,755][50642] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-10-07 23:50:50,755][50642] InferenceWorker_p0-w0: min num requests: 2 [2023-10-07 23:50:50,758][50642] Using GPUs [1] for process 1 (actually maps to GPUs [1]) [2023-10-07 23:50:50,759][50642] InferenceWorker_p1-w0: min num requests: 2 [2023-10-07 23:50:50,805][50642] Starting all processes... [2023-10-07 23:50:50,806][50642] Starting process learner_proc0 [2023-10-07 23:50:52,520][50642] Starting process learner_proc1 [2023-10-07 23:50:52,524][51605] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-10-07 23:50:52,524][51605] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 [2023-10-07 23:50:52,542][51605] Num visible devices: 1 [2023-10-07 23:50:52,565][51605] Setting fixed seed 1234 [2023-10-07 23:50:52,566][51605] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-10-07 23:50:52,566][51605] Initializing actor-critic model on device cuda:0 [2023-10-07 23:50:52,566][51605] RunningMeanStd input shape: (4, 84, 84) [2023-10-07 23:50:52,567][51605] RunningMeanStd input shape: (1,) [2023-10-07 23:50:52,582][51605] ConvEncoder: input_channels=4 [2023-10-07 23:50:52,763][51605] Conv encoder output size: 512 [2023-10-07 23:50:52,766][51605] Created Actor Critic model with architecture: [2023-10-07 23:50:52,766][51605] ActorCriticSharedWeights( (obs_normalizer): ObservationNormalizer( (running_mean_std): RunningMeanStdDictInPlace( (running_mean_std): ModuleDict( (obs): RunningMeanStdInPlace() ) ) ) (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (encoder): MultiInputEncoder( (encoders): ModuleDict( (obs): ConvEncoder( (enc): RecursiveScriptModule( original_name=ConvEncoderImpl (conv_head): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Conv2d) (1): RecursiveScriptModule(original_name=ReLU) (2): RecursiveScriptModule(original_name=Conv2d) (3): RecursiveScriptModule(original_name=ReLU) (4): RecursiveScriptModule(original_name=Conv2d) (5): RecursiveScriptModule(original_name=ReLU) ) (mlp_layers): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Linear) (1): RecursiveScriptModule(original_name=ReLU) ) ) ) ) ) (core): ModelCoreIdentity() (decoder): MlpDecoder( (mlp): Identity() ) (critic_linear): Linear(in_features=512, out_features=1, bias=True) (action_parameterization): ActionParameterizationDefault( (distribution_linear): Linear(in_features=512, out_features=10, bias=True) ) ) [2023-10-07 23:50:53,308][51605] Using optimizer [2023-10-07 23:50:53,309][51605] No checkpoints found [2023-10-07 23:50:53,309][51605] Did not load from checkpoint, starting from scratch! [2023-10-07 23:50:53,310][51605] Initialized policy 0 weights for model version 0 [2023-10-07 23:50:53,311][51605] LearnerWorker_p0 finished initialization! [2023-10-07 23:50:53,312][51605] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-10-07 23:50:54,278][50642] Starting all processes... [2023-10-07 23:50:54,282][51710] Using GPUs [1] for process 1 (actually maps to GPUs [1]) [2023-10-07 23:50:54,282][51710] Set environment var CUDA_VISIBLE_DEVICES to '1' (GPU indices [1]) for learning process 1 [2023-10-07 23:50:54,286][50642] Starting process inference_proc0-0 [2023-10-07 23:50:54,287][50642] Starting process inference_proc1-0 [2023-10-07 23:50:54,287][50642] Starting process rollout_proc0 [2023-10-07 23:50:54,300][51710] Num visible devices: 1 [2023-10-07 23:50:54,287][50642] Starting process rollout_proc1 [2023-10-07 23:50:54,287][50642] Starting process rollout_proc2 [2023-10-07 23:50:54,288][50642] Starting process rollout_proc3 [2023-10-07 23:50:54,288][50642] Starting process rollout_proc4 [2023-10-07 23:50:54,291][50642] Starting process rollout_proc5 [2023-10-07 23:50:54,324][51710] Setting fixed seed 1234 [2023-10-07 23:50:54,326][51710] Using GPUs [0] for process 1 (actually maps to GPUs [1]) [2023-10-07 23:50:54,294][50642] Starting process rollout_proc6 [2023-10-07 23:50:54,326][51710] Initializing actor-critic model on device cuda:0 [2023-10-07 23:50:54,326][51710] RunningMeanStd input shape: (4, 84, 84) [2023-10-07 23:50:54,327][51710] RunningMeanStd input shape: (1,) [2023-10-07 23:50:54,295][50642] Starting process rollout_proc7 [2023-10-07 23:50:54,296][50642] Starting process rollout_proc8 [2023-10-07 23:50:54,296][50642] Starting process rollout_proc9 [2023-10-07 23:50:54,297][50642] Starting process rollout_proc10 [2023-10-07 23:50:54,300][50642] Starting process rollout_proc11 [2023-10-07 23:50:54,340][51710] ConvEncoder: input_channels=4 [2023-10-07 23:50:54,302][50642] Starting process rollout_proc12 [2023-10-07 23:50:54,302][50642] Starting process rollout_proc13 [2023-10-07 23:50:54,796][51710] Conv encoder output size: 512 [2023-10-07 23:50:54,799][51710] Created Actor Critic model with architecture: [2023-10-07 23:50:54,799][51710] ActorCriticSharedWeights( (obs_normalizer): ObservationNormalizer( (running_mean_std): RunningMeanStdDictInPlace( (running_mean_std): ModuleDict( (obs): RunningMeanStdInPlace() ) ) ) (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (encoder): MultiInputEncoder( (encoders): ModuleDict( (obs): ConvEncoder( (enc): RecursiveScriptModule( original_name=ConvEncoderImpl (conv_head): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Conv2d) (1): RecursiveScriptModule(original_name=ReLU) (2): RecursiveScriptModule(original_name=Conv2d) (3): RecursiveScriptModule(original_name=ReLU) (4): RecursiveScriptModule(original_name=Conv2d) (5): RecursiveScriptModule(original_name=ReLU) ) (mlp_layers): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Linear) (1): RecursiveScriptModule(original_name=ReLU) ) ) ) ) ) (core): ModelCoreIdentity() (decoder): MlpDecoder( (mlp): Identity() ) (critic_linear): Linear(in_features=512, out_features=1, bias=True) (action_parameterization): ActionParameterizationDefault( (distribution_linear): Linear(in_features=512, out_features=10, bias=True) ) ) [2023-10-07 23:50:55,435][51710] Using optimizer [2023-10-07 23:50:55,435][51710] No checkpoints found [2023-10-07 23:50:55,436][51710] Did not load from checkpoint, starting from scratch! [2023-10-07 23:50:55,436][51710] Initialized policy 1 weights for model version 0 [2023-10-07 23:50:55,438][51710] LearnerWorker_p1 finished initialization! [2023-10-07 23:50:55,438][51710] Using GPUs [0] for process 1 (actually maps to GPUs [1]) [2023-10-07 23:50:56,468][50642] Starting process rollout_proc14 [2023-10-07 23:50:56,472][52105] Worker 10 uses CPU cores [20, 21] [2023-10-07 23:50:56,535][50642] Starting process rollout_proc15 [2023-10-07 23:50:56,541][52099] Worker 3 uses CPU cores [6, 7] [2023-10-07 23:50:56,583][52101] Worker 6 uses CPU cores [12, 13] [2023-10-07 23:50:56,596][52108] Worker 13 uses CPU cores [26, 27] [2023-10-07 23:50:56,603][52107] Worker 12 uses CPU cores [24, 25] [2023-10-07 23:50:56,625][52102] Worker 8 uses CPU cores [16, 17] [2023-10-07 23:50:56,811][52096] Worker 2 uses CPU cores [4, 5] [2023-10-07 23:50:56,820][52103] Worker 7 uses CPU cores [14, 15] [2023-10-07 23:50:56,878][52106] Worker 11 uses CPU cores [22, 23] [2023-10-07 23:50:56,906][52098] Worker 4 uses CPU cores [8, 9] [2023-10-07 23:50:56,989][52060] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-10-07 23:50:56,989][52060] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2023-10-07 23:50:57,011][52060] Num visible devices: 1 [2023-10-07 23:50:57,024][52059] Using GPUs [1] for process 1 (actually maps to GPUs [1]) [2023-10-07 23:50:57,024][52059] Set environment var CUDA_VISIBLE_DEVICES to '1' (GPU indices [1]) for inference process 1 [2023-10-07 23:50:57,036][52100] Worker 5 uses CPU cores [10, 11] [2023-10-07 23:50:57,044][52059] Num visible devices: 1 [2023-10-07 23:50:57,116][52095] Worker 1 uses CPU cores [2, 3] [2023-10-07 23:50:57,116][52061] Worker 0 uses CPU cores [0, 1] [2023-10-07 23:50:57,127][52104] Worker 9 uses CPU cores [18, 19] [2023-10-07 23:50:57,646][52060] RunningMeanStd input shape: (4, 84, 84) [2023-10-07 23:50:57,647][52060] RunningMeanStd input shape: (1,) [2023-10-07 23:50:57,658][52060] ConvEncoder: input_channels=4 [2023-10-07 23:50:57,669][52059] RunningMeanStd input shape: (4, 84, 84) [2023-10-07 23:50:57,669][52059] RunningMeanStd input shape: (1,) [2023-10-07 23:50:57,681][52059] ConvEncoder: input_channels=4 [2023-10-07 23:50:57,761][52060] Conv encoder output size: 512 [2023-10-07 23:50:57,783][52059] Conv encoder output size: 512 [2023-10-07 23:50:58,478][52728] Worker 14 uses CPU cores [28, 29] [2023-10-07 23:50:58,513][50642] Inference worker 0-0 is ready! [2023-10-07 23:50:58,515][52796] Worker 15 uses CPU cores [30, 31] [2023-10-07 23:50:58,514][50642] Inference worker 1-0 is ready! [2023-10-07 23:50:58,515][50642] All inference workers are ready! Signal rollout workers to start! [2023-10-07 23:50:58,516][52101] EnvRunner 6-0 uses policy 0 [2023-10-07 23:50:58,516][52061] EnvRunner 0-0 uses policy 0 [2023-10-07 23:50:58,517][52103] EnvRunner 7-0 uses policy 1 [2023-10-07 23:50:58,516][52108] EnvRunner 13-0 uses policy 1 [2023-10-07 23:50:58,516][52096] EnvRunner 2-0 uses policy 0 [2023-10-07 23:50:58,517][52098] EnvRunner 4-0 uses policy 0 [2023-10-07 23:50:58,517][52095] EnvRunner 1-0 uses policy 1 [2023-10-07 23:50:58,517][52102] EnvRunner 8-0 uses policy 0 [2023-10-07 23:50:58,517][52100] EnvRunner 5-0 uses policy 1 [2023-10-07 23:50:58,517][52104] EnvRunner 9-0 uses policy 1 [2023-10-07 23:50:58,517][52106] EnvRunner 11-0 uses policy 1 [2023-10-07 23:50:58,517][52105] EnvRunner 10-0 uses policy 0 [2023-10-07 23:50:58,517][50642] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan, 1: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-10-07 23:50:58,517][52107] EnvRunner 12-0 uses policy 0 [2023-10-07 23:50:58,517][52099] EnvRunner 3-0 uses policy 1 [2023-10-07 23:50:58,648][52728] EnvRunner 14-0 uses policy 0 [2023-10-07 23:50:58,748][52796] EnvRunner 15-0 uses policy 1 [2023-10-07 23:51:00,742][50642] Heartbeat connected on Batcher_0 [2023-10-07 23:51:00,745][50642] Heartbeat connected on LearnerWorker_p0 [2023-10-07 23:51:00,748][50642] Heartbeat connected on Batcher_1 [2023-10-07 23:51:00,751][50642] Heartbeat connected on LearnerWorker_p1 [2023-10-07 23:51:00,760][50642] Heartbeat connected on InferenceWorker_p0-w0 [2023-10-07 23:51:00,760][50642] Heartbeat connected on InferenceWorker_p1-w0 [2023-10-07 23:51:00,762][50642] Heartbeat connected on RolloutWorker_w0 [2023-10-07 23:51:00,768][50642] Heartbeat connected on RolloutWorker_w1 [2023-10-07 23:51:00,768][50642] Heartbeat connected on RolloutWorker_w2 [2023-10-07 23:51:00,773][50642] Heartbeat connected on RolloutWorker_w3 [2023-10-07 23:51:00,775][50642] Heartbeat connected on RolloutWorker_w4 [2023-10-07 23:51:00,779][50642] Heartbeat connected on RolloutWorker_w6 [2023-10-07 23:51:00,780][50642] Heartbeat connected on RolloutWorker_w5 [2023-10-07 23:51:00,783][50642] Heartbeat connected on RolloutWorker_w7 [2023-10-07 23:51:00,787][50642] Heartbeat connected on RolloutWorker_w9 [2023-10-07 23:51:00,790][50642] Heartbeat connected on RolloutWorker_w8 [2023-10-07 23:51:00,791][50642] Heartbeat connected on RolloutWorker_w10 [2023-10-07 23:51:00,792][50642] Heartbeat connected on RolloutWorker_w11 [2023-10-07 23:51:00,799][50642] Heartbeat connected on RolloutWorker_w13 [2023-10-07 23:51:00,799][50642] Heartbeat connected on RolloutWorker_w12 [2023-10-07 23:51:00,801][50642] Heartbeat connected on RolloutWorker_w14 [2023-10-07 23:51:00,805][50642] Heartbeat connected on RolloutWorker_w15 [2023-10-07 23:51:01,210][50642] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 563.6, 1: 577.7. Samples: 3074. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-10-07 23:51:01,211][50642] Avg episode reward: [(0, '0.500'), (1, '0.000')] [2023-10-07 23:51:06,210][50642] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 936.2, 1: 976.5. Samples: 14714. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-10-07 23:51:06,211][50642] Avg episode reward: [(0, '0.347'), (1, '0.203')] [2023-10-07 23:51:08,338][52059] Updated weights for policy 1, policy_version 10 (0.0009) [2023-10-07 23:51:08,696][52059] Updated weights for policy 1, policy_version 20 (0.0007) [2023-10-07 23:51:09,038][52060] Updated weights for policy 0, policy_version 10 (0.0009) [2023-10-07 23:51:09,062][52059] Updated weights for policy 1, policy_version 30 (0.0009) [2023-10-07 23:51:09,406][52060] Updated weights for policy 0, policy_version 20 (0.0008) [2023-10-07 23:51:09,783][52060] Updated weights for policy 0, policy_version 30 (0.0010) [2023-10-07 23:51:11,210][50642] Fps is (10 sec: 6553.7, 60 sec: 5163.1, 300 sec: 5163.1). Total num frames: 65536. Throughput: 0: 1223.3, 1: 1244.0. Samples: 31318. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-07 23:51:11,211][50642] Avg episode reward: [(0, '0.210'), (1, '0.160')] [2023-10-07 23:51:11,518][52059] Updated weights for policy 1, policy_version 40 (0.0009) [2023-10-07 23:51:11,876][52059] Updated weights for policy 1, policy_version 50 (0.0009) [2023-10-07 23:51:11,965][52060] Updated weights for policy 0, policy_version 40 (0.0009) [2023-10-07 23:51:12,245][52059] Updated weights for policy 1, policy_version 60 (0.0007) [2023-10-07 23:51:12,338][52060] Updated weights for policy 0, policy_version 50 (0.0008) [2023-10-07 23:51:12,708][52060] Updated weights for policy 0, policy_version 60 (0.0008) [2023-10-07 23:51:15,755][52059] Updated weights for policy 1, policy_version 70 (0.0009) [2023-10-07 23:51:16,127][52059] Updated weights for policy 1, policy_version 80 (0.0008) [2023-10-07 23:51:16,208][52060] Updated weights for policy 0, policy_version 70 (0.0009) [2023-10-07 23:51:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 7408.0, 300 sec: 7408.0). Total num frames: 131072. Throughput: 0: 1466.8, 1: 1479.7. Samples: 52132. Policy #0 lag: (min: 33.0, avg: 33.0, max: 33.0) [2023-10-07 23:51:16,211][50642] Avg episode reward: [(0, '0.230'), (1, '0.180')] [2023-10-07 23:51:16,498][52059] Updated weights for policy 1, policy_version 90 (0.0007) [2023-10-07 23:51:16,579][52060] Updated weights for policy 0, policy_version 80 (0.0009) [2023-10-07 23:51:16,937][52060] Updated weights for policy 0, policy_version 90 (0.0008) [2023-10-07 23:51:20,024][52059] Updated weights for policy 1, policy_version 100 (0.0008) [2023-10-07 23:51:20,378][52059] Updated weights for policy 1, policy_version 110 (0.0010) [2023-10-07 23:51:20,543][52060] Updated weights for policy 0, policy_version 100 (0.0008) [2023-10-07 23:51:20,741][52059] Updated weights for policy 1, policy_version 120 (0.0008) [2023-10-07 23:51:20,917][52060] Updated weights for policy 0, policy_version 110 (0.0008) [2023-10-07 23:51:21,210][50642] Fps is (10 sec: 16383.8, 60 sec: 10107.7, 300 sec: 10107.7). Total num frames: 229376. Throughput: 0: 1349.9, 1: 1385.9. Samples: 62084. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:51:21,211][50642] Avg episode reward: [(0, '0.340'), (1, '0.190')] [2023-10-07 23:51:21,212][51710] Saving new best policy, reward=0.190! [2023-10-07 23:51:21,293][52060] Updated weights for policy 0, policy_version 120 (0.0008) [2023-10-07 23:51:24,771][52059] Updated weights for policy 1, policy_version 130 (0.0008) [2023-10-07 23:51:25,138][52059] Updated weights for policy 1, policy_version 140 (0.0007) [2023-10-07 23:51:25,161][52060] Updated weights for policy 0, policy_version 130 (0.0008) [2023-10-07 23:51:25,506][52059] Updated weights for policy 1, policy_version 150 (0.0009) [2023-10-07 23:51:25,525][52060] Updated weights for policy 0, policy_version 140 (0.0008) [2023-10-07 23:51:25,864][52059] Updated weights for policy 1, policy_version 160 (0.0009) [2023-10-07 23:51:25,902][52060] Updated weights for policy 0, policy_version 150 (0.0008) [2023-10-07 23:51:26,210][50642] Fps is (10 sec: 16384.1, 60 sec: 10649.3, 300 sec: 10649.3). Total num frames: 294912. Throughput: 0: 1491.8, 1: 1513.9. Samples: 83236. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) [2023-10-07 23:51:26,211][50642] Avg episode reward: [(0, '0.570'), (1, '0.320')] [2023-10-07 23:51:26,211][51710] Saving new best policy, reward=0.320! [2023-10-07 23:51:26,269][51605] Saving new best policy, reward=0.570! [2023-10-07 23:51:26,270][52060] Updated weights for policy 0, policy_version 160 (0.0008) [2023-10-07 23:51:29,840][52059] Updated weights for policy 1, policy_version 170 (0.0010) [2023-10-07 23:51:30,207][52059] Updated weights for policy 1, policy_version 180 (0.0009) [2023-10-07 23:51:30,319][52060] Updated weights for policy 0, policy_version 170 (0.0009) [2023-10-07 23:51:30,570][52059] Updated weights for policy 1, policy_version 190 (0.0008) [2023-10-07 23:51:30,675][52060] Updated weights for policy 0, policy_version 180 (0.0008) [2023-10-07 23:51:31,043][52060] Updated weights for policy 0, policy_version 190 (0.0011) [2023-10-07 23:51:31,210][50642] Fps is (10 sec: 16384.1, 60 sec: 12027.4, 300 sec: 12027.4). Total num frames: 393216. Throughput: 0: 1558.2, 1: 1573.5. Samples: 102388. Policy #0 lag: (min: 31.0, avg: 42.5, max: 63.0) [2023-10-07 23:51:31,211][50642] Avg episode reward: [(0, '0.740'), (1, '0.240')] [2023-10-07 23:51:31,215][51605] Saving new best policy, reward=0.740! [2023-10-07 23:51:34,276][52059] Updated weights for policy 1, policy_version 200 (0.0007) [2023-10-07 23:51:34,641][52059] Updated weights for policy 1, policy_version 210 (0.0007) [2023-10-07 23:51:35,005][52059] Updated weights for policy 1, policy_version 220 (0.0007) [2023-10-07 23:51:35,044][52060] Updated weights for policy 0, policy_version 200 (0.0008) [2023-10-07 23:51:35,411][52060] Updated weights for policy 0, policy_version 210 (0.0010) [2023-10-07 23:51:35,777][52060] Updated weights for policy 0, policy_version 220 (0.0010) [2023-10-07 23:51:36,210][50642] Fps is (10 sec: 16383.9, 60 sec: 12170.7, 300 sec: 12170.7). Total num frames: 458752. Throughput: 0: 1499.6, 1: 1528.9. Samples: 114152. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:51:36,211][50642] Avg episode reward: [(0, '1.010'), (1, '0.430')] [2023-10-07 23:51:36,212][51605] Saving new best policy, reward=1.010! [2023-10-07 23:51:36,212][51710] Saving new best policy, reward=0.430! [2023-10-07 23:51:38,991][52059] Updated weights for policy 1, policy_version 230 (0.0007) [2023-10-07 23:51:39,361][52059] Updated weights for policy 1, policy_version 240 (0.0009) [2023-10-07 23:51:39,664][52060] Updated weights for policy 0, policy_version 230 (0.0007) [2023-10-07 23:51:39,721][52059] Updated weights for policy 1, policy_version 250 (0.0007) [2023-10-07 23:51:40,036][52060] Updated weights for policy 0, policy_version 240 (0.0009) [2023-10-07 23:51:40,413][52060] Updated weights for policy 0, policy_version 250 (0.0008) [2023-10-07 23:51:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 12280.4, 300 sec: 12280.4). Total num frames: 524288. Throughput: 0: 1563.1, 1: 1569.3. Samples: 133732. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-07 23:51:41,211][50642] Avg episode reward: [(0, '1.050'), (1, '0.820')] [2023-10-07 23:51:41,212][51605] Saving new best policy, reward=1.050! [2023-10-07 23:51:41,212][51710] Saving new best policy, reward=0.820! [2023-10-07 23:51:43,753][52059] Updated weights for policy 1, policy_version 260 (0.0008) [2023-10-07 23:51:44,125][52059] Updated weights for policy 1, policy_version 270 (0.0010) [2023-10-07 23:51:44,381][52060] Updated weights for policy 0, policy_version 260 (0.0008) [2023-10-07 23:51:44,503][52059] Updated weights for policy 1, policy_version 280 (0.0010) [2023-10-07 23:51:44,749][52060] Updated weights for policy 0, policy_version 270 (0.0009) [2023-10-07 23:51:45,118][52060] Updated weights for policy 0, policy_version 280 (0.0008) [2023-10-07 23:51:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 12367.0, 300 sec: 12367.0). Total num frames: 589824. Throughput: 0: 1664.3, 1: 1690.0. Samples: 154016. Policy #0 lag: (min: 31.0, avg: 37.0, max: 63.0) [2023-10-07 23:51:46,211][50642] Avg episode reward: [(0, '1.100'), (1, '0.790')] [2023-10-07 23:51:46,220][51605] Saving new best policy, reward=1.100! [2023-10-07 23:51:48,536][52059] Updated weights for policy 1, policy_version 290 (0.0008) [2023-10-07 23:51:48,920][52059] Updated weights for policy 1, policy_version 300 (0.0010) [2023-10-07 23:51:49,292][52059] Updated weights for policy 1, policy_version 310 (0.0009) [2023-10-07 23:51:49,308][52060] Updated weights for policy 0, policy_version 290 (0.0009) [2023-10-07 23:51:49,659][52059] Updated weights for policy 1, policy_version 320 (0.0009) [2023-10-07 23:51:49,695][52060] Updated weights for policy 0, policy_version 300 (0.0008) [2023-10-07 23:51:50,066][52060] Updated weights for policy 0, policy_version 310 (0.0008) [2023-10-07 23:51:50,432][52060] Updated weights for policy 0, policy_version 320 (0.0009) [2023-10-07 23:51:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 12437.2, 300 sec: 12437.2). Total num frames: 655360. Throughput: 0: 1667.8, 1: 1681.2. Samples: 165422. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:51:51,211][50642] Avg episode reward: [(0, '1.260'), (1, '0.860')] [2023-10-07 23:51:51,213][51710] Saving new best policy, reward=0.860! [2023-10-07 23:51:51,213][51605] Saving new best policy, reward=1.260! [2023-10-07 23:51:53,541][52059] Updated weights for policy 1, policy_version 330 (0.0008) [2023-10-07 23:51:53,910][52059] Updated weights for policy 1, policy_version 340 (0.0009) [2023-10-07 23:51:54,271][52059] Updated weights for policy 1, policy_version 350 (0.0009) [2023-10-07 23:51:54,491][52060] Updated weights for policy 0, policy_version 330 (0.0008) [2023-10-07 23:51:54,853][52060] Updated weights for policy 0, policy_version 340 (0.0009) [2023-10-07 23:51:55,222][52060] Updated weights for policy 0, policy_version 350 (0.0009) [2023-10-07 23:51:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 12495.3, 300 sec: 12495.3). Total num frames: 720896. Throughput: 0: 1700.0, 1: 1714.7. Samples: 184978. Policy #0 lag: (min: 26.0, avg: 32.5, max: 58.0) [2023-10-07 23:51:56,211][50642] Avg episode reward: [(0, '2.080'), (1, '0.960')] [2023-10-07 23:51:56,213][51605] Saving new best policy, reward=2.080! [2023-10-07 23:51:56,213][51710] Saving new best policy, reward=0.960! [2023-10-07 23:51:58,174][52059] Updated weights for policy 1, policy_version 360 (0.0007) [2023-10-07 23:51:58,535][52059] Updated weights for policy 1, policy_version 370 (0.0008) [2023-10-07 23:51:58,901][52059] Updated weights for policy 1, policy_version 380 (0.0009) [2023-10-07 23:51:59,078][52060] Updated weights for policy 0, policy_version 360 (0.0008) [2023-10-07 23:51:59,453][52060] Updated weights for policy 0, policy_version 370 (0.0010) [2023-10-07 23:51:59,826][52060] Updated weights for policy 0, policy_version 380 (0.0007) [2023-10-07 23:52:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 12544.1). Total num frames: 786432. Throughput: 0: 1691.3, 1: 1724.9. Samples: 205864. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:52:01,211][50642] Avg episode reward: [(0, '2.540'), (1, '1.120')] [2023-10-07 23:52:01,219][51605] Saving new best policy, reward=2.540! [2023-10-07 23:52:01,220][51710] Saving new best policy, reward=1.120! [2023-10-07 23:52:02,849][52059] Updated weights for policy 1, policy_version 390 (0.0009) [2023-10-07 23:52:03,213][52059] Updated weights for policy 1, policy_version 400 (0.0008) [2023-10-07 23:52:03,572][52059] Updated weights for policy 1, policy_version 410 (0.0007) [2023-10-07 23:52:03,742][52060] Updated weights for policy 0, policy_version 390 (0.0008) [2023-10-07 23:52:04,105][52060] Updated weights for policy 0, policy_version 400 (0.0010) [2023-10-07 23:52:04,483][52060] Updated weights for policy 0, policy_version 410 (0.0008) [2023-10-07 23:52:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 12585.7). Total num frames: 851968. Throughput: 0: 1712.1, 1: 1712.0. Samples: 216170. Policy #0 lag: (min: 22.0, avg: 24.3, max: 54.0) [2023-10-07 23:52:06,211][50642] Avg episode reward: [(0, '2.590'), (1, '1.700')] [2023-10-07 23:52:06,211][51605] Saving new best policy, reward=2.590! [2023-10-07 23:52:06,212][51710] Saving new best policy, reward=1.700! [2023-10-07 23:52:07,499][52059] Updated weights for policy 1, policy_version 420 (0.0008) [2023-10-07 23:52:07,860][52059] Updated weights for policy 1, policy_version 430 (0.0007) [2023-10-07 23:52:08,227][52059] Updated weights for policy 1, policy_version 440 (0.0009) [2023-10-07 23:52:08,602][52060] Updated weights for policy 0, policy_version 420 (0.0009) [2023-10-07 23:52:08,974][52060] Updated weights for policy 0, policy_version 430 (0.0008) [2023-10-07 23:52:09,334][52060] Updated weights for policy 0, policy_version 440 (0.0009) [2023-10-07 23:52:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 12621.6). Total num frames: 917504. Throughput: 0: 1684.7, 1: 1719.2. Samples: 236412. Policy #0 lag: (min: 31.0, avg: 31.1, max: 37.0) [2023-10-07 23:52:11,211][50642] Avg episode reward: [(0, '2.670'), (1, '2.300')] [2023-10-07 23:52:11,212][51710] Saving new best policy, reward=2.300! [2023-10-07 23:52:11,212][51605] Saving new best policy, reward=2.670! [2023-10-07 23:52:12,325][52059] Updated weights for policy 1, policy_version 450 (0.0009) [2023-10-07 23:52:12,679][52059] Updated weights for policy 1, policy_version 460 (0.0009) [2023-10-07 23:52:13,041][52059] Updated weights for policy 1, policy_version 470 (0.0009) [2023-10-07 23:52:13,408][52059] Updated weights for policy 1, policy_version 480 (0.0009) [2023-10-07 23:52:13,423][52060] Updated weights for policy 0, policy_version 450 (0.0009) [2023-10-07 23:52:13,804][52060] Updated weights for policy 0, policy_version 460 (0.0009) [2023-10-07 23:52:14,168][52060] Updated weights for policy 0, policy_version 470 (0.0011) [2023-10-07 23:52:14,539][52060] Updated weights for policy 0, policy_version 480 (0.0009) [2023-10-07 23:52:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 12652.8). Total num frames: 983040. Throughput: 0: 1698.5, 1: 1743.5. Samples: 257280. Policy #0 lag: (min: 1.0, avg: 5.3, max: 33.0) [2023-10-07 23:52:16,211][50642] Avg episode reward: [(0, '2.870'), (1, '2.650')] [2023-10-07 23:52:16,217][51605] Saving new best policy, reward=2.870! [2023-10-07 23:52:16,218][51710] Saving new best policy, reward=2.650! [2023-10-07 23:52:17,283][52059] Updated weights for policy 1, policy_version 490 (0.0007) [2023-10-07 23:52:17,641][52059] Updated weights for policy 1, policy_version 500 (0.0008) [2023-10-07 23:52:18,019][52059] Updated weights for policy 1, policy_version 510 (0.0007) [2023-10-07 23:52:18,614][52060] Updated weights for policy 0, policy_version 490 (0.0009) [2023-10-07 23:52:18,984][52060] Updated weights for policy 0, policy_version 500 (0.0011) [2023-10-07 23:52:19,365][52060] Updated weights for policy 0, policy_version 510 (0.0007) [2023-10-07 23:52:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 12680.3). Total num frames: 1048576. Throughput: 0: 1693.6, 1: 1708.7. Samples: 267258. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) [2023-10-07 23:52:21,211][50642] Avg episode reward: [(0, '3.100'), (1, '2.900')] [2023-10-07 23:52:21,212][51605] Saving new best policy, reward=3.100! [2023-10-07 23:52:21,213][51710] Saving new best policy, reward=2.900! [2023-10-07 23:52:21,897][52059] Updated weights for policy 1, policy_version 520 (0.0008) [2023-10-07 23:52:22,274][52059] Updated weights for policy 1, policy_version 530 (0.0007) [2023-10-07 23:52:22,640][52059] Updated weights for policy 1, policy_version 540 (0.0007) [2023-10-07 23:52:23,545][52060] Updated weights for policy 0, policy_version 520 (0.0009) [2023-10-07 23:52:23,912][52060] Updated weights for policy 0, policy_version 530 (0.0010) [2023-10-07 23:52:24,277][52060] Updated weights for policy 0, policy_version 540 (0.0008) [2023-10-07 23:52:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 12704.6). Total num frames: 1114112. Throughput: 0: 1684.1, 1: 1745.0. Samples: 288042. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:52:26,212][50642] Avg episode reward: [(0, '3.350'), (1, '3.480')] [2023-10-07 23:52:26,213][51605] Saving new best policy, reward=3.350! [2023-10-07 23:52:26,213][51710] Saving new best policy, reward=3.480! [2023-10-07 23:52:26,594][52059] Updated weights for policy 1, policy_version 550 (0.0010) [2023-10-07 23:52:26,955][52059] Updated weights for policy 1, policy_version 560 (0.0009) [2023-10-07 23:52:27,325][52059] Updated weights for policy 1, policy_version 570 (0.0008) [2023-10-07 23:52:28,196][52060] Updated weights for policy 0, policy_version 550 (0.0008) [2023-10-07 23:52:28,559][52060] Updated weights for policy 0, policy_version 560 (0.0007) [2023-10-07 23:52:28,935][52060] Updated weights for policy 0, policy_version 570 (0.0008) [2023-10-07 23:52:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 12726.4). Total num frames: 1179648. Throughput: 0: 1702.5, 1: 1741.6. Samples: 309002. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:52:31,211][50642] Avg episode reward: [(0, '3.380'), (1, '3.710')] [2023-10-07 23:52:31,224][51605] Saving new best policy, reward=3.380! [2023-10-07 23:52:31,358][52059] Updated weights for policy 1, policy_version 580 (0.0009) [2023-10-07 23:52:31,726][52059] Updated weights for policy 1, policy_version 590 (0.0008) [2023-10-07 23:52:32,082][52059] Updated weights for policy 1, policy_version 600 (0.0007) [2023-10-07 23:52:32,373][51710] Saving new best policy, reward=3.710! [2023-10-07 23:52:32,857][52060] Updated weights for policy 0, policy_version 580 (0.0010) [2023-10-07 23:52:33,223][52060] Updated weights for policy 0, policy_version 590 (0.0009) [2023-10-07 23:52:33,589][52060] Updated weights for policy 0, policy_version 600 (0.0009) [2023-10-07 23:52:36,107][52059] Updated weights for policy 1, policy_version 610 (0.0009) [2023-10-07 23:52:36,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 12745.9). Total num frames: 1245184. Throughput: 0: 1677.8, 1: 1726.2. Samples: 318600. Policy #0 lag: (min: 17.0, avg: 38.8, max: 49.0) [2023-10-07 23:52:36,211][50642] Avg episode reward: [(0, '3.110'), (1, '3.690')] [2023-10-07 23:52:36,496][52059] Updated weights for policy 1, policy_version 620 (0.0008) [2023-10-07 23:52:36,861][52059] Updated weights for policy 1, policy_version 630 (0.0009) [2023-10-07 23:52:37,225][52059] Updated weights for policy 1, policy_version 640 (0.0009) [2023-10-07 23:52:37,561][52060] Updated weights for policy 0, policy_version 610 (0.0010) [2023-10-07 23:52:37,969][52060] Updated weights for policy 0, policy_version 620 (0.0010) [2023-10-07 23:52:38,331][52060] Updated weights for policy 0, policy_version 630 (0.0008) [2023-10-07 23:52:38,709][52060] Updated weights for policy 0, policy_version 640 (0.0007) [2023-10-07 23:52:41,092][52059] Updated weights for policy 1, policy_version 650 (0.0009) [2023-10-07 23:52:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 12763.5). Total num frames: 1310720. Throughput: 0: 1688.5, 1: 1743.2. Samples: 339406. Policy #0 lag: (min: 28.0, avg: 28.0, max: 28.0) [2023-10-07 23:52:41,211][50642] Avg episode reward: [(0, '3.460'), (1, '3.590')] [2023-10-07 23:52:41,211][51605] Saving new best policy, reward=3.460! [2023-10-07 23:52:41,465][52059] Updated weights for policy 1, policy_version 660 (0.0009) [2023-10-07 23:52:41,827][52059] Updated weights for policy 1, policy_version 670 (0.0008) [2023-10-07 23:52:42,690][52060] Updated weights for policy 0, policy_version 650 (0.0007) [2023-10-07 23:52:43,059][52060] Updated weights for policy 0, policy_version 660 (0.0007) [2023-10-07 23:52:43,429][52060] Updated weights for policy 0, policy_version 670 (0.0007) [2023-10-07 23:52:45,519][52059] Updated weights for policy 1, policy_version 680 (0.0009) [2023-10-07 23:52:45,887][52059] Updated weights for policy 1, policy_version 690 (0.0009) [2023-10-07 23:52:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 12779.4). Total num frames: 1376256. Throughput: 0: 1703.2, 1: 1735.4. Samples: 360600. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:52:46,211][50642] Avg episode reward: [(0, '3.430'), (1, '3.720')] [2023-10-07 23:52:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000000672_688128.pth... [2023-10-07 23:52:46,255][52059] Updated weights for policy 1, policy_version 700 (0.0009) [2023-10-07 23:52:46,397][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000000704_720896.pth... [2023-10-07 23:52:46,436][51710] Saving new best policy, reward=3.720! [2023-10-07 23:52:47,282][52060] Updated weights for policy 0, policy_version 680 (0.0007) [2023-10-07 23:52:47,645][52060] Updated weights for policy 0, policy_version 690 (0.0008) [2023-10-07 23:52:48,010][52060] Updated weights for policy 0, policy_version 700 (0.0008) [2023-10-07 23:52:50,147][52059] Updated weights for policy 1, policy_version 710 (0.0008) [2023-10-07 23:52:50,503][52059] Updated weights for policy 1, policy_version 720 (0.0009) [2023-10-07 23:52:50,867][52059] Updated weights for policy 1, policy_version 730 (0.0011) [2023-10-07 23:52:51,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13084.7). Total num frames: 1474560. Throughput: 0: 1684.8, 1: 1748.6. Samples: 370676. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) [2023-10-07 23:52:51,211][50642] Avg episode reward: [(0, '4.130'), (1, '3.650')] [2023-10-07 23:52:51,211][51605] Saving new best policy, reward=4.130! [2023-10-07 23:52:52,031][52060] Updated weights for policy 0, policy_version 710 (0.0009) [2023-10-07 23:52:52,403][52060] Updated weights for policy 0, policy_version 720 (0.0008) [2023-10-07 23:52:52,773][52060] Updated weights for policy 0, policy_version 730 (0.0007) [2023-10-07 23:52:54,923][52059] Updated weights for policy 1, policy_version 740 (0.0009) [2023-10-07 23:52:55,290][52059] Updated weights for policy 1, policy_version 750 (0.0007) [2023-10-07 23:52:55,650][52059] Updated weights for policy 1, policy_version 760 (0.0008) [2023-10-07 23:52:56,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13085.7). Total num frames: 1540096. Throughput: 0: 1710.6, 1: 1743.0. Samples: 391824. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-07 23:52:56,211][50642] Avg episode reward: [(0, '4.270'), (1, '4.040')] [2023-10-07 23:52:56,213][51605] Saving new best policy, reward=4.270! [2023-10-07 23:52:56,213][51710] Saving new best policy, reward=4.040! [2023-10-07 23:52:56,753][52060] Updated weights for policy 0, policy_version 740 (0.0007) [2023-10-07 23:52:57,116][52060] Updated weights for policy 0, policy_version 750 (0.0007) [2023-10-07 23:52:57,490][52060] Updated weights for policy 0, policy_version 760 (0.0007) [2023-10-07 23:52:59,468][52059] Updated weights for policy 1, policy_version 770 (0.0008) [2023-10-07 23:52:59,824][52059] Updated weights for policy 1, policy_version 780 (0.0009) [2023-10-07 23:53:00,194][52059] Updated weights for policy 1, policy_version 790 (0.0007) [2023-10-07 23:53:00,555][52059] Updated weights for policy 1, policy_version 800 (0.0009) [2023-10-07 23:53:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13086.6). Total num frames: 1605632. Throughput: 0: 1715.4, 1: 1723.3. Samples: 412022. Policy #0 lag: (min: 31.0, avg: 35.6, max: 63.0) [2023-10-07 23:53:01,211][50642] Avg episode reward: [(0, '4.030'), (1, '4.500')] [2023-10-07 23:53:01,219][51710] Saving new best policy, reward=4.500! [2023-10-07 23:53:01,585][52060] Updated weights for policy 0, policy_version 770 (0.0008) [2023-10-07 23:53:01,958][52060] Updated weights for policy 0, policy_version 780 (0.0010) [2023-10-07 23:53:02,330][52060] Updated weights for policy 0, policy_version 790 (0.0011) [2023-10-07 23:53:02,712][52060] Updated weights for policy 0, policy_version 800 (0.0010) [2023-10-07 23:53:04,488][52059] Updated weights for policy 1, policy_version 810 (0.0007) [2023-10-07 23:53:04,858][52059] Updated weights for policy 1, policy_version 820 (0.0007) [2023-10-07 23:53:05,226][52059] Updated weights for policy 1, policy_version 830 (0.0007) [2023-10-07 23:53:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13087.4). Total num frames: 1671168. Throughput: 0: 1696.4, 1: 1754.9. Samples: 422566. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) [2023-10-07 23:53:06,211][50642] Avg episode reward: [(0, '4.280'), (1, '4.530')] [2023-10-07 23:53:06,212][51605] Saving new best policy, reward=4.280! [2023-10-07 23:53:06,212][51710] Saving new best policy, reward=4.530! [2023-10-07 23:53:06,713][52060] Updated weights for policy 0, policy_version 810 (0.0009) [2023-10-07 23:53:07,082][52060] Updated weights for policy 0, policy_version 820 (0.0007) [2023-10-07 23:53:07,449][52060] Updated weights for policy 0, policy_version 830 (0.0009) [2023-10-07 23:53:09,187][52059] Updated weights for policy 1, policy_version 840 (0.0007) [2023-10-07 23:53:09,548][52059] Updated weights for policy 1, policy_version 850 (0.0009) [2023-10-07 23:53:09,924][52059] Updated weights for policy 1, policy_version 860 (0.0009) [2023-10-07 23:53:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13088.1). Total num frames: 1736704. Throughput: 0: 1719.8, 1: 1721.8. Samples: 442912. Policy #0 lag: (min: 12.0, avg: 12.0, max: 12.0) [2023-10-07 23:53:11,211][50642] Avg episode reward: [(0, '4.550'), (1, '4.180')] [2023-10-07 23:53:11,347][52060] Updated weights for policy 0, policy_version 840 (0.0008) [2023-10-07 23:53:11,721][52060] Updated weights for policy 0, policy_version 850 (0.0009) [2023-10-07 23:53:12,098][52060] Updated weights for policy 0, policy_version 860 (0.0007) [2023-10-07 23:53:12,238][51605] Saving new best policy, reward=4.550! [2023-10-07 23:53:13,793][52059] Updated weights for policy 1, policy_version 870 (0.0007) [2023-10-07 23:53:14,156][52059] Updated weights for policy 1, policy_version 880 (0.0008) [2023-10-07 23:53:14,528][52059] Updated weights for policy 1, policy_version 890 (0.0009) [2023-10-07 23:53:15,987][52060] Updated weights for policy 0, policy_version 870 (0.0009) [2023-10-07 23:53:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13088.8). Total num frames: 1802240. Throughput: 0: 1720.2, 1: 1723.2. Samples: 463954. Policy #0 lag: (min: 8.0, avg: 30.3, max: 40.0) [2023-10-07 23:53:16,211][50642] Avg episode reward: [(0, '4.470'), (1, '3.980')] [2023-10-07 23:53:16,355][52060] Updated weights for policy 0, policy_version 880 (0.0007) [2023-10-07 23:53:16,728][52060] Updated weights for policy 0, policy_version 890 (0.0008) [2023-10-07 23:53:18,533][52059] Updated weights for policy 1, policy_version 900 (0.0009) [2023-10-07 23:53:18,891][52059] Updated weights for policy 1, policy_version 910 (0.0010) [2023-10-07 23:53:19,261][52059] Updated weights for policy 1, policy_version 920 (0.0011) [2023-10-07 23:53:20,780][52060] Updated weights for policy 0, policy_version 900 (0.0009) [2023-10-07 23:53:21,151][52060] Updated weights for policy 0, policy_version 910 (0.0007) [2023-10-07 23:53:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13089.5). Total num frames: 1867776. Throughput: 0: 1721.6, 1: 1737.8. Samples: 474270. Policy #0 lag: (min: 19.0, avg: 45.0, max: 48.0) [2023-10-07 23:53:21,211][50642] Avg episode reward: [(0, '4.570'), (1, '4.490')] [2023-10-07 23:53:21,523][52060] Updated weights for policy 0, policy_version 920 (0.0008) [2023-10-07 23:53:21,818][51605] Saving new best policy, reward=4.570! [2023-10-07 23:53:23,246][52059] Updated weights for policy 1, policy_version 930 (0.0009) [2023-10-07 23:53:23,622][52059] Updated weights for policy 1, policy_version 940 (0.0010) [2023-10-07 23:53:23,990][52059] Updated weights for policy 1, policy_version 950 (0.0009) [2023-10-07 23:53:24,356][52059] Updated weights for policy 1, policy_version 960 (0.0011) [2023-10-07 23:53:25,484][52060] Updated weights for policy 0, policy_version 930 (0.0010) [2023-10-07 23:53:25,878][52060] Updated weights for policy 0, policy_version 940 (0.0009) [2023-10-07 23:53:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13090.1). Total num frames: 1933312. Throughput: 0: 1728.1, 1: 1719.6. Samples: 494552. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-07 23:53:26,211][50642] Avg episode reward: [(0, '4.540'), (1, '4.440')] [2023-10-07 23:53:26,252][52060] Updated weights for policy 0, policy_version 950 (0.0008) [2023-10-07 23:53:26,629][52060] Updated weights for policy 0, policy_version 960 (0.0008) [2023-10-07 23:53:28,364][52059] Updated weights for policy 1, policy_version 970 (0.0008) [2023-10-07 23:53:28,731][52059] Updated weights for policy 1, policy_version 980 (0.0007) [2023-10-07 23:53:29,086][52059] Updated weights for policy 1, policy_version 990 (0.0007) [2023-10-07 23:53:30,523][52060] Updated weights for policy 0, policy_version 970 (0.0009) [2023-10-07 23:53:30,898][52060] Updated weights for policy 0, policy_version 980 (0.0008) [2023-10-07 23:53:31,211][50642] Fps is (10 sec: 13106.7, 60 sec: 13653.3, 300 sec: 13090.6). Total num frames: 1998848. Throughput: 0: 1706.4, 1: 1724.7. Samples: 515000. Policy #0 lag: (min: 31.0, avg: 32.7, max: 59.0) [2023-10-07 23:53:31,212][50642] Avg episode reward: [(0, '4.180'), (1, '4.690')] [2023-10-07 23:53:31,221][51710] Saving new best policy, reward=4.690! [2023-10-07 23:53:31,271][52060] Updated weights for policy 0, policy_version 990 (0.0009) [2023-10-07 23:53:32,874][52059] Updated weights for policy 1, policy_version 1000 (0.0008) [2023-10-07 23:53:33,237][52059] Updated weights for policy 1, policy_version 1010 (0.0009) [2023-10-07 23:53:33,595][52059] Updated weights for policy 1, policy_version 1020 (0.0008) [2023-10-07 23:53:35,213][52060] Updated weights for policy 0, policy_version 1000 (0.0009) [2023-10-07 23:53:35,584][52060] Updated weights for policy 0, policy_version 1010 (0.0008) [2023-10-07 23:53:35,961][52060] Updated weights for policy 0, policy_version 1020 (0.0008) [2023-10-07 23:53:36,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13298.9). Total num frames: 2097152. Throughput: 0: 1720.4, 1: 1709.9. Samples: 525038. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-07 23:53:36,211][50642] Avg episode reward: [(0, '4.260'), (1, '4.850')] [2023-10-07 23:53:36,212][51710] Saving new best policy, reward=4.850! [2023-10-07 23:53:37,669][52059] Updated weights for policy 1, policy_version 1032 (0.0009) [2023-10-07 23:53:38,031][52059] Updated weights for policy 1, policy_version 1042 (0.0009) [2023-10-07 23:53:38,402][52059] Updated weights for policy 1, policy_version 1052 (0.0009) [2023-10-07 23:53:39,876][52060] Updated weights for policy 0, policy_version 1030 (0.0007) [2023-10-07 23:53:40,247][52060] Updated weights for policy 0, policy_version 1040 (0.0008) [2023-10-07 23:53:40,607][52060] Updated weights for policy 0, policy_version 1050 (0.0009) [2023-10-07 23:53:41,210][50642] Fps is (10 sec: 16384.6, 60 sec: 14199.5, 300 sec: 13293.0). Total num frames: 2162688. Throughput: 0: 1715.2, 1: 1710.0. Samples: 545954. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-07 23:53:41,211][50642] Avg episode reward: [(0, '4.520'), (1, '4.620')] [2023-10-07 23:53:42,412][52059] Updated weights for policy 1, policy_version 1062 (0.0008) [2023-10-07 23:53:42,760][52059] Updated weights for policy 1, policy_version 1072 (0.0009) [2023-10-07 23:53:43,128][52059] Updated weights for policy 1, policy_version 1082 (0.0009) [2023-10-07 23:53:44,558][52060] Updated weights for policy 0, policy_version 1060 (0.0008) [2023-10-07 23:53:44,916][52060] Updated weights for policy 0, policy_version 1070 (0.0008) [2023-10-07 23:53:45,287][52060] Updated weights for policy 0, policy_version 1080 (0.0008) [2023-10-07 23:53:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13287.5). Total num frames: 2228224. Throughput: 0: 1694.1, 1: 1732.4. Samples: 566214. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:53:46,211][50642] Avg episode reward: [(0, '4.530'), (1, '4.180')] [2023-10-07 23:53:47,136][52059] Updated weights for policy 1, policy_version 1092 (0.0009) [2023-10-07 23:53:47,499][52059] Updated weights for policy 1, policy_version 1102 (0.0007) [2023-10-07 23:53:47,866][52059] Updated weights for policy 1, policy_version 1112 (0.0008) [2023-10-07 23:53:49,270][52060] Updated weights for policy 0, policy_version 1090 (0.0009) [2023-10-07 23:53:49,644][52060] Updated weights for policy 0, policy_version 1100 (0.0008) [2023-10-07 23:53:50,019][52060] Updated weights for policy 0, policy_version 1110 (0.0007) [2023-10-07 23:53:50,385][52060] Updated weights for policy 0, policy_version 1120 (0.0011) [2023-10-07 23:53:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13282.3). Total num frames: 2293760. Throughput: 0: 1727.4, 1: 1703.2. Samples: 576946. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-10-07 23:53:51,211][50642] Avg episode reward: [(0, '4.480'), (1, '4.090')] [2023-10-07 23:53:51,938][52059] Updated weights for policy 1, policy_version 1122 (0.0008) [2023-10-07 23:53:52,299][52059] Updated weights for policy 1, policy_version 1132 (0.0008) [2023-10-07 23:53:52,660][52059] Updated weights for policy 1, policy_version 1142 (0.0009) [2023-10-07 23:53:53,021][52059] Updated weights for policy 1, policy_version 1152 (0.0008) [2023-10-07 23:53:54,303][52060] Updated weights for policy 0, policy_version 1130 (0.0009) [2023-10-07 23:53:54,676][52060] Updated weights for policy 0, policy_version 1140 (0.0008) [2023-10-07 23:53:55,042][52060] Updated weights for policy 0, policy_version 1150 (0.0007) [2023-10-07 23:53:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13277.4). Total num frames: 2359296. Throughput: 0: 1699.2, 1: 1734.5. Samples: 597428. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-07 23:53:56,211][50642] Avg episode reward: [(0, '4.890'), (1, '4.660')] [2023-10-07 23:53:56,213][51605] Saving new best policy, reward=4.890! [2023-10-07 23:53:56,868][52059] Updated weights for policy 1, policy_version 1162 (0.0008) [2023-10-07 23:53:57,222][52059] Updated weights for policy 1, policy_version 1172 (0.0008) [2023-10-07 23:53:57,579][52059] Updated weights for policy 1, policy_version 1182 (0.0010) [2023-10-07 23:53:59,144][52060] Updated weights for policy 0, policy_version 1160 (0.0007) [2023-10-07 23:53:59,522][52060] Updated weights for policy 0, policy_version 1170 (0.0007) [2023-10-07 23:53:59,884][52060] Updated weights for policy 0, policy_version 1180 (0.0010) [2023-10-07 23:54:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13272.7). Total num frames: 2424832. Throughput: 0: 1693.1, 1: 1737.3. Samples: 618322. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:54:01,211][50642] Avg episode reward: [(0, '4.860'), (1, '4.930')] [2023-10-07 23:54:01,537][52059] Updated weights for policy 1, policy_version 1192 (0.0008) [2023-10-07 23:54:01,908][52059] Updated weights for policy 1, policy_version 1202 (0.0008) [2023-10-07 23:54:02,264][52059] Updated weights for policy 1, policy_version 1212 (0.0008) [2023-10-07 23:54:02,413][51710] Saving new best policy, reward=4.930! [2023-10-07 23:54:03,983][52060] Updated weights for policy 0, policy_version 1190 (0.0011) [2023-10-07 23:54:04,358][52060] Updated weights for policy 0, policy_version 1200 (0.0009) [2023-10-07 23:54:04,720][52060] Updated weights for policy 0, policy_version 1210 (0.0010) [2023-10-07 23:54:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13268.3). Total num frames: 2490368. Throughput: 0: 1720.0, 1: 1717.7. Samples: 628968. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-07 23:54:06,211][50642] Avg episode reward: [(0, '4.660'), (1, '4.520')] [2023-10-07 23:54:06,255][52059] Updated weights for policy 1, policy_version 1222 (0.0008) [2023-10-07 23:54:06,620][52059] Updated weights for policy 1, policy_version 1232 (0.0010) [2023-10-07 23:54:06,984][52059] Updated weights for policy 1, policy_version 1242 (0.0011) [2023-10-07 23:54:08,685][52060] Updated weights for policy 0, policy_version 1220 (0.0008) [2023-10-07 23:54:09,054][52060] Updated weights for policy 0, policy_version 1230 (0.0007) [2023-10-07 23:54:09,432][52060] Updated weights for policy 0, policy_version 1240 (0.0008) [2023-10-07 23:54:11,050][52059] Updated weights for policy 1, policy_version 1252 (0.0011) [2023-10-07 23:54:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13264.1). Total num frames: 2555904. Throughput: 0: 1692.2, 1: 1736.3. Samples: 648832. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:54:11,211][50642] Avg episode reward: [(0, '4.300'), (1, '4.750')] [2023-10-07 23:54:11,402][52059] Updated weights for policy 1, policy_version 1262 (0.0008) [2023-10-07 23:54:11,771][52059] Updated weights for policy 1, policy_version 1272 (0.0007) [2023-10-07 23:54:13,481][52060] Updated weights for policy 0, policy_version 1250 (0.0009) [2023-10-07 23:54:13,882][52060] Updated weights for policy 0, policy_version 1260 (0.0007) [2023-10-07 23:54:14,244][52060] Updated weights for policy 0, policy_version 1270 (0.0009) [2023-10-07 23:54:14,613][52060] Updated weights for policy 0, policy_version 1280 (0.0011) [2023-10-07 23:54:15,746][52059] Updated weights for policy 1, policy_version 1282 (0.0008) [2023-10-07 23:54:16,141][52059] Updated weights for policy 1, policy_version 1292 (0.0008) [2023-10-07 23:54:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13260.1). Total num frames: 2621440. Throughput: 0: 1703.2, 1: 1729.6. Samples: 669474. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:54:16,211][50642] Avg episode reward: [(0, '4.120'), (1, '4.830')] [2023-10-07 23:54:16,507][52059] Updated weights for policy 1, policy_version 1302 (0.0010) [2023-10-07 23:54:16,876][52059] Updated weights for policy 1, policy_version 1312 (0.0009) [2023-10-07 23:54:18,329][52060] Updated weights for policy 0, policy_version 1290 (0.0008) [2023-10-07 23:54:18,690][52060] Updated weights for policy 0, policy_version 1300 (0.0008) [2023-10-07 23:54:19,067][52060] Updated weights for policy 0, policy_version 1310 (0.0008) [2023-10-07 23:54:20,794][52059] Updated weights for policy 1, policy_version 1322 (0.0008) [2023-10-07 23:54:21,161][52059] Updated weights for policy 1, policy_version 1332 (0.0008) [2023-10-07 23:54:21,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13256.4). Total num frames: 2686976. Throughput: 0: 1698.4, 1: 1733.5. Samples: 679474. Policy #0 lag: (min: 26.0, avg: 28.2, max: 51.0) [2023-10-07 23:54:21,211][50642] Avg episode reward: [(0, '4.410'), (1, '4.570')] [2023-10-07 23:54:21,525][52059] Updated weights for policy 1, policy_version 1342 (0.0007) [2023-10-07 23:54:23,073][52060] Updated weights for policy 0, policy_version 1320 (0.0008) [2023-10-07 23:54:23,432][52060] Updated weights for policy 0, policy_version 1330 (0.0007) [2023-10-07 23:54:23,806][52060] Updated weights for policy 0, policy_version 1340 (0.0008) [2023-10-07 23:54:25,527][52059] Updated weights for policy 1, policy_version 1352 (0.0008) [2023-10-07 23:54:25,889][52059] Updated weights for policy 1, policy_version 1362 (0.0008) [2023-10-07 23:54:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13252.8). Total num frames: 2752512. Throughput: 0: 1700.6, 1: 1733.1. Samples: 700470. Policy #0 lag: (min: 28.0, avg: 31.1, max: 60.0) [2023-10-07 23:54:26,211][50642] Avg episode reward: [(0, '4.710'), (1, '4.850')] [2023-10-07 23:54:26,253][52059] Updated weights for policy 1, policy_version 1372 (0.0011) [2023-10-07 23:54:27,752][52060] Updated weights for policy 0, policy_version 1350 (0.0009) [2023-10-07 23:54:28,132][52060] Updated weights for policy 0, policy_version 1360 (0.0009) [2023-10-07 23:54:28,504][52060] Updated weights for policy 0, policy_version 1370 (0.0009) [2023-10-07 23:54:30,142][52059] Updated weights for policy 1, policy_version 1382 (0.0009) [2023-10-07 23:54:30,510][52059] Updated weights for policy 1, policy_version 1392 (0.0008) [2023-10-07 23:54:30,865][52059] Updated weights for policy 1, policy_version 1402 (0.0007) [2023-10-07 23:54:31,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.6, 300 sec: 13403.4). Total num frames: 2850816. Throughput: 0: 1721.6, 1: 1709.2. Samples: 720598. Policy #0 lag: (min: 10.0, avg: 17.9, max: 42.0) [2023-10-07 23:54:31,211][50642] Avg episode reward: [(0, '4.720'), (1, '4.730')] [2023-10-07 23:54:32,481][52060] Updated weights for policy 0, policy_version 1380 (0.0007) [2023-10-07 23:54:32,861][52060] Updated weights for policy 0, policy_version 1390 (0.0008) [2023-10-07 23:54:33,221][52060] Updated weights for policy 0, policy_version 1400 (0.0010) [2023-10-07 23:54:34,816][52059] Updated weights for policy 1, policy_version 1412 (0.0007) [2023-10-07 23:54:35,183][52059] Updated weights for policy 1, policy_version 1422 (0.0007) [2023-10-07 23:54:35,559][52059] Updated weights for policy 1, policy_version 1432 (0.0011) [2023-10-07 23:54:36,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13396.6). Total num frames: 2916352. Throughput: 0: 1690.1, 1: 1728.8. Samples: 730798. Policy #0 lag: (min: 10.0, avg: 10.0, max: 10.0) [2023-10-07 23:54:36,211][50642] Avg episode reward: [(0, '4.910'), (1, '4.790')] [2023-10-07 23:54:36,212][51605] Saving new best policy, reward=4.910! [2023-10-07 23:54:37,324][52060] Updated weights for policy 0, policy_version 1410 (0.0010) [2023-10-07 23:54:37,692][52060] Updated weights for policy 0, policy_version 1420 (0.0008) [2023-10-07 23:54:38,072][52060] Updated weights for policy 0, policy_version 1430 (0.0008) [2023-10-07 23:54:38,435][52060] Updated weights for policy 0, policy_version 1440 (0.0007) [2023-10-07 23:54:39,641][52059] Updated weights for policy 1, policy_version 1442 (0.0009) [2023-10-07 23:54:40,006][52059] Updated weights for policy 1, policy_version 1452 (0.0008) [2023-10-07 23:54:40,368][52059] Updated weights for policy 1, policy_version 1462 (0.0009) [2023-10-07 23:54:40,733][52059] Updated weights for policy 1, policy_version 1472 (0.0007) [2023-10-07 23:54:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13390.1). Total num frames: 2981888. Throughput: 0: 1713.9, 1: 1711.6. Samples: 751574. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) [2023-10-07 23:54:41,211][50642] Avg episode reward: [(0, '4.670'), (1, '5.040')] [2023-10-07 23:54:41,211][51710] Saving new best policy, reward=5.040! [2023-10-07 23:54:42,329][52060] Updated weights for policy 0, policy_version 1450 (0.0010) [2023-10-07 23:54:42,704][52060] Updated weights for policy 0, policy_version 1460 (0.0010) [2023-10-07 23:54:43,065][52060] Updated weights for policy 0, policy_version 1470 (0.0011) [2023-10-07 23:54:44,665][52059] Updated weights for policy 1, policy_version 1482 (0.0008) [2023-10-07 23:54:45,040][52059] Updated weights for policy 1, policy_version 1492 (0.0008) [2023-10-07 23:54:45,405][52059] Updated weights for policy 1, policy_version 1502 (0.0008) [2023-10-07 23:54:46,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13383.9). Total num frames: 3047424. Throughput: 0: 1722.8, 1: 1690.8. Samples: 771936. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) [2023-10-07 23:54:46,211][50642] Avg episode reward: [(0, '4.900'), (1, '5.520')] [2023-10-07 23:54:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000001472_1507328.pth... [2023-10-07 23:54:46,218][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000001504_1540096.pth... [2023-10-07 23:54:46,250][51710] Saving new best policy, reward=5.520! [2023-10-07 23:54:47,095][52060] Updated weights for policy 0, policy_version 1480 (0.0011) [2023-10-07 23:54:47,470][52060] Updated weights for policy 0, policy_version 1490 (0.0008) [2023-10-07 23:54:47,831][52060] Updated weights for policy 0, policy_version 1500 (0.0008) [2023-10-07 23:54:49,428][52059] Updated weights for policy 1, policy_version 1512 (0.0008) [2023-10-07 23:54:49,794][52059] Updated weights for policy 1, policy_version 1522 (0.0009) [2023-10-07 23:54:50,157][52059] Updated weights for policy 1, policy_version 1532 (0.0008) [2023-10-07 23:54:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13378.0). Total num frames: 3112960. Throughput: 0: 1691.9, 1: 1724.0. Samples: 782684. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:54:51,211][50642] Avg episode reward: [(0, '5.110'), (1, '4.760')] [2023-10-07 23:54:51,211][51605] Saving new best policy, reward=5.110! [2023-10-07 23:54:51,745][52060] Updated weights for policy 0, policy_version 1510 (0.0008) [2023-10-07 23:54:52,113][52060] Updated weights for policy 0, policy_version 1520 (0.0007) [2023-10-07 23:54:52,491][52060] Updated weights for policy 0, policy_version 1530 (0.0007) [2023-10-07 23:54:54,179][52059] Updated weights for policy 1, policy_version 1542 (0.0008) [2023-10-07 23:54:54,553][52059] Updated weights for policy 1, policy_version 1552 (0.0008) [2023-10-07 23:54:54,918][52059] Updated weights for policy 1, policy_version 1562 (0.0008) [2023-10-07 23:54:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13372.3). Total num frames: 3178496. Throughput: 0: 1720.8, 1: 1702.3. Samples: 802872. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-07 23:54:56,211][50642] Avg episode reward: [(0, '4.670'), (1, '4.670')] [2023-10-07 23:54:56,358][52060] Updated weights for policy 0, policy_version 1540 (0.0007) [2023-10-07 23:54:56,728][52060] Updated weights for policy 0, policy_version 1550 (0.0007) [2023-10-07 23:54:57,093][52060] Updated weights for policy 0, policy_version 1560 (0.0008) [2023-10-07 23:54:58,747][52059] Updated weights for policy 1, policy_version 1572 (0.0008) [2023-10-07 23:54:59,125][52059] Updated weights for policy 1, policy_version 1582 (0.0010) [2023-10-07 23:54:59,484][52059] Updated weights for policy 1, policy_version 1592 (0.0010) [2023-10-07 23:55:01,083][52060] Updated weights for policy 0, policy_version 1570 (0.0008) [2023-10-07 23:55:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13366.8). Total num frames: 3244032. Throughput: 0: 1728.4, 1: 1702.7. Samples: 823870. Policy #0 lag: (min: 17.0, avg: 22.6, max: 49.0) [2023-10-07 23:55:01,211][50642] Avg episode reward: [(0, '4.420'), (1, '4.700')] [2023-10-07 23:55:01,468][52060] Updated weights for policy 0, policy_version 1580 (0.0008) [2023-10-07 23:55:01,835][52060] Updated weights for policy 0, policy_version 1590 (0.0009) [2023-10-07 23:55:02,205][52060] Updated weights for policy 0, policy_version 1600 (0.0009) [2023-10-07 23:55:03,426][52059] Updated weights for policy 1, policy_version 1602 (0.0007) [2023-10-07 23:55:03,830][52059] Updated weights for policy 1, policy_version 1612 (0.0009) [2023-10-07 23:55:04,201][52059] Updated weights for policy 1, policy_version 1622 (0.0010) [2023-10-07 23:55:04,556][52059] Updated weights for policy 1, policy_version 1632 (0.0008) [2023-10-07 23:55:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13361.6). Total num frames: 3309568. Throughput: 0: 1710.7, 1: 1715.6. Samples: 833654. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) [2023-10-07 23:55:06,211][50642] Avg episode reward: [(0, '4.830'), (1, '5.350')] [2023-10-07 23:55:06,257][52060] Updated weights for policy 0, policy_version 1610 (0.0007) [2023-10-07 23:55:06,635][52060] Updated weights for policy 0, policy_version 1620 (0.0007) [2023-10-07 23:55:07,002][52060] Updated weights for policy 0, policy_version 1630 (0.0009) [2023-10-07 23:55:08,327][52059] Updated weights for policy 1, policy_version 1642 (0.0008) [2023-10-07 23:55:08,701][52059] Updated weights for policy 1, policy_version 1652 (0.0008) [2023-10-07 23:55:09,067][52059] Updated weights for policy 1, policy_version 1662 (0.0008) [2023-10-07 23:55:11,053][52060] Updated weights for policy 0, policy_version 1640 (0.0008) [2023-10-07 23:55:11,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13356.5). Total num frames: 3375104. Throughput: 0: 1711.7, 1: 1707.3. Samples: 854326. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-07 23:55:11,211][50642] Avg episode reward: [(0, '4.850'), (1, '5.180')] [2023-10-07 23:55:11,423][52060] Updated weights for policy 0, policy_version 1650 (0.0011) [2023-10-07 23:55:11,794][52060] Updated weights for policy 0, policy_version 1660 (0.0009) [2023-10-07 23:55:13,041][52059] Updated weights for policy 1, policy_version 1672 (0.0008) [2023-10-07 23:55:13,409][52059] Updated weights for policy 1, policy_version 1682 (0.0007) [2023-10-07 23:55:13,766][52059] Updated weights for policy 1, policy_version 1692 (0.0009) [2023-10-07 23:55:15,901][52060] Updated weights for policy 0, policy_version 1670 (0.0008) [2023-10-07 23:55:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13351.7). Total num frames: 3440640. Throughput: 0: 1708.9, 1: 1729.0. Samples: 875304. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-07 23:55:16,211][50642] Avg episode reward: [(0, '4.820'), (1, '5.430')] [2023-10-07 23:55:16,275][52060] Updated weights for policy 0, policy_version 1680 (0.0008) [2023-10-07 23:55:16,635][52060] Updated weights for policy 0, policy_version 1690 (0.0008) [2023-10-07 23:55:17,641][52059] Updated weights for policy 1, policy_version 1702 (0.0008) [2023-10-07 23:55:18,003][52059] Updated weights for policy 1, policy_version 1712 (0.0008) [2023-10-07 23:55:18,373][52059] Updated weights for policy 1, policy_version 1722 (0.0007) [2023-10-07 23:55:20,683][52060] Updated weights for policy 0, policy_version 1700 (0.0009) [2023-10-07 23:55:21,050][52060] Updated weights for policy 0, policy_version 1710 (0.0008) [2023-10-07 23:55:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13347.0). Total num frames: 3506176. Throughput: 0: 1715.1, 1: 1713.4. Samples: 885080. Policy #0 lag: (min: 31.0, avg: 37.2, max: 63.0) [2023-10-07 23:55:21,211][50642] Avg episode reward: [(0, '5.030'), (1, '5.970')] [2023-10-07 23:55:21,213][51710] Saving new best policy, reward=5.970! [2023-10-07 23:55:21,421][52060] Updated weights for policy 0, policy_version 1720 (0.0007) [2023-10-07 23:55:22,346][52059] Updated weights for policy 1, policy_version 1732 (0.0008) [2023-10-07 23:55:22,716][52059] Updated weights for policy 1, policy_version 1742 (0.0007) [2023-10-07 23:55:23,083][52059] Updated weights for policy 1, policy_version 1752 (0.0007) [2023-10-07 23:55:25,440][52060] Updated weights for policy 0, policy_version 1730 (0.0010) [2023-10-07 23:55:25,808][52060] Updated weights for policy 0, policy_version 1740 (0.0011) [2023-10-07 23:55:26,169][52060] Updated weights for policy 0, policy_version 1750 (0.0008) [2023-10-07 23:55:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13342.6). Total num frames: 3571712. Throughput: 0: 1713.2, 1: 1723.0. Samples: 906204. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-07 23:55:26,211][50642] Avg episode reward: [(0, '5.110'), (1, '6.300')] [2023-10-07 23:55:26,212][51710] Saving new best policy, reward=6.300! [2023-10-07 23:55:26,539][52060] Updated weights for policy 0, policy_version 1760 (0.0008) [2023-10-07 23:55:27,007][52059] Updated weights for policy 1, policy_version 1762 (0.0008) [2023-10-07 23:55:27,378][52059] Updated weights for policy 1, policy_version 1772 (0.0008) [2023-10-07 23:55:27,749][52059] Updated weights for policy 1, policy_version 1782 (0.0008) [2023-10-07 23:55:28,115][52059] Updated weights for policy 1, policy_version 1792 (0.0009) [2023-10-07 23:55:30,678][52060] Updated weights for policy 0, policy_version 1770 (0.0007) [2023-10-07 23:55:31,049][52060] Updated weights for policy 0, policy_version 1780 (0.0009) [2023-10-07 23:55:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13338.2). Total num frames: 3637248. Throughput: 0: 1700.1, 1: 1739.1. Samples: 926700. Policy #0 lag: (min: 31.0, avg: 32.6, max: 59.0) [2023-10-07 23:55:31,211][50642] Avg episode reward: [(0, '5.040'), (1, '6.270')] [2023-10-07 23:55:31,415][52060] Updated weights for policy 0, policy_version 1790 (0.0007) [2023-10-07 23:55:32,235][52059] Updated weights for policy 1, policy_version 1802 (0.0007) [2023-10-07 23:55:32,608][52059] Updated weights for policy 1, policy_version 1812 (0.0010) [2023-10-07 23:55:32,984][52059] Updated weights for policy 1, policy_version 1822 (0.0007) [2023-10-07 23:55:35,386][52060] Updated weights for policy 0, policy_version 1800 (0.0007) [2023-10-07 23:55:35,767][52060] Updated weights for policy 0, policy_version 1810 (0.0008) [2023-10-07 23:55:36,133][52060] Updated weights for policy 0, policy_version 1820 (0.0009) [2023-10-07 23:55:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13334.1). Total num frames: 3702784. Throughput: 0: 1714.0, 1: 1708.6. Samples: 936698. Policy #0 lag: (min: 15.0, avg: 36.8, max: 40.0) [2023-10-07 23:55:36,211][50642] Avg episode reward: [(0, '5.510'), (1, '5.980')] [2023-10-07 23:55:36,285][51605] Saving new best policy, reward=5.510! [2023-10-07 23:55:36,826][52059] Updated weights for policy 1, policy_version 1832 (0.0008) [2023-10-07 23:55:37,183][52059] Updated weights for policy 1, policy_version 1842 (0.0008) [2023-10-07 23:55:37,550][52059] Updated weights for policy 1, policy_version 1852 (0.0008) [2023-10-07 23:55:40,006][52060] Updated weights for policy 0, policy_version 1830 (0.0009) [2023-10-07 23:55:40,371][52060] Updated weights for policy 0, policy_version 1840 (0.0008) [2023-10-07 23:55:40,747][52060] Updated weights for policy 0, policy_version 1850 (0.0009) [2023-10-07 23:55:41,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13446.0). Total num frames: 3801088. Throughput: 0: 1712.2, 1: 1735.1. Samples: 958000. Policy #0 lag: (min: 31.0, avg: 32.0, max: 52.0) [2023-10-07 23:55:41,211][50642] Avg episode reward: [(0, '5.130'), (1, '6.500')] [2023-10-07 23:55:41,497][52059] Updated weights for policy 1, policy_version 1862 (0.0009) [2023-10-07 23:55:41,868][52059] Updated weights for policy 1, policy_version 1872 (0.0009) [2023-10-07 23:55:42,224][52059] Updated weights for policy 1, policy_version 1882 (0.0008) [2023-10-07 23:55:42,445][51710] Saving new best policy, reward=6.500! [2023-10-07 23:55:44,698][52060] Updated weights for policy 0, policy_version 1860 (0.0009) [2023-10-07 23:55:45,069][52060] Updated weights for policy 0, policy_version 1870 (0.0008) [2023-10-07 23:55:45,444][52060] Updated weights for policy 0, policy_version 1880 (0.0008) [2023-10-07 23:55:46,051][52059] Updated weights for policy 1, policy_version 1892 (0.0009) [2023-10-07 23:55:46,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13440.1). Total num frames: 3866624. Throughput: 0: 1689.4, 1: 1743.5. Samples: 978350. Policy #0 lag: (min: 26.0, avg: 28.0, max: 51.0) [2023-10-07 23:55:46,211][50642] Avg episode reward: [(0, '5.330'), (1, '6.540')] [2023-10-07 23:55:46,422][52059] Updated weights for policy 1, policy_version 1902 (0.0009) [2023-10-07 23:55:46,794][52059] Updated weights for policy 1, policy_version 1912 (0.0010) [2023-10-07 23:55:47,096][51710] Saving new best policy, reward=6.540! [2023-10-07 23:55:49,355][52060] Updated weights for policy 0, policy_version 1890 (0.0007) [2023-10-07 23:55:49,767][52060] Updated weights for policy 0, policy_version 1900 (0.0007) [2023-10-07 23:55:50,141][52060] Updated weights for policy 0, policy_version 1910 (0.0009) [2023-10-07 23:55:50,516][52060] Updated weights for policy 0, policy_version 1920 (0.0008) [2023-10-07 23:55:50,767][52059] Updated weights for policy 1, policy_version 1922 (0.0009) [2023-10-07 23:55:51,182][52059] Updated weights for policy 1, policy_version 1932 (0.0011) [2023-10-07 23:55:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13434.4). Total num frames: 3932160. Throughput: 0: 1726.6, 1: 1728.5. Samples: 989132. Policy #0 lag: (min: 31.0, avg: 40.7, max: 63.0) [2023-10-07 23:55:51,211][50642] Avg episode reward: [(0, '5.450'), (1, '5.780')] [2023-10-07 23:55:51,556][52059] Updated weights for policy 1, policy_version 1942 (0.0011) [2023-10-07 23:55:51,914][52059] Updated weights for policy 1, policy_version 1952 (0.0010) [2023-10-07 23:55:54,538][52060] Updated weights for policy 0, policy_version 1930 (0.0010) [2023-10-07 23:55:54,906][52060] Updated weights for policy 0, policy_version 1940 (0.0008) [2023-10-07 23:55:55,273][52060] Updated weights for policy 0, policy_version 1950 (0.0007) [2023-10-07 23:55:55,808][52059] Updated weights for policy 1, policy_version 1962 (0.0010) [2023-10-07 23:55:56,178][52059] Updated weights for policy 1, policy_version 1972 (0.0010) [2023-10-07 23:55:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13551.5). Total num frames: 3997696. Throughput: 0: 1708.3, 1: 1741.4. Samples: 1009562. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:55:56,211][50642] Avg episode reward: [(0, '5.560'), (1, '5.970')] [2023-10-07 23:55:56,211][51605] Saving new best policy, reward=5.560! [2023-10-07 23:55:56,551][52059] Updated weights for policy 1, policy_version 1982 (0.0009) [2023-10-07 23:55:59,169][52060] Updated weights for policy 0, policy_version 1960 (0.0009) [2023-10-07 23:55:59,546][52060] Updated weights for policy 0, policy_version 1970 (0.0008) [2023-10-07 23:55:59,914][52060] Updated weights for policy 0, policy_version 1980 (0.0008) [2023-10-07 23:56:00,312][52059] Updated weights for policy 1, policy_version 1992 (0.0009) [2023-10-07 23:56:00,679][52059] Updated weights for policy 1, policy_version 2002 (0.0008) [2023-10-07 23:56:01,043][52059] Updated weights for policy 1, policy_version 2012 (0.0008) [2023-10-07 23:56:01,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 4096000. Throughput: 0: 1699.1, 1: 1727.3. Samples: 1029492. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:56:01,211][50642] Avg episode reward: [(0, '5.610'), (1, '6.330')] [2023-10-07 23:56:01,220][51605] Saving new best policy, reward=5.610! [2023-10-07 23:56:03,957][52060] Updated weights for policy 0, policy_version 1990 (0.0008) [2023-10-07 23:56:04,326][52060] Updated weights for policy 0, policy_version 2000 (0.0007) [2023-10-07 23:56:04,700][52060] Updated weights for policy 0, policy_version 2010 (0.0008) [2023-10-07 23:56:05,007][52059] Updated weights for policy 1, policy_version 2022 (0.0008) [2023-10-07 23:56:05,372][52059] Updated weights for policy 1, policy_version 2032 (0.0009) [2023-10-07 23:56:05,747][52059] Updated weights for policy 1, policy_version 2042 (0.0009) [2023-10-07 23:56:06,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 4161536. Throughput: 0: 1718.4, 1: 1744.4. Samples: 1040910. Policy #0 lag: (min: 1.0, avg: 20.0, max: 33.0) [2023-10-07 23:56:06,211][50642] Avg episode reward: [(0, '5.270'), (1, '6.390')] [2023-10-07 23:56:08,619][52060] Updated weights for policy 0, policy_version 2020 (0.0007) [2023-10-07 23:56:08,998][52060] Updated weights for policy 0, policy_version 2030 (0.0008) [2023-10-07 23:56:09,361][52060] Updated weights for policy 0, policy_version 2040 (0.0009) [2023-10-07 23:56:09,655][52059] Updated weights for policy 1, policy_version 2052 (0.0009) [2023-10-07 23:56:10,021][52059] Updated weights for policy 1, policy_version 2062 (0.0011) [2023-10-07 23:56:10,392][52059] Updated weights for policy 1, policy_version 2072 (0.0010) [2023-10-07 23:56:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 4227072. Throughput: 0: 1694.4, 1: 1738.0. Samples: 1060664. Policy #0 lag: (min: 31.0, avg: 31.6, max: 47.0) [2023-10-07 23:56:11,211][50642] Avg episode reward: [(0, '5.030'), (1, '6.090')] [2023-10-07 23:56:13,281][52060] Updated weights for policy 0, policy_version 2050 (0.0008) [2023-10-07 23:56:13,659][52060] Updated weights for policy 0, policy_version 2060 (0.0009) [2023-10-07 23:56:14,027][52060] Updated weights for policy 0, policy_version 2070 (0.0008) [2023-10-07 23:56:14,399][52060] Updated weights for policy 0, policy_version 2080 (0.0011) [2023-10-07 23:56:14,493][52059] Updated weights for policy 1, policy_version 2082 (0.0011) [2023-10-07 23:56:14,847][52059] Updated weights for policy 1, policy_version 2092 (0.0010) [2023-10-07 23:56:15,203][52059] Updated weights for policy 1, policy_version 2102 (0.0009) [2023-10-07 23:56:15,575][52059] Updated weights for policy 1, policy_version 2112 (0.0007) [2023-10-07 23:56:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 4292608. Throughput: 0: 1707.5, 1: 1717.9. Samples: 1080842. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:56:16,211][50642] Avg episode reward: [(0, '5.980'), (1, '6.480')] [2023-10-07 23:56:16,218][51605] Saving new best policy, reward=5.980! [2023-10-07 23:56:18,394][52060] Updated weights for policy 0, policy_version 2090 (0.0008) [2023-10-07 23:56:18,775][52060] Updated weights for policy 0, policy_version 2100 (0.0007) [2023-10-07 23:56:19,139][52060] Updated weights for policy 0, policy_version 2110 (0.0009) [2023-10-07 23:56:19,582][52059] Updated weights for policy 1, policy_version 2122 (0.0007) [2023-10-07 23:56:19,959][52059] Updated weights for policy 1, policy_version 2132 (0.0010) [2023-10-07 23:56:20,326][52059] Updated weights for policy 1, policy_version 2142 (0.0010) [2023-10-07 23:56:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 4358144. Throughput: 0: 1701.2, 1: 1746.5. Samples: 1091846. Policy #0 lag: (min: 26.0, avg: 26.3, max: 36.0) [2023-10-07 23:56:21,211][50642] Avg episode reward: [(0, '5.360'), (1, '6.370')] [2023-10-07 23:56:23,212][52060] Updated weights for policy 0, policy_version 2120 (0.0011) [2023-10-07 23:56:23,589][52060] Updated weights for policy 0, policy_version 2130 (0.0008) [2023-10-07 23:56:23,956][52060] Updated weights for policy 0, policy_version 2140 (0.0009) [2023-10-07 23:56:24,353][52059] Updated weights for policy 1, policy_version 2152 (0.0008) [2023-10-07 23:56:24,714][52059] Updated weights for policy 1, policy_version 2162 (0.0007) [2023-10-07 23:56:25,069][52059] Updated weights for policy 1, policy_version 2172 (0.0010) [2023-10-07 23:56:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13662.6). Total num frames: 4423680. Throughput: 0: 1691.1, 1: 1720.9. Samples: 1111538. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:56:26,211][50642] Avg episode reward: [(0, '5.690'), (1, '6.970')] [2023-10-07 23:56:26,213][51710] Saving new best policy, reward=6.970! [2023-10-07 23:56:27,802][52060] Updated weights for policy 0, policy_version 2150 (0.0008) [2023-10-07 23:56:28,167][52060] Updated weights for policy 0, policy_version 2160 (0.0010) [2023-10-07 23:56:28,537][52060] Updated weights for policy 0, policy_version 2170 (0.0008) [2023-10-07 23:56:29,163][52059] Updated weights for policy 1, policy_version 2182 (0.0008) [2023-10-07 23:56:29,532][52059] Updated weights for policy 1, policy_version 2192 (0.0009) [2023-10-07 23:56:29,891][52059] Updated weights for policy 1, policy_version 2202 (0.0008) [2023-10-07 23:56:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13662.6). Total num frames: 4489216. Throughput: 0: 1715.2, 1: 1706.0. Samples: 1132302. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:56:31,211][50642] Avg episode reward: [(0, '6.020'), (1, '7.270')] [2023-10-07 23:56:31,219][51605] Saving new best policy, reward=6.020! [2023-10-07 23:56:31,219][51710] Saving new best policy, reward=7.270! [2023-10-07 23:56:32,627][52060] Updated weights for policy 0, policy_version 2180 (0.0008) [2023-10-07 23:56:33,000][52060] Updated weights for policy 0, policy_version 2190 (0.0008) [2023-10-07 23:56:33,382][52060] Updated weights for policy 0, policy_version 2200 (0.0007) [2023-10-07 23:56:33,627][52059] Updated weights for policy 1, policy_version 2212 (0.0008) [2023-10-07 23:56:33,991][52059] Updated weights for policy 1, policy_version 2222 (0.0007) [2023-10-07 23:56:34,353][52059] Updated weights for policy 1, policy_version 2232 (0.0007) [2023-10-07 23:56:36,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13662.6). Total num frames: 4554752. Throughput: 0: 1681.5, 1: 1731.1. Samples: 1142696. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:56:36,211][50642] Avg episode reward: [(0, '5.950'), (1, '6.970')] [2023-10-07 23:56:37,427][52060] Updated weights for policy 0, policy_version 2210 (0.0008) [2023-10-07 23:56:37,831][52060] Updated weights for policy 0, policy_version 2220 (0.0009) [2023-10-07 23:56:38,175][52059] Updated weights for policy 1, policy_version 2242 (0.0007) [2023-10-07 23:56:38,198][52060] Updated weights for policy 0, policy_version 2230 (0.0009) [2023-10-07 23:56:38,579][52060] Updated weights for policy 0, policy_version 2240 (0.0007) [2023-10-07 23:56:38,591][52059] Updated weights for policy 1, policy_version 2252 (0.0009) [2023-10-07 23:56:38,955][52059] Updated weights for policy 1, policy_version 2262 (0.0008) [2023-10-07 23:56:39,320][52059] Updated weights for policy 1, policy_version 2272 (0.0007) [2023-10-07 23:56:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 4620288. Throughput: 0: 1699.2, 1: 1706.5. Samples: 1162818. Policy #0 lag: (min: 31.0, avg: 36.8, max: 63.0) [2023-10-07 23:56:41,211][50642] Avg episode reward: [(0, '5.960'), (1, '6.800')] [2023-10-07 23:56:42,481][52060] Updated weights for policy 0, policy_version 2250 (0.0007) [2023-10-07 23:56:42,859][52060] Updated weights for policy 0, policy_version 2260 (0.0008) [2023-10-07 23:56:43,229][52060] Updated weights for policy 0, policy_version 2270 (0.0007) [2023-10-07 23:56:43,382][52059] Updated weights for policy 1, policy_version 2282 (0.0008) [2023-10-07 23:56:43,742][52059] Updated weights for policy 1, policy_version 2292 (0.0008) [2023-10-07 23:56:44,113][52059] Updated weights for policy 1, policy_version 2302 (0.0007) [2023-10-07 23:56:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 4685824. Throughput: 0: 1711.9, 1: 1717.6. Samples: 1183820. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) [2023-10-07 23:56:46,211][50642] Avg episode reward: [(0, '5.750'), (1, '7.000')] [2023-10-07 23:56:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000002304_2359296.pth... [2023-10-07 23:56:46,221][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000002272_2326528.pth... [2023-10-07 23:56:46,253][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000000704_720896.pth [2023-10-07 23:56:46,262][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000000672_688128.pth [2023-10-07 23:56:47,338][52060] Updated weights for policy 0, policy_version 2280 (0.0009) [2023-10-07 23:56:47,699][52060] Updated weights for policy 0, policy_version 2290 (0.0009) [2023-10-07 23:56:48,059][52059] Updated weights for policy 1, policy_version 2312 (0.0008) [2023-10-07 23:56:48,075][52060] Updated weights for policy 0, policy_version 2300 (0.0008) [2023-10-07 23:56:48,419][52059] Updated weights for policy 1, policy_version 2322 (0.0007) [2023-10-07 23:56:48,790][52059] Updated weights for policy 1, policy_version 2332 (0.0008) [2023-10-07 23:56:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 4751360. Throughput: 0: 1686.4, 1: 1700.9. Samples: 1193338. Policy #0 lag: (min: 9.0, avg: 19.3, max: 41.0) [2023-10-07 23:56:51,211][50642] Avg episode reward: [(0, '5.630'), (1, '7.480')] [2023-10-07 23:56:51,213][51710] Saving new best policy, reward=7.480! [2023-10-07 23:56:52,105][52060] Updated weights for policy 0, policy_version 2310 (0.0008) [2023-10-07 23:56:52,471][52060] Updated weights for policy 0, policy_version 2320 (0.0008) [2023-10-07 23:56:52,753][52059] Updated weights for policy 1, policy_version 2342 (0.0008) [2023-10-07 23:56:52,837][52060] Updated weights for policy 0, policy_version 2330 (0.0007) [2023-10-07 23:56:53,129][52059] Updated weights for policy 1, policy_version 2352 (0.0007) [2023-10-07 23:56:53,506][52059] Updated weights for policy 1, policy_version 2362 (0.0009) [2023-10-07 23:56:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 4816896. Throughput: 0: 1709.7, 1: 1704.9. Samples: 1214322. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:56:56,211][50642] Avg episode reward: [(0, '6.370'), (1, '7.520')] [2023-10-07 23:56:56,212][51605] Saving new best policy, reward=6.370! [2023-10-07 23:56:56,213][51710] Saving new best policy, reward=7.520! [2023-10-07 23:56:56,699][52060] Updated weights for policy 0, policy_version 2340 (0.0009) [2023-10-07 23:56:57,073][52060] Updated weights for policy 0, policy_version 2350 (0.0007) [2023-10-07 23:56:57,362][52059] Updated weights for policy 1, policy_version 2372 (0.0007) [2023-10-07 23:56:57,441][52060] Updated weights for policy 0, policy_version 2360 (0.0010) [2023-10-07 23:56:57,718][52059] Updated weights for policy 1, policy_version 2382 (0.0007) [2023-10-07 23:56:58,082][52059] Updated weights for policy 1, policy_version 2392 (0.0010) [2023-10-07 23:57:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 4882432. Throughput: 0: 1707.0, 1: 1729.8. Samples: 1235496. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-07 23:57:01,211][50642] Avg episode reward: [(0, '6.110'), (1, '7.580')] [2023-10-07 23:57:01,222][51710] Saving new best policy, reward=7.580! [2023-10-07 23:57:01,560][52060] Updated weights for policy 0, policy_version 2370 (0.0008) [2023-10-07 23:57:01,929][52060] Updated weights for policy 0, policy_version 2380 (0.0009) [2023-10-07 23:57:02,105][52059] Updated weights for policy 1, policy_version 2402 (0.0010) [2023-10-07 23:57:02,299][52060] Updated weights for policy 0, policy_version 2390 (0.0009) [2023-10-07 23:57:02,473][52059] Updated weights for policy 1, policy_version 2412 (0.0007) [2023-10-07 23:57:02,665][52060] Updated weights for policy 0, policy_version 2400 (0.0007) [2023-10-07 23:57:02,833][52059] Updated weights for policy 1, policy_version 2422 (0.0007) [2023-10-07 23:57:03,195][52059] Updated weights for policy 1, policy_version 2432 (0.0010) [2023-10-07 23:57:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 4947968. Throughput: 0: 1701.0, 1: 1701.1. Samples: 1244940. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:57:06,211][50642] Avg episode reward: [(0, '5.880'), (1, '7.210')] [2023-10-07 23:57:06,548][52060] Updated weights for policy 0, policy_version 2410 (0.0008) [2023-10-07 23:57:06,921][52060] Updated weights for policy 0, policy_version 2420 (0.0009) [2023-10-07 23:57:07,255][52059] Updated weights for policy 1, policy_version 2442 (0.0009) [2023-10-07 23:57:07,287][52060] Updated weights for policy 0, policy_version 2430 (0.0009) [2023-10-07 23:57:07,619][52059] Updated weights for policy 1, policy_version 2452 (0.0010) [2023-10-07 23:57:07,990][52059] Updated weights for policy 1, policy_version 2462 (0.0008) [2023-10-07 23:57:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 5013504. Throughput: 0: 1713.7, 1: 1723.0. Samples: 1266188. Policy #0 lag: (min: 4.0, avg: 4.6, max: 22.0) [2023-10-07 23:57:11,211][50642] Avg episode reward: [(0, '6.300'), (1, '6.800')] [2023-10-07 23:57:11,352][52060] Updated weights for policy 0, policy_version 2440 (0.0009) [2023-10-07 23:57:11,713][52060] Updated weights for policy 0, policy_version 2450 (0.0010) [2023-10-07 23:57:11,867][52059] Updated weights for policy 1, policy_version 2472 (0.0008) [2023-10-07 23:57:12,088][52060] Updated weights for policy 0, policy_version 2460 (0.0008) [2023-10-07 23:57:12,234][52059] Updated weights for policy 1, policy_version 2482 (0.0008) [2023-10-07 23:57:12,597][52059] Updated weights for policy 1, policy_version 2492 (0.0009) [2023-10-07 23:57:16,020][52060] Updated weights for policy 0, policy_version 2470 (0.0007) [2023-10-07 23:57:16,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 5079040. Throughput: 0: 1713.8, 1: 1729.4. Samples: 1287246. Policy #0 lag: (min: 31.0, avg: 31.7, max: 49.0) [2023-10-07 23:57:16,211][50642] Avg episode reward: [(0, '6.050'), (1, '7.240')] [2023-10-07 23:57:16,383][52060] Updated weights for policy 0, policy_version 2480 (0.0008) [2023-10-07 23:57:16,668][52059] Updated weights for policy 1, policy_version 2502 (0.0008) [2023-10-07 23:57:16,756][52060] Updated weights for policy 0, policy_version 2490 (0.0009) [2023-10-07 23:57:17,028][52059] Updated weights for policy 1, policy_version 2512 (0.0009) [2023-10-07 23:57:17,396][52059] Updated weights for policy 1, policy_version 2522 (0.0009) [2023-10-07 23:57:20,660][52060] Updated weights for policy 0, policy_version 2500 (0.0008) [2023-10-07 23:57:21,016][52060] Updated weights for policy 0, policy_version 2510 (0.0010) [2023-10-07 23:57:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 5144576. Throughput: 0: 1717.5, 1: 1705.9. Samples: 1296748. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-07 23:57:21,211][50642] Avg episode reward: [(0, '6.200'), (1, '6.900')] [2023-10-07 23:57:21,352][52059] Updated weights for policy 1, policy_version 2532 (0.0007) [2023-10-07 23:57:21,398][52060] Updated weights for policy 0, policy_version 2520 (0.0008) [2023-10-07 23:57:21,718][52059] Updated weights for policy 1, policy_version 2542 (0.0008) [2023-10-07 23:57:22,081][52059] Updated weights for policy 1, policy_version 2552 (0.0007) [2023-10-07 23:57:25,457][52060] Updated weights for policy 0, policy_version 2530 (0.0009) [2023-10-07 23:57:25,838][52060] Updated weights for policy 0, policy_version 2540 (0.0008) [2023-10-07 23:57:26,002][52059] Updated weights for policy 1, policy_version 2562 (0.0008) [2023-10-07 23:57:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 5210112. Throughput: 0: 1718.3, 1: 1732.0. Samples: 1318080. Policy #0 lag: (min: 18.0, avg: 22.2, max: 50.0) [2023-10-07 23:57:26,211][52060] Updated weights for policy 0, policy_version 2550 (0.0007) [2023-10-07 23:57:26,211][50642] Avg episode reward: [(0, '5.950'), (1, '7.100')] [2023-10-07 23:57:26,437][52059] Updated weights for policy 1, policy_version 2572 (0.0008) [2023-10-07 23:57:26,586][52060] Updated weights for policy 0, policy_version 2560 (0.0008) [2023-10-07 23:57:26,809][52059] Updated weights for policy 1, policy_version 2582 (0.0009) [2023-10-07 23:57:27,174][52059] Updated weights for policy 1, policy_version 2592 (0.0008) [2023-10-07 23:57:30,501][52060] Updated weights for policy 0, policy_version 2570 (0.0009) [2023-10-07 23:57:30,867][52060] Updated weights for policy 0, policy_version 2580 (0.0007) [2023-10-07 23:57:31,091][52059] Updated weights for policy 1, policy_version 2602 (0.0007) [2023-10-07 23:57:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 5275648. Throughput: 0: 1701.3, 1: 1729.8. Samples: 1338220. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:57:31,211][50642] Avg episode reward: [(0, '6.140'), (1, '8.020')] [2023-10-07 23:57:31,232][52060] Updated weights for policy 0, policy_version 2590 (0.0007) [2023-10-07 23:57:31,461][52059] Updated weights for policy 1, policy_version 2612 (0.0007) [2023-10-07 23:57:31,822][52059] Updated weights for policy 1, policy_version 2622 (0.0008) [2023-10-07 23:57:31,890][51710] Saving new best policy, reward=8.020! [2023-10-07 23:57:35,229][52060] Updated weights for policy 0, policy_version 2600 (0.0009) [2023-10-07 23:57:35,605][52060] Updated weights for policy 0, policy_version 2610 (0.0010) [2023-10-07 23:57:35,837][52059] Updated weights for policy 1, policy_version 2632 (0.0008) [2023-10-07 23:57:35,981][52060] Updated weights for policy 0, policy_version 2620 (0.0007) [2023-10-07 23:57:36,199][52059] Updated weights for policy 1, policy_version 2642 (0.0007) [2023-10-07 23:57:36,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 5373952. Throughput: 0: 1722.5, 1: 1727.5. Samples: 1348590. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:57:36,211][50642] Avg episode reward: [(0, '6.280'), (1, '7.770')] [2023-10-07 23:57:36,567][52059] Updated weights for policy 1, policy_version 2652 (0.0009) [2023-10-07 23:57:40,010][52060] Updated weights for policy 0, policy_version 2630 (0.0007) [2023-10-07 23:57:40,304][52059] Updated weights for policy 1, policy_version 2662 (0.0008) [2023-10-07 23:57:40,372][52060] Updated weights for policy 0, policy_version 2640 (0.0007) [2023-10-07 23:57:40,669][52059] Updated weights for policy 1, policy_version 2672 (0.0008) [2023-10-07 23:57:40,743][52060] Updated weights for policy 0, policy_version 2650 (0.0010) [2023-10-07 23:57:41,033][52059] Updated weights for policy 1, policy_version 2682 (0.0007) [2023-10-07 23:57:41,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 5439488. Throughput: 0: 1717.2, 1: 1740.4. Samples: 1369918. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:57:41,211][50642] Avg episode reward: [(0, '6.170'), (1, '7.930')] [2023-10-07 23:57:44,819][52060] Updated weights for policy 0, policy_version 2660 (0.0008) [2023-10-07 23:57:44,977][52059] Updated weights for policy 1, policy_version 2692 (0.0008) [2023-10-07 23:57:45,185][52060] Updated weights for policy 0, policy_version 2670 (0.0009) [2023-10-07 23:57:45,350][52059] Updated weights for policy 1, policy_version 2702 (0.0007) [2023-10-07 23:57:45,563][52060] Updated weights for policy 0, policy_version 2680 (0.0009) [2023-10-07 23:57:45,710][52059] Updated weights for policy 1, policy_version 2712 (0.0008) [2023-10-07 23:57:46,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 5537792. Throughput: 0: 1691.3, 1: 1715.7. Samples: 1388806. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:57:46,211][50642] Avg episode reward: [(0, '5.940'), (1, '7.730')] [2023-10-07 23:57:49,566][52060] Updated weights for policy 0, policy_version 2690 (0.0009) [2023-10-07 23:57:49,730][52059] Updated weights for policy 1, policy_version 2722 (0.0008) [2023-10-07 23:57:49,942][52060] Updated weights for policy 0, policy_version 2700 (0.0008) [2023-10-07 23:57:50,097][52059] Updated weights for policy 1, policy_version 2732 (0.0007) [2023-10-07 23:57:50,310][52060] Updated weights for policy 0, policy_version 2710 (0.0011) [2023-10-07 23:57:50,458][52059] Updated weights for policy 1, policy_version 2742 (0.0010) [2023-10-07 23:57:50,675][52060] Updated weights for policy 0, policy_version 2720 (0.0010) [2023-10-07 23:57:50,818][52059] Updated weights for policy 1, policy_version 2752 (0.0008) [2023-10-07 23:57:51,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 5603328. Throughput: 0: 1718.3, 1: 1735.6. Samples: 1400364. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-07 23:57:51,211][50642] Avg episode reward: [(0, '6.480'), (1, '8.260')] [2023-10-07 23:57:51,212][51605] Saving new best policy, reward=6.480! [2023-10-07 23:57:51,213][51710] Saving new best policy, reward=8.260! [2023-10-07 23:57:54,649][52060] Updated weights for policy 0, policy_version 2730 (0.0008) [2023-10-07 23:57:54,827][52059] Updated weights for policy 1, policy_version 2762 (0.0009) [2023-10-07 23:57:55,016][52060] Updated weights for policy 0, policy_version 2740 (0.0008) [2023-10-07 23:57:55,195][52059] Updated weights for policy 1, policy_version 2772 (0.0009) [2023-10-07 23:57:55,387][52060] Updated weights for policy 0, policy_version 2750 (0.0008) [2023-10-07 23:57:55,568][52059] Updated weights for policy 1, policy_version 2782 (0.0009) [2023-10-07 23:57:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 5668864. Throughput: 0: 1698.8, 1: 1728.7. Samples: 1420426. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) [2023-10-07 23:57:56,211][50642] Avg episode reward: [(0, '6.420'), (1, '8.500')] [2023-10-07 23:57:56,213][51710] Saving new best policy, reward=8.500! [2023-10-07 23:57:59,251][52060] Updated weights for policy 0, policy_version 2760 (0.0008) [2023-10-07 23:57:59,493][52059] Updated weights for policy 1, policy_version 2792 (0.0008) [2023-10-07 23:57:59,623][52060] Updated weights for policy 0, policy_version 2770 (0.0009) [2023-10-07 23:57:59,861][52059] Updated weights for policy 1, policy_version 2802 (0.0009) [2023-10-07 23:57:59,997][52060] Updated weights for policy 0, policy_version 2780 (0.0008) [2023-10-07 23:58:00,228][52059] Updated weights for policy 1, policy_version 2812 (0.0009) [2023-10-07 23:58:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 5734400. Throughput: 0: 1682.1, 1: 1715.0. Samples: 1440116. Policy #0 lag: (min: 25.0, avg: 36.5, max: 57.0) [2023-10-07 23:58:01,211][50642] Avg episode reward: [(0, '5.910'), (1, '8.420')] [2023-10-07 23:58:04,028][52059] Updated weights for policy 1, policy_version 2822 (0.0008) [2023-10-07 23:58:04,124][52060] Updated weights for policy 0, policy_version 2790 (0.0007) [2023-10-07 23:58:04,391][52059] Updated weights for policy 1, policy_version 2832 (0.0009) [2023-10-07 23:58:04,483][52060] Updated weights for policy 0, policy_version 2800 (0.0008) [2023-10-07 23:58:04,755][52059] Updated weights for policy 1, policy_version 2842 (0.0010) [2023-10-07 23:58:04,854][52060] Updated weights for policy 0, policy_version 2810 (0.0009) [2023-10-07 23:58:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 5799936. Throughput: 0: 1712.3, 1: 1743.0. Samples: 1452238. Policy #0 lag: (min: 19.0, avg: 22.1, max: 51.0) [2023-10-07 23:58:06,211][50642] Avg episode reward: [(0, '6.810'), (1, '9.010')] [2023-10-07 23:58:06,213][51605] Saving new best policy, reward=6.810! [2023-10-07 23:58:06,213][51710] Saving new best policy, reward=9.010! [2023-10-07 23:58:08,775][52060] Updated weights for policy 0, policy_version 2820 (0.0010) [2023-10-07 23:58:08,831][52059] Updated weights for policy 1, policy_version 2852 (0.0008) [2023-10-07 23:58:09,139][52060] Updated weights for policy 0, policy_version 2830 (0.0008) [2023-10-07 23:58:09,195][52059] Updated weights for policy 1, policy_version 2862 (0.0009) [2023-10-07 23:58:09,499][52060] Updated weights for policy 0, policy_version 2840 (0.0009) [2023-10-07 23:58:09,556][52059] Updated weights for policy 1, policy_version 2872 (0.0009) [2023-10-07 23:58:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 5865472. Throughput: 0: 1684.1, 1: 1709.7. Samples: 1470800. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:58:11,211][50642] Avg episode reward: [(0, '6.450'), (1, '8.720')] [2023-10-07 23:58:13,478][52059] Updated weights for policy 1, policy_version 2882 (0.0009) [2023-10-07 23:58:13,566][52060] Updated weights for policy 0, policy_version 2850 (0.0010) [2023-10-07 23:58:13,853][52059] Updated weights for policy 1, policy_version 2892 (0.0008) [2023-10-07 23:58:13,941][52060] Updated weights for policy 0, policy_version 2860 (0.0007) [2023-10-07 23:58:14,217][52059] Updated weights for policy 1, policy_version 2902 (0.0007) [2023-10-07 23:58:14,313][52060] Updated weights for policy 0, policy_version 2870 (0.0008) [2023-10-07 23:58:14,588][52059] Updated weights for policy 1, policy_version 2912 (0.0009) [2023-10-07 23:58:14,695][52060] Updated weights for policy 0, policy_version 2880 (0.0008) [2023-10-07 23:58:16,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 5931008. Throughput: 0: 1696.0, 1: 1711.4. Samples: 1491554. Policy #0 lag: (min: 16.0, avg: 44.5, max: 48.0) [2023-10-07 23:58:16,211][50642] Avg episode reward: [(0, '5.880'), (1, '8.370')] [2023-10-07 23:58:18,579][52060] Updated weights for policy 0, policy_version 2890 (0.0009) [2023-10-07 23:58:18,712][52059] Updated weights for policy 1, policy_version 2922 (0.0007) [2023-10-07 23:58:18,950][52060] Updated weights for policy 0, policy_version 2900 (0.0008) [2023-10-07 23:58:19,074][52059] Updated weights for policy 1, policy_version 2932 (0.0008) [2023-10-07 23:58:19,317][52060] Updated weights for policy 0, policy_version 2910 (0.0008) [2023-10-07 23:58:19,430][52059] Updated weights for policy 1, policy_version 2942 (0.0008) [2023-10-07 23:58:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 5996544. Throughput: 0: 1690.3, 1: 1725.6. Samples: 1502302. Policy #0 lag: (min: 15.0, avg: 19.7, max: 47.0) [2023-10-07 23:58:21,211][50642] Avg episode reward: [(0, '6.480'), (1, '8.660')] [2023-10-07 23:58:23,421][52060] Updated weights for policy 0, policy_version 2920 (0.0007) [2023-10-07 23:58:23,537][52059] Updated weights for policy 1, policy_version 2952 (0.0007) [2023-10-07 23:58:23,797][52060] Updated weights for policy 0, policy_version 2930 (0.0008) [2023-10-07 23:58:23,902][52059] Updated weights for policy 1, policy_version 2962 (0.0010) [2023-10-07 23:58:24,171][52060] Updated weights for policy 0, policy_version 2940 (0.0008) [2023-10-07 23:58:24,265][52059] Updated weights for policy 1, policy_version 2972 (0.0008) [2023-10-07 23:58:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 6062080. Throughput: 0: 1678.3, 1: 1695.4. Samples: 1521732. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-07 23:58:26,211][50642] Avg episode reward: [(0, '6.810'), (1, '8.430')] [2023-10-07 23:58:28,109][52060] Updated weights for policy 0, policy_version 2950 (0.0008) [2023-10-07 23:58:28,209][52059] Updated weights for policy 1, policy_version 2982 (0.0007) [2023-10-07 23:58:28,474][52060] Updated weights for policy 0, policy_version 2960 (0.0007) [2023-10-07 23:58:28,572][52059] Updated weights for policy 1, policy_version 2992 (0.0007) [2023-10-07 23:58:28,851][52060] Updated weights for policy 0, policy_version 2970 (0.0007) [2023-10-07 23:58:28,936][52059] Updated weights for policy 1, policy_version 3002 (0.0008) [2023-10-07 23:58:31,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13662.6). Total num frames: 6127616. Throughput: 0: 1702.9, 1: 1721.6. Samples: 1542910. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:58:31,211][50642] Avg episode reward: [(0, '6.380'), (1, '8.790')] [2023-10-07 23:58:32,901][52060] Updated weights for policy 0, policy_version 2980 (0.0007) [2023-10-07 23:58:32,905][52059] Updated weights for policy 1, policy_version 3012 (0.0008) [2023-10-07 23:58:33,264][52059] Updated weights for policy 1, policy_version 3022 (0.0008) [2023-10-07 23:58:33,278][52060] Updated weights for policy 0, policy_version 2990 (0.0008) [2023-10-07 23:58:33,628][52059] Updated weights for policy 1, policy_version 3032 (0.0009) [2023-10-07 23:58:33,644][52060] Updated weights for policy 0, policy_version 3000 (0.0007) [2023-10-07 23:58:36,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 6193152. Throughput: 0: 1677.8, 1: 1702.2. Samples: 1552464. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:58:36,211][50642] Avg episode reward: [(0, '7.240'), (1, '8.860')] [2023-10-07 23:58:36,211][51605] Saving new best policy, reward=7.240! [2023-10-07 23:58:37,546][52059] Updated weights for policy 1, policy_version 3042 (0.0008) [2023-10-07 23:58:37,574][52060] Updated weights for policy 0, policy_version 3010 (0.0007) [2023-10-07 23:58:37,900][52059] Updated weights for policy 1, policy_version 3052 (0.0007) [2023-10-07 23:58:37,946][52060] Updated weights for policy 0, policy_version 3020 (0.0008) [2023-10-07 23:58:38,266][52059] Updated weights for policy 1, policy_version 3062 (0.0007) [2023-10-07 23:58:38,310][52060] Updated weights for policy 0, policy_version 3030 (0.0009) [2023-10-07 23:58:38,626][52059] Updated weights for policy 1, policy_version 3072 (0.0007) [2023-10-07 23:58:38,680][52060] Updated weights for policy 0, policy_version 3040 (0.0008) [2023-10-07 23:58:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 6258688. Throughput: 0: 1688.9, 1: 1705.0. Samples: 1573152. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:58:41,211][50642] Avg episode reward: [(0, '6.530'), (1, '9.020')] [2023-10-07 23:58:41,212][51710] Saving new best policy, reward=9.020! [2023-10-07 23:58:42,506][52059] Updated weights for policy 1, policy_version 3082 (0.0008) [2023-10-07 23:58:42,756][52060] Updated weights for policy 0, policy_version 3050 (0.0008) [2023-10-07 23:58:42,869][52059] Updated weights for policy 1, policy_version 3092 (0.0008) [2023-10-07 23:58:43,130][52060] Updated weights for policy 0, policy_version 3060 (0.0009) [2023-10-07 23:58:43,245][52059] Updated weights for policy 1, policy_version 3102 (0.0009) [2023-10-07 23:58:43,492][52060] Updated weights for policy 0, policy_version 3070 (0.0009) [2023-10-07 23:58:46,210][50642] Fps is (10 sec: 13106.7, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 6324224. Throughput: 0: 1702.1, 1: 1723.4. Samples: 1594264. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:58:46,211][50642] Avg episode reward: [(0, '6.170'), (1, '8.080')] [2023-10-07 23:58:46,222][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000003072_3145728.pth... [2023-10-07 23:58:46,222][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000003104_3178496.pth... [2023-10-07 23:58:46,261][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000001472_1507328.pth [2023-10-07 23:58:46,262][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000001504_1540096.pth [2023-10-07 23:58:47,252][52059] Updated weights for policy 1, policy_version 3112 (0.0007) [2023-10-07 23:58:47,467][52060] Updated weights for policy 0, policy_version 3080 (0.0008) [2023-10-07 23:58:47,616][52059] Updated weights for policy 1, policy_version 3122 (0.0008) [2023-10-07 23:58:47,840][52060] Updated weights for policy 0, policy_version 3090 (0.0007) [2023-10-07 23:58:47,981][52059] Updated weights for policy 1, policy_version 3132 (0.0008) [2023-10-07 23:58:48,207][52060] Updated weights for policy 0, policy_version 3100 (0.0009) [2023-10-07 23:58:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 6389760. Throughput: 0: 1670.0, 1: 1692.7. Samples: 1603558. Policy #0 lag: (min: 3.0, avg: 12.2, max: 35.0) [2023-10-07 23:58:51,211][50642] Avg episode reward: [(0, '7.010'), (1, '8.720')] [2023-10-07 23:58:51,931][52059] Updated weights for policy 1, policy_version 3142 (0.0007) [2023-10-07 23:58:52,292][52059] Updated weights for policy 1, policy_version 3152 (0.0007) [2023-10-07 23:58:52,403][52060] Updated weights for policy 0, policy_version 3110 (0.0007) [2023-10-07 23:58:52,651][52059] Updated weights for policy 1, policy_version 3162 (0.0008) [2023-10-07 23:58:52,776][52060] Updated weights for policy 0, policy_version 3120 (0.0007) [2023-10-07 23:58:53,135][52060] Updated weights for policy 0, policy_version 3130 (0.0007) [2023-10-07 23:58:56,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 6455296. Throughput: 0: 1698.9, 1: 1725.3. Samples: 1624890. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) [2023-10-07 23:58:56,211][50642] Avg episode reward: [(0, '6.400'), (1, '8.520')] [2023-10-07 23:58:56,634][52059] Updated weights for policy 1, policy_version 3172 (0.0008) [2023-10-07 23:58:57,005][52059] Updated weights for policy 1, policy_version 3182 (0.0007) [2023-10-07 23:58:57,213][52060] Updated weights for policy 0, policy_version 3140 (0.0007) [2023-10-07 23:58:57,370][52059] Updated weights for policy 1, policy_version 3192 (0.0007) [2023-10-07 23:58:57,578][52060] Updated weights for policy 0, policy_version 3150 (0.0007) [2023-10-07 23:58:57,947][52060] Updated weights for policy 0, policy_version 3160 (0.0009) [2023-10-07 23:59:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 6520832. Throughput: 0: 1704.2, 1: 1734.3. Samples: 1646286. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) [2023-10-07 23:59:01,211][50642] Avg episode reward: [(0, '6.040'), (1, '8.680')] [2023-10-07 23:59:01,295][52059] Updated weights for policy 1, policy_version 3202 (0.0008) [2023-10-07 23:59:01,720][52059] Updated weights for policy 1, policy_version 3212 (0.0008) [2023-10-07 23:59:01,938][52060] Updated weights for policy 0, policy_version 3170 (0.0009) [2023-10-07 23:59:02,077][52059] Updated weights for policy 1, policy_version 3222 (0.0007) [2023-10-07 23:59:02,342][52060] Updated weights for policy 0, policy_version 3180 (0.0007) [2023-10-07 23:59:02,438][52059] Updated weights for policy 1, policy_version 3232 (0.0007) [2023-10-07 23:59:02,711][52060] Updated weights for policy 0, policy_version 3190 (0.0007) [2023-10-07 23:59:03,085][52060] Updated weights for policy 0, policy_version 3200 (0.0007) [2023-10-07 23:59:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 6586368. Throughput: 0: 1693.2, 1: 1713.6. Samples: 1655606. Policy #0 lag: (min: 29.0, avg: 37.0, max: 61.0) [2023-10-07 23:59:06,211][50642] Avg episode reward: [(0, '6.650'), (1, '8.870')] [2023-10-07 23:59:06,350][52059] Updated weights for policy 1, policy_version 3242 (0.0011) [2023-10-07 23:59:06,722][52059] Updated weights for policy 1, policy_version 3252 (0.0009) [2023-10-07 23:59:06,917][52060] Updated weights for policy 0, policy_version 3210 (0.0008) [2023-10-07 23:59:07,092][52059] Updated weights for policy 1, policy_version 3262 (0.0008) [2023-10-07 23:59:07,287][52060] Updated weights for policy 0, policy_version 3220 (0.0010) [2023-10-07 23:59:07,661][52060] Updated weights for policy 0, policy_version 3230 (0.0009) [2023-10-07 23:59:11,196][52059] Updated weights for policy 1, policy_version 3272 (0.0008) [2023-10-07 23:59:11,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 6651904. Throughput: 0: 1710.3, 1: 1731.2. Samples: 1676596. Policy #0 lag: (min: 29.0, avg: 37.0, max: 61.0) [2023-10-07 23:59:11,211][50642] Avg episode reward: [(0, '6.460'), (1, '8.890')] [2023-10-07 23:59:11,558][52059] Updated weights for policy 1, policy_version 3282 (0.0009) [2023-10-07 23:59:11,753][52060] Updated weights for policy 0, policy_version 3240 (0.0008) [2023-10-07 23:59:11,919][52059] Updated weights for policy 1, policy_version 3292 (0.0009) [2023-10-07 23:59:12,128][52060] Updated weights for policy 0, policy_version 3250 (0.0008) [2023-10-07 23:59:12,504][52060] Updated weights for policy 0, policy_version 3260 (0.0010) [2023-10-07 23:59:15,763][52059] Updated weights for policy 1, policy_version 3302 (0.0009) [2023-10-07 23:59:16,124][52059] Updated weights for policy 1, policy_version 3312 (0.0009) [2023-10-07 23:59:16,211][50642] Fps is (10 sec: 13106.6, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 6717440. Throughput: 0: 1703.1, 1: 1726.9. Samples: 1697260. Policy #0 lag: (min: 19.0, avg: 26.8, max: 51.0) [2023-10-07 23:59:16,212][50642] Avg episode reward: [(0, '6.350'), (1, '9.500')] [2023-10-07 23:59:16,495][52059] Updated weights for policy 1, policy_version 3322 (0.0008) [2023-10-07 23:59:16,637][52060] Updated weights for policy 0, policy_version 3270 (0.0010) [2023-10-07 23:59:16,714][51710] Saving new best policy, reward=9.500! [2023-10-07 23:59:17,012][52060] Updated weights for policy 0, policy_version 3280 (0.0009) [2023-10-07 23:59:17,390][52060] Updated weights for policy 0, policy_version 3290 (0.0007) [2023-10-07 23:59:20,479][52059] Updated weights for policy 1, policy_version 3332 (0.0009) [2023-10-07 23:59:20,836][52059] Updated weights for policy 1, policy_version 3342 (0.0010) [2023-10-07 23:59:21,198][52059] Updated weights for policy 1, policy_version 3352 (0.0010) [2023-10-07 23:59:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 6782976. Throughput: 0: 1700.2, 1: 1731.5. Samples: 1706890. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:59:21,211][50642] Avg episode reward: [(0, '7.150'), (1, '8.780')] [2023-10-07 23:59:21,304][52060] Updated weights for policy 0, policy_version 3300 (0.0007) [2023-10-07 23:59:21,686][52060] Updated weights for policy 0, policy_version 3310 (0.0010) [2023-10-07 23:59:22,064][52060] Updated weights for policy 0, policy_version 3320 (0.0010) [2023-10-07 23:59:25,092][52059] Updated weights for policy 1, policy_version 3362 (0.0008) [2023-10-07 23:59:25,465][52059] Updated weights for policy 1, policy_version 3372 (0.0009) [2023-10-07 23:59:25,827][52059] Updated weights for policy 1, policy_version 3382 (0.0011) [2023-10-07 23:59:26,055][52060] Updated weights for policy 0, policy_version 3330 (0.0010) [2023-10-07 23:59:26,197][52059] Updated weights for policy 1, policy_version 3392 (0.0011) [2023-10-07 23:59:26,210][50642] Fps is (10 sec: 16384.5, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 6881280. Throughput: 0: 1706.3, 1: 1734.9. Samples: 1728008. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:59:26,213][50642] Avg episode reward: [(0, '6.130'), (1, '8.280')] [2023-10-07 23:59:26,429][52060] Updated weights for policy 0, policy_version 3340 (0.0008) [2023-10-07 23:59:26,812][52060] Updated weights for policy 0, policy_version 3350 (0.0007) [2023-10-07 23:59:27,192][52060] Updated weights for policy 0, policy_version 3360 (0.0009) [2023-10-07 23:59:30,151][52059] Updated weights for policy 1, policy_version 3402 (0.0009) [2023-10-07 23:59:30,509][52059] Updated weights for policy 1, policy_version 3412 (0.0010) [2023-10-07 23:59:30,876][52059] Updated weights for policy 1, policy_version 3422 (0.0010) [2023-10-07 23:59:31,146][52060] Updated weights for policy 0, policy_version 3370 (0.0008) [2023-10-07 23:59:31,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 6946816. Throughput: 0: 1704.9, 1: 1711.9. Samples: 1748020. Policy #0 lag: (min: 31.0, avg: 33.5, max: 63.0) [2023-10-07 23:59:31,211][50642] Avg episode reward: [(0, '6.670'), (1, '9.150')] [2023-10-07 23:59:31,515][52060] Updated weights for policy 0, policy_version 3380 (0.0007) [2023-10-07 23:59:31,885][52060] Updated weights for policy 0, policy_version 3390 (0.0007) [2023-10-07 23:59:35,032][52059] Updated weights for policy 1, policy_version 3432 (0.0008) [2023-10-07 23:59:35,399][52059] Updated weights for policy 1, policy_version 3442 (0.0008) [2023-10-07 23:59:35,704][52060] Updated weights for policy 0, policy_version 3400 (0.0007) [2023-10-07 23:59:35,774][52059] Updated weights for policy 1, policy_version 3452 (0.0009) [2023-10-07 23:59:36,077][52060] Updated weights for policy 0, policy_version 3410 (0.0007) [2023-10-07 23:59:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 7012352. Throughput: 0: 1709.2, 1: 1735.8. Samples: 1758582. Policy #0 lag: (min: 31.0, avg: 33.5, max: 63.0) [2023-10-07 23:59:36,211][50642] Avg episode reward: [(0, '6.940'), (1, '9.090')] [2023-10-07 23:59:36,443][52060] Updated weights for policy 0, policy_version 3420 (0.0008) [2023-10-07 23:59:39,678][52059] Updated weights for policy 1, policy_version 3462 (0.0009) [2023-10-07 23:59:40,040][52059] Updated weights for policy 1, policy_version 3472 (0.0008) [2023-10-07 23:59:40,400][52059] Updated weights for policy 1, policy_version 3482 (0.0008) [2023-10-07 23:59:40,407][52060] Updated weights for policy 0, policy_version 3430 (0.0009) [2023-10-07 23:59:40,768][52060] Updated weights for policy 0, policy_version 3440 (0.0010) [2023-10-07 23:59:41,144][52060] Updated weights for policy 0, policy_version 3450 (0.0009) [2023-10-07 23:59:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 7077888. Throughput: 0: 1717.5, 1: 1723.3. Samples: 1779726. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-07 23:59:41,211][50642] Avg episode reward: [(0, '6.710'), (1, '9.860')] [2023-10-07 23:59:41,213][51710] Saving new best policy, reward=9.860! [2023-10-07 23:59:44,262][52059] Updated weights for policy 1, policy_version 3492 (0.0009) [2023-10-07 23:59:44,628][52059] Updated weights for policy 1, policy_version 3502 (0.0011) [2023-10-07 23:59:44,981][52060] Updated weights for policy 0, policy_version 3460 (0.0008) [2023-10-07 23:59:44,996][52059] Updated weights for policy 1, policy_version 3512 (0.0008) [2023-10-07 23:59:45,356][52060] Updated weights for policy 0, policy_version 3470 (0.0007) [2023-10-07 23:59:45,733][52060] Updated weights for policy 0, policy_version 3480 (0.0008) [2023-10-07 23:59:46,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 7176192. Throughput: 0: 1699.9, 1: 1698.7. Samples: 1799220. Policy #0 lag: (min: 7.0, avg: 9.7, max: 39.0) [2023-10-07 23:59:46,211][50642] Avg episode reward: [(0, '6.670'), (1, '8.680')] [2023-10-07 23:59:49,044][52059] Updated weights for policy 1, policy_version 3522 (0.0007) [2023-10-07 23:59:49,457][52059] Updated weights for policy 1, policy_version 3532 (0.0007) [2023-10-07 23:59:49,791][52060] Updated weights for policy 0, policy_version 3490 (0.0008) [2023-10-07 23:59:49,815][52059] Updated weights for policy 1, policy_version 3542 (0.0009) [2023-10-07 23:59:50,184][52059] Updated weights for policy 1, policy_version 3552 (0.0008) [2023-10-07 23:59:50,196][52060] Updated weights for policy 0, policy_version 3500 (0.0009) [2023-10-07 23:59:50,577][52060] Updated weights for policy 0, policy_version 3510 (0.0010) [2023-10-07 23:59:50,947][52060] Updated weights for policy 0, policy_version 3520 (0.0010) [2023-10-07 23:59:51,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 7241728. Throughput: 0: 1722.5, 1: 1733.0. Samples: 1811106. Policy #0 lag: (min: 31.0, avg: 37.1, max: 63.0) [2023-10-07 23:59:51,211][50642] Avg episode reward: [(0, '6.840'), (1, '9.110')] [2023-10-07 23:59:53,936][52059] Updated weights for policy 1, policy_version 3562 (0.0008) [2023-10-07 23:59:54,296][52059] Updated weights for policy 1, policy_version 3572 (0.0009) [2023-10-07 23:59:54,661][52059] Updated weights for policy 1, policy_version 3582 (0.0007) [2023-10-07 23:59:54,850][52060] Updated weights for policy 0, policy_version 3530 (0.0009) [2023-10-07 23:59:55,223][52060] Updated weights for policy 0, policy_version 3540 (0.0009) [2023-10-07 23:59:55,591][52060] Updated weights for policy 0, policy_version 3550 (0.0008) [2023-10-07 23:59:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 7307264. Throughput: 0: 1711.6, 1: 1705.7. Samples: 1830376. Policy #0 lag: (min: 31.0, avg: 37.1, max: 63.0) [2023-10-07 23:59:56,211][50642] Avg episode reward: [(0, '6.360'), (1, '8.700')] [2023-10-07 23:59:58,712][52059] Updated weights for policy 1, policy_version 3592 (0.0010) [2023-10-07 23:59:59,071][52059] Updated weights for policy 1, policy_version 3602 (0.0011) [2023-10-07 23:59:59,441][52059] Updated weights for policy 1, policy_version 3612 (0.0008) [2023-10-07 23:59:59,498][52060] Updated weights for policy 0, policy_version 3560 (0.0008) [2023-10-07 23:59:59,874][52060] Updated weights for policy 0, policy_version 3570 (0.0009) [2023-10-08 00:00:00,248][52060] Updated weights for policy 0, policy_version 3580 (0.0010) [2023-10-08 00:00:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 7372800. Throughput: 0: 1699.7, 1: 1706.2. Samples: 1850524. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:00:01,211][50642] Avg episode reward: [(0, '6.690'), (1, '9.210')] [2023-10-08 00:00:03,327][52059] Updated weights for policy 1, policy_version 3622 (0.0009) [2023-10-08 00:00:03,691][52059] Updated weights for policy 1, policy_version 3632 (0.0010) [2023-10-08 00:00:04,051][52059] Updated weights for policy 1, policy_version 3642 (0.0009) [2023-10-08 00:00:04,094][52060] Updated weights for policy 0, policy_version 3590 (0.0009) [2023-10-08 00:00:04,460][52060] Updated weights for policy 0, policy_version 3600 (0.0007) [2023-10-08 00:00:04,836][52060] Updated weights for policy 0, policy_version 3610 (0.0007) [2023-10-08 00:00:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 7438336. Throughput: 0: 1736.2, 1: 1705.4. Samples: 1861762. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:00:06,211][50642] Avg episode reward: [(0, '6.690'), (1, '9.490')] [2023-10-08 00:00:08,094][52059] Updated weights for policy 1, policy_version 3652 (0.0008) [2023-10-08 00:00:08,457][52059] Updated weights for policy 1, policy_version 3662 (0.0010) [2023-10-08 00:00:08,829][52059] Updated weights for policy 1, policy_version 3672 (0.0008) [2023-10-08 00:00:08,886][52060] Updated weights for policy 0, policy_version 3620 (0.0008) [2023-10-08 00:00:09,249][52060] Updated weights for policy 0, policy_version 3630 (0.0008) [2023-10-08 00:00:09,622][52060] Updated weights for policy 0, policy_version 3640 (0.0008) [2023-10-08 00:00:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 7503872. Throughput: 0: 1711.6, 1: 1699.1. Samples: 1881490. Policy #0 lag: (min: 31.0, avg: 31.4, max: 43.0) [2023-10-08 00:00:11,211][50642] Avg episode reward: [(0, '6.730'), (1, '9.220')] [2023-10-08 00:00:12,695][52059] Updated weights for policy 1, policy_version 3682 (0.0008) [2023-10-08 00:00:13,072][52059] Updated weights for policy 1, policy_version 3692 (0.0008) [2023-10-08 00:00:13,436][52059] Updated weights for policy 1, policy_version 3702 (0.0007) [2023-10-08 00:00:13,753][52060] Updated weights for policy 0, policy_version 3650 (0.0007) [2023-10-08 00:00:13,798][52059] Updated weights for policy 1, policy_version 3712 (0.0008) [2023-10-08 00:00:14,114][52060] Updated weights for policy 0, policy_version 3660 (0.0009) [2023-10-08 00:00:14,485][52060] Updated weights for policy 0, policy_version 3670 (0.0009) [2023-10-08 00:00:14,858][52060] Updated weights for policy 0, policy_version 3680 (0.0009) [2023-10-08 00:00:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.6, 300 sec: 13773.7). Total num frames: 7569408. Throughput: 0: 1708.4, 1: 1724.2. Samples: 1902486. Policy #0 lag: (min: 31.0, avg: 31.4, max: 43.0) [2023-10-08 00:00:16,211][50642] Avg episode reward: [(0, '6.770'), (1, '9.890')] [2023-10-08 00:00:16,218][51710] Saving new best policy, reward=9.890! [2023-10-08 00:00:17,769][52059] Updated weights for policy 1, policy_version 3722 (0.0009) [2023-10-08 00:00:18,134][52059] Updated weights for policy 1, policy_version 3732 (0.0010) [2023-10-08 00:00:18,505][52059] Updated weights for policy 1, policy_version 3742 (0.0010) [2023-10-08 00:00:18,883][52060] Updated weights for policy 0, policy_version 3690 (0.0007) [2023-10-08 00:00:19,261][52060] Updated weights for policy 0, policy_version 3700 (0.0009) [2023-10-08 00:00:19,630][52060] Updated weights for policy 0, policy_version 3710 (0.0008) [2023-10-08 00:00:21,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 7634944. Throughput: 0: 1724.3, 1: 1699.0. Samples: 1912630. Policy #0 lag: (min: 31.0, avg: 38.6, max: 63.0) [2023-10-08 00:00:21,211][50642] Avg episode reward: [(0, '6.240'), (1, '9.810')] [2023-10-08 00:00:22,275][52059] Updated weights for policy 1, policy_version 3752 (0.0010) [2023-10-08 00:00:22,654][52059] Updated weights for policy 1, policy_version 3762 (0.0009) [2023-10-08 00:00:23,015][52059] Updated weights for policy 1, policy_version 3772 (0.0011) [2023-10-08 00:00:23,593][52060] Updated weights for policy 0, policy_version 3720 (0.0009) [2023-10-08 00:00:23,967][52060] Updated weights for policy 0, policy_version 3730 (0.0010) [2023-10-08 00:00:24,348][52060] Updated weights for policy 0, policy_version 3740 (0.0011) [2023-10-08 00:00:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 7700480. Throughput: 0: 1694.2, 1: 1712.8. Samples: 1933044. Policy #0 lag: (min: 31.0, avg: 38.6, max: 63.0) [2023-10-08 00:00:26,211][50642] Avg episode reward: [(0, '7.150'), (1, '10.420')] [2023-10-08 00:00:26,212][51710] Saving new best policy, reward=10.420! [2023-10-08 00:00:27,192][52059] Updated weights for policy 1, policy_version 3782 (0.0009) [2023-10-08 00:00:27,556][52059] Updated weights for policy 1, policy_version 3792 (0.0008) [2023-10-08 00:00:27,929][52059] Updated weights for policy 1, policy_version 3802 (0.0009) [2023-10-08 00:00:28,431][52060] Updated weights for policy 0, policy_version 3750 (0.0009) [2023-10-08 00:00:28,798][52060] Updated weights for policy 0, policy_version 3760 (0.0008) [2023-10-08 00:00:29,170][52060] Updated weights for policy 0, policy_version 3770 (0.0010) [2023-10-08 00:00:31,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 7766016. Throughput: 0: 1713.2, 1: 1729.6. Samples: 1954148. Policy #0 lag: (min: 16.0, avg: 38.4, max: 48.0) [2023-10-08 00:00:31,211][50642] Avg episode reward: [(0, '6.690'), (1, '9.590')] [2023-10-08 00:00:31,829][52059] Updated weights for policy 1, policy_version 3812 (0.0009) [2023-10-08 00:00:32,196][52059] Updated weights for policy 1, policy_version 3822 (0.0007) [2023-10-08 00:00:32,554][52059] Updated weights for policy 1, policy_version 3832 (0.0008) [2023-10-08 00:00:33,102][52060] Updated weights for policy 0, policy_version 3780 (0.0009) [2023-10-08 00:00:33,475][52060] Updated weights for policy 0, policy_version 3790 (0.0009) [2023-10-08 00:00:33,850][52060] Updated weights for policy 0, policy_version 3800 (0.0008) [2023-10-08 00:00:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 7831552. Throughput: 0: 1695.8, 1: 1698.1. Samples: 1963832. Policy #0 lag: (min: 16.0, avg: 38.4, max: 48.0) [2023-10-08 00:00:36,211][50642] Avg episode reward: [(0, '6.430'), (1, '10.090')] [2023-10-08 00:00:36,569][52059] Updated weights for policy 1, policy_version 3842 (0.0007) [2023-10-08 00:00:36,942][52059] Updated weights for policy 1, policy_version 3852 (0.0007) [2023-10-08 00:00:37,311][52059] Updated weights for policy 1, policy_version 3862 (0.0008) [2023-10-08 00:00:37,676][52059] Updated weights for policy 1, policy_version 3872 (0.0009) [2023-10-08 00:00:37,765][52060] Updated weights for policy 0, policy_version 3810 (0.0009) [2023-10-08 00:00:38,162][52060] Updated weights for policy 0, policy_version 3820 (0.0009) [2023-10-08 00:00:38,532][52060] Updated weights for policy 0, policy_version 3830 (0.0009) [2023-10-08 00:00:38,901][52060] Updated weights for policy 0, policy_version 3840 (0.0008) [2023-10-08 00:00:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 7897088. Throughput: 0: 1697.2, 1: 1720.2. Samples: 1984156. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 00:00:41,211][50642] Avg episode reward: [(0, '7.170'), (1, '9.660')] [2023-10-08 00:00:41,812][52059] Updated weights for policy 1, policy_version 3882 (0.0007) [2023-10-08 00:00:42,180][52059] Updated weights for policy 1, policy_version 3892 (0.0007) [2023-10-08 00:00:42,543][52059] Updated weights for policy 1, policy_version 3902 (0.0008) [2023-10-08 00:00:43,079][52060] Updated weights for policy 0, policy_version 3850 (0.0008) [2023-10-08 00:00:43,458][52060] Updated weights for policy 0, policy_version 3860 (0.0009) [2023-10-08 00:00:43,824][52060] Updated weights for policy 0, policy_version 3870 (0.0009) [2023-10-08 00:00:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 7962624. Throughput: 0: 1710.1, 1: 1721.4. Samples: 2004938. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 00:00:46,211][50642] Avg episode reward: [(0, '6.980'), (1, '9.770')] [2023-10-08 00:00:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000003872_3964928.pth... [2023-10-08 00:00:46,248][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000002272_2326528.pth [2023-10-08 00:00:46,498][52059] Updated weights for policy 1, policy_version 3912 (0.0008) [2023-10-08 00:00:46,864][52059] Updated weights for policy 1, policy_version 3922 (0.0010) [2023-10-08 00:00:47,221][52059] Updated weights for policy 1, policy_version 3932 (0.0010) [2023-10-08 00:00:47,364][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000003936_4030464.pth... [2023-10-08 00:00:47,402][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000002304_2359296.pth [2023-10-08 00:00:47,868][52060] Updated weights for policy 0, policy_version 3880 (0.0008) [2023-10-08 00:00:48,242][52060] Updated weights for policy 0, policy_version 3890 (0.0011) [2023-10-08 00:00:48,618][52060] Updated weights for policy 0, policy_version 3900 (0.0007) [2023-10-08 00:00:51,167][52059] Updated weights for policy 1, policy_version 3942 (0.0008) [2023-10-08 00:00:51,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 8028160. Throughput: 0: 1672.9, 1: 1717.4. Samples: 2014326. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:00:51,211][50642] Avg episode reward: [(0, '6.770'), (1, '9.660')] [2023-10-08 00:00:51,531][52059] Updated weights for policy 1, policy_version 3952 (0.0007) [2023-10-08 00:00:51,901][52059] Updated weights for policy 1, policy_version 3962 (0.0008) [2023-10-08 00:00:52,623][52060] Updated weights for policy 0, policy_version 3910 (0.0008) [2023-10-08 00:00:52,993][52060] Updated weights for policy 0, policy_version 3920 (0.0009) [2023-10-08 00:00:53,382][52060] Updated weights for policy 0, policy_version 3930 (0.0009) [2023-10-08 00:00:55,928][52059] Updated weights for policy 1, policy_version 3972 (0.0009) [2023-10-08 00:00:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13551.5). Total num frames: 8093696. Throughput: 0: 1696.9, 1: 1722.9. Samples: 2035382. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:00:56,211][50642] Avg episode reward: [(0, '7.430'), (1, '10.030')] [2023-10-08 00:00:56,212][51605] Saving new best policy, reward=7.430! [2023-10-08 00:00:56,294][52059] Updated weights for policy 1, policy_version 3982 (0.0007) [2023-10-08 00:00:56,663][52059] Updated weights for policy 1, policy_version 3992 (0.0008) [2023-10-08 00:00:57,262][52060] Updated weights for policy 0, policy_version 3940 (0.0009) [2023-10-08 00:00:57,635][52060] Updated weights for policy 0, policy_version 3950 (0.0009) [2023-10-08 00:00:58,000][52060] Updated weights for policy 0, policy_version 3960 (0.0010) [2023-10-08 00:01:00,723][52059] Updated weights for policy 1, policy_version 4002 (0.0010) [2023-10-08 00:01:01,084][52059] Updated weights for policy 1, policy_version 4012 (0.0008) [2023-10-08 00:01:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13551.5). Total num frames: 8159232. Throughput: 0: 1703.1, 1: 1713.3. Samples: 2056222. Policy #0 lag: (min: 31.0, avg: 36.8, max: 63.0) [2023-10-08 00:01:01,211][50642] Avg episode reward: [(0, '7.050'), (1, '10.290')] [2023-10-08 00:01:01,445][52059] Updated weights for policy 1, policy_version 4022 (0.0008) [2023-10-08 00:01:01,812][52059] Updated weights for policy 1, policy_version 4032 (0.0007) [2023-10-08 00:01:02,023][52060] Updated weights for policy 0, policy_version 3970 (0.0008) [2023-10-08 00:01:02,401][52060] Updated weights for policy 0, policy_version 3980 (0.0008) [2023-10-08 00:01:02,768][52060] Updated weights for policy 0, policy_version 3990 (0.0009) [2023-10-08 00:01:03,146][52060] Updated weights for policy 0, policy_version 4000 (0.0007) [2023-10-08 00:01:05,885][52059] Updated weights for policy 1, policy_version 4042 (0.0009) [2023-10-08 00:01:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13551.5). Total num frames: 8224768. Throughput: 0: 1682.3, 1: 1719.9. Samples: 2065726. Policy #0 lag: (min: 31.0, avg: 36.8, max: 63.0) [2023-10-08 00:01:06,211][50642] Avg episode reward: [(0, '6.630'), (1, '10.460')] [2023-10-08 00:01:06,248][52059] Updated weights for policy 1, policy_version 4052 (0.0009) [2023-10-08 00:01:06,613][52059] Updated weights for policy 1, policy_version 4062 (0.0012) [2023-10-08 00:01:06,677][51710] Saving new best policy, reward=10.460! [2023-10-08 00:01:07,066][52060] Updated weights for policy 0, policy_version 4010 (0.0007) [2023-10-08 00:01:07,433][52060] Updated weights for policy 0, policy_version 4020 (0.0007) [2023-10-08 00:01:07,800][52060] Updated weights for policy 0, policy_version 4030 (0.0007) [2023-10-08 00:01:10,524][52059] Updated weights for policy 1, policy_version 4072 (0.0008) [2023-10-08 00:01:10,888][52059] Updated weights for policy 1, policy_version 4082 (0.0010) [2023-10-08 00:01:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13551.5). Total num frames: 8290304. Throughput: 0: 1710.3, 1: 1716.8. Samples: 2087264. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:01:11,211][50642] Avg episode reward: [(0, '7.460'), (1, '10.320')] [2023-10-08 00:01:11,211][51605] Saving new best policy, reward=7.460! [2023-10-08 00:01:11,264][52059] Updated weights for policy 1, policy_version 4092 (0.0009) [2023-10-08 00:01:11,723][52060] Updated weights for policy 0, policy_version 4040 (0.0009) [2023-10-08 00:01:12,101][52060] Updated weights for policy 0, policy_version 4050 (0.0009) [2023-10-08 00:01:12,480][52060] Updated weights for policy 0, policy_version 4060 (0.0007) [2023-10-08 00:01:14,915][52059] Updated weights for policy 1, policy_version 4102 (0.0007) [2023-10-08 00:01:15,287][52059] Updated weights for policy 1, policy_version 4112 (0.0007) [2023-10-08 00:01:15,644][52059] Updated weights for policy 1, policy_version 4122 (0.0010) [2023-10-08 00:01:16,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 8388608. Throughput: 0: 1709.8, 1: 1696.5. Samples: 2107428. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:01:16,211][50642] Avg episode reward: [(0, '6.950'), (1, '10.740')] [2023-10-08 00:01:16,220][51710] Saving new best policy, reward=10.740! [2023-10-08 00:01:16,517][52060] Updated weights for policy 0, policy_version 4070 (0.0008) [2023-10-08 00:01:16,893][52060] Updated weights for policy 0, policy_version 4080 (0.0007) [2023-10-08 00:01:17,268][52060] Updated weights for policy 0, policy_version 4090 (0.0007) [2023-10-08 00:01:19,542][52059] Updated weights for policy 1, policy_version 4132 (0.0009) [2023-10-08 00:01:19,903][52059] Updated weights for policy 1, policy_version 4142 (0.0010) [2023-10-08 00:01:20,271][52059] Updated weights for policy 1, policy_version 4152 (0.0008) [2023-10-08 00:01:21,128][52060] Updated weights for policy 0, policy_version 4100 (0.0007) [2023-10-08 00:01:21,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 8454144. Throughput: 0: 1701.2, 1: 1730.5. Samples: 2118258. Policy #0 lag: (min: 2.0, avg: 11.0, max: 34.0) [2023-10-08 00:01:21,211][50642] Avg episode reward: [(0, '6.830'), (1, '10.000')] [2023-10-08 00:01:21,510][52060] Updated weights for policy 0, policy_version 4110 (0.0007) [2023-10-08 00:01:21,875][52060] Updated weights for policy 0, policy_version 4120 (0.0008) [2023-10-08 00:01:24,266][52059] Updated weights for policy 1, policy_version 4162 (0.0007) [2023-10-08 00:01:24,689][52059] Updated weights for policy 1, policy_version 4172 (0.0007) [2023-10-08 00:01:25,053][52059] Updated weights for policy 1, policy_version 4182 (0.0008) [2023-10-08 00:01:25,412][52059] Updated weights for policy 1, policy_version 4192 (0.0008) [2023-10-08 00:01:25,793][52060] Updated weights for policy 0, policy_version 4130 (0.0008) [2023-10-08 00:01:26,160][52060] Updated weights for policy 0, policy_version 4140 (0.0009) [2023-10-08 00:01:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 8519680. Throughput: 0: 1716.3, 1: 1724.9. Samples: 2139010. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 00:01:26,211][50642] Avg episode reward: [(0, '7.150'), (1, '9.880')] [2023-10-08 00:01:26,528][52060] Updated weights for policy 0, policy_version 4150 (0.0007) [2023-10-08 00:01:26,905][52060] Updated weights for policy 0, policy_version 4160 (0.0007) [2023-10-08 00:01:29,316][52059] Updated weights for policy 1, policy_version 4202 (0.0007) [2023-10-08 00:01:29,679][52059] Updated weights for policy 1, policy_version 4212 (0.0008) [2023-10-08 00:01:30,050][52059] Updated weights for policy 1, policy_version 4222 (0.0008) [2023-10-08 00:01:30,804][52060] Updated weights for policy 0, policy_version 4170 (0.0008) [2023-10-08 00:01:31,184][52060] Updated weights for policy 0, policy_version 4180 (0.0008) [2023-10-08 00:01:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 8585216. Throughput: 0: 1713.5, 1: 1711.2. Samples: 2159052. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:01:31,211][50642] Avg episode reward: [(0, '7.470'), (1, '10.250')] [2023-10-08 00:01:31,557][52060] Updated weights for policy 0, policy_version 4190 (0.0008) [2023-10-08 00:01:31,625][51605] Saving new best policy, reward=7.470! [2023-10-08 00:01:34,129][52059] Updated weights for policy 1, policy_version 4232 (0.0010) [2023-10-08 00:01:34,486][52059] Updated weights for policy 1, policy_version 4242 (0.0009) [2023-10-08 00:01:34,840][52059] Updated weights for policy 1, policy_version 4252 (0.0010) [2023-10-08 00:01:35,509][52060] Updated weights for policy 0, policy_version 4200 (0.0009) [2023-10-08 00:01:35,881][52060] Updated weights for policy 0, policy_version 4210 (0.0007) [2023-10-08 00:01:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 8650752. Throughput: 0: 1727.1, 1: 1735.0. Samples: 2170120. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:01:36,211][50642] Avg episode reward: [(0, '7.050'), (1, '9.720')] [2023-10-08 00:01:36,258][52060] Updated weights for policy 0, policy_version 4220 (0.0008) [2023-10-08 00:01:38,830][52059] Updated weights for policy 1, policy_version 4262 (0.0007) [2023-10-08 00:01:39,194][52059] Updated weights for policy 1, policy_version 4272 (0.0010) [2023-10-08 00:01:39,558][52059] Updated weights for policy 1, policy_version 4282 (0.0007) [2023-10-08 00:01:40,184][52060] Updated weights for policy 0, policy_version 4230 (0.0007) [2023-10-08 00:01:40,561][52060] Updated weights for policy 0, policy_version 4240 (0.0007) [2023-10-08 00:01:40,932][52060] Updated weights for policy 0, policy_version 4250 (0.0008) [2023-10-08 00:01:41,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 8749056. Throughput: 0: 1733.8, 1: 1709.0. Samples: 2190308. Policy #0 lag: (min: 31.0, avg: 31.1, max: 38.0) [2023-10-08 00:01:41,211][50642] Avg episode reward: [(0, '7.590'), (1, '10.780')] [2023-10-08 00:01:41,212][51710] Saving new best policy, reward=10.780! [2023-10-08 00:01:41,212][51605] Saving new best policy, reward=7.590! [2023-10-08 00:01:43,576][52059] Updated weights for policy 1, policy_version 4292 (0.0009) [2023-10-08 00:01:43,946][52059] Updated weights for policy 1, policy_version 4302 (0.0012) [2023-10-08 00:01:44,314][52059] Updated weights for policy 1, policy_version 4312 (0.0009) [2023-10-08 00:01:44,740][52060] Updated weights for policy 0, policy_version 4260 (0.0008) [2023-10-08 00:01:45,114][52060] Updated weights for policy 0, policy_version 4270 (0.0008) [2023-10-08 00:01:45,479][52060] Updated weights for policy 0, policy_version 4280 (0.0008) [2023-10-08 00:01:46,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 8814592. Throughput: 0: 1705.4, 1: 1715.2. Samples: 2210150. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 00:01:46,211][50642] Avg episode reward: [(0, '7.540'), (1, '9.290')] [2023-10-08 00:01:48,282][52059] Updated weights for policy 1, policy_version 4322 (0.0009) [2023-10-08 00:01:48,647][52059] Updated weights for policy 1, policy_version 4332 (0.0010) [2023-10-08 00:01:49,013][52059] Updated weights for policy 1, policy_version 4342 (0.0010) [2023-10-08 00:01:49,379][52059] Updated weights for policy 1, policy_version 4352 (0.0008) [2023-10-08 00:01:49,488][52060] Updated weights for policy 0, policy_version 4290 (0.0009) [2023-10-08 00:01:49,851][52060] Updated weights for policy 0, policy_version 4300 (0.0007) [2023-10-08 00:01:50,214][52060] Updated weights for policy 0, policy_version 4310 (0.0008) [2023-10-08 00:01:50,580][52060] Updated weights for policy 0, policy_version 4320 (0.0008) [2023-10-08 00:01:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 8880128. Throughput: 0: 1735.6, 1: 1725.5. Samples: 2221476. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 00:01:51,211][50642] Avg episode reward: [(0, '6.880'), (1, '10.910')] [2023-10-08 00:01:51,211][51710] Saving new best policy, reward=10.910! [2023-10-08 00:01:53,288][52059] Updated weights for policy 1, policy_version 4362 (0.0007) [2023-10-08 00:01:53,652][52059] Updated weights for policy 1, policy_version 4372 (0.0010) [2023-10-08 00:01:54,012][52059] Updated weights for policy 1, policy_version 4382 (0.0010) [2023-10-08 00:01:54,633][52060] Updated weights for policy 0, policy_version 4330 (0.0011) [2023-10-08 00:01:55,001][52060] Updated weights for policy 0, policy_version 4340 (0.0009) [2023-10-08 00:01:55,370][52060] Updated weights for policy 0, policy_version 4350 (0.0010) [2023-10-08 00:01:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 8945664. Throughput: 0: 1718.0, 1: 1712.4. Samples: 2241630. Policy #0 lag: (min: 31.0, avg: 36.8, max: 63.0) [2023-10-08 00:01:56,211][50642] Avg episode reward: [(0, '7.290'), (1, '9.630')] [2023-10-08 00:01:57,878][52059] Updated weights for policy 1, policy_version 4392 (0.0008) [2023-10-08 00:01:58,240][52059] Updated weights for policy 1, policy_version 4402 (0.0010) [2023-10-08 00:01:58,593][52059] Updated weights for policy 1, policy_version 4412 (0.0010) [2023-10-08 00:01:59,588][52060] Updated weights for policy 0, policy_version 4360 (0.0009) [2023-10-08 00:01:59,954][52060] Updated weights for policy 0, policy_version 4370 (0.0009) [2023-10-08 00:02:00,328][52060] Updated weights for policy 0, policy_version 4380 (0.0009) [2023-10-08 00:02:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 9011200. Throughput: 0: 1698.2, 1: 1738.6. Samples: 2262086. Policy #0 lag: (min: 31.0, avg: 36.8, max: 63.0) [2023-10-08 00:02:01,211][50642] Avg episode reward: [(0, '7.290'), (1, '10.680')] [2023-10-08 00:02:02,638][52059] Updated weights for policy 1, policy_version 4422 (0.0011) [2023-10-08 00:02:02,998][52059] Updated weights for policy 1, policy_version 4432 (0.0007) [2023-10-08 00:02:03,370][52059] Updated weights for policy 1, policy_version 4442 (0.0009) [2023-10-08 00:02:04,238][52060] Updated weights for policy 0, policy_version 4390 (0.0007) [2023-10-08 00:02:04,610][52060] Updated weights for policy 0, policy_version 4400 (0.0009) [2023-10-08 00:02:04,988][52060] Updated weights for policy 0, policy_version 4410 (0.0008) [2023-10-08 00:02:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 9076736. Throughput: 0: 1735.2, 1: 1705.5. Samples: 2273090. Policy #0 lag: (min: 31.0, avg: 31.2, max: 40.0) [2023-10-08 00:02:06,211][50642] Avg episode reward: [(0, '7.370'), (1, '9.560')] [2023-10-08 00:02:07,316][52059] Updated weights for policy 1, policy_version 4452 (0.0010) [2023-10-08 00:02:07,687][52059] Updated weights for policy 1, policy_version 4462 (0.0009) [2023-10-08 00:02:08,051][52059] Updated weights for policy 1, policy_version 4472 (0.0008) [2023-10-08 00:02:08,938][52060] Updated weights for policy 0, policy_version 4420 (0.0009) [2023-10-08 00:02:09,310][52060] Updated weights for policy 0, policy_version 4430 (0.0009) [2023-10-08 00:02:09,679][52060] Updated weights for policy 0, policy_version 4440 (0.0007) [2023-10-08 00:02:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 9142272. Throughput: 0: 1706.9, 1: 1724.7. Samples: 2293430. Policy #0 lag: (min: 31.0, avg: 31.2, max: 40.0) [2023-10-08 00:02:11,211][50642] Avg episode reward: [(0, '7.370'), (1, '10.350')] [2023-10-08 00:02:11,927][52059] Updated weights for policy 1, policy_version 4482 (0.0010) [2023-10-08 00:02:12,322][52059] Updated weights for policy 1, policy_version 4492 (0.0007) [2023-10-08 00:02:12,690][52059] Updated weights for policy 1, policy_version 4502 (0.0007) [2023-10-08 00:02:13,055][52059] Updated weights for policy 1, policy_version 4512 (0.0009) [2023-10-08 00:02:13,613][52060] Updated weights for policy 0, policy_version 4450 (0.0007) [2023-10-08 00:02:14,028][52060] Updated weights for policy 0, policy_version 4460 (0.0009) [2023-10-08 00:02:14,391][52060] Updated weights for policy 0, policy_version 4470 (0.0008) [2023-10-08 00:02:14,765][52060] Updated weights for policy 0, policy_version 4480 (0.0009) [2023-10-08 00:02:16,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 9207808. Throughput: 0: 1713.4, 1: 1733.5. Samples: 2314160. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) [2023-10-08 00:02:16,211][50642] Avg episode reward: [(0, '7.460'), (1, '10.650')] [2023-10-08 00:02:17,050][52059] Updated weights for policy 1, policy_version 4522 (0.0007) [2023-10-08 00:02:17,420][52059] Updated weights for policy 1, policy_version 4532 (0.0007) [2023-10-08 00:02:17,791][52059] Updated weights for policy 1, policy_version 4542 (0.0008) [2023-10-08 00:02:18,665][52060] Updated weights for policy 0, policy_version 4490 (0.0009) [2023-10-08 00:02:19,032][52060] Updated weights for policy 0, policy_version 4500 (0.0008) [2023-10-08 00:02:19,402][52060] Updated weights for policy 0, policy_version 4510 (0.0009) [2023-10-08 00:02:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 9273344. Throughput: 0: 1713.9, 1: 1709.2. Samples: 2324156. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) [2023-10-08 00:02:21,211][50642] Avg episode reward: [(0, '7.860'), (1, '10.860')] [2023-10-08 00:02:21,211][51605] Saving new best policy, reward=7.860! [2023-10-08 00:02:21,727][52059] Updated weights for policy 1, policy_version 4552 (0.0008) [2023-10-08 00:02:22,088][52059] Updated weights for policy 1, policy_version 4562 (0.0008) [2023-10-08 00:02:22,465][52059] Updated weights for policy 1, policy_version 4572 (0.0008) [2023-10-08 00:02:23,281][52060] Updated weights for policy 0, policy_version 4520 (0.0009) [2023-10-08 00:02:23,650][52060] Updated weights for policy 0, policy_version 4530 (0.0008) [2023-10-08 00:02:24,014][52060] Updated weights for policy 0, policy_version 4540 (0.0010) [2023-10-08 00:02:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 9338880. Throughput: 0: 1693.0, 1: 1737.3. Samples: 2344672. Policy #0 lag: (min: 31.0, avg: 41.1, max: 63.0) [2023-10-08 00:02:26,211][50642] Avg episode reward: [(0, '7.920'), (1, '10.840')] [2023-10-08 00:02:26,212][51605] Saving new best policy, reward=7.920! [2023-10-08 00:02:26,417][52059] Updated weights for policy 1, policy_version 4582 (0.0010) [2023-10-08 00:02:26,784][52059] Updated weights for policy 1, policy_version 4592 (0.0011) [2023-10-08 00:02:27,154][52059] Updated weights for policy 1, policy_version 4602 (0.0010) [2023-10-08 00:02:28,077][52060] Updated weights for policy 0, policy_version 4550 (0.0009) [2023-10-08 00:02:28,443][52060] Updated weights for policy 0, policy_version 4560 (0.0009) [2023-10-08 00:02:28,816][52060] Updated weights for policy 0, policy_version 4570 (0.0008) [2023-10-08 00:02:30,998][52059] Updated weights for policy 1, policy_version 4612 (0.0010) [2023-10-08 00:02:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 9404416. Throughput: 0: 1719.5, 1: 1740.2. Samples: 2365836. Policy #0 lag: (min: 31.0, avg: 41.1, max: 63.0) [2023-10-08 00:02:31,211][50642] Avg episode reward: [(0, '7.810'), (1, '10.130')] [2023-10-08 00:02:31,352][52059] Updated weights for policy 1, policy_version 4622 (0.0009) [2023-10-08 00:02:31,722][52059] Updated weights for policy 1, policy_version 4632 (0.0009) [2023-10-08 00:02:32,951][52060] Updated weights for policy 0, policy_version 4580 (0.0008) [2023-10-08 00:02:33,320][52060] Updated weights for policy 0, policy_version 4590 (0.0009) [2023-10-08 00:02:33,690][52060] Updated weights for policy 0, policy_version 4600 (0.0011) [2023-10-08 00:02:35,749][52059] Updated weights for policy 1, policy_version 4642 (0.0010) [2023-10-08 00:02:36,110][52059] Updated weights for policy 1, policy_version 4652 (0.0009) [2023-10-08 00:02:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 9469952. Throughput: 0: 1695.4, 1: 1725.6. Samples: 2375420. Policy #0 lag: (min: 10.0, avg: 11.5, max: 31.0) [2023-10-08 00:02:36,211][50642] Avg episode reward: [(0, '8.110'), (1, '10.680')] [2023-10-08 00:02:36,212][51605] Saving new best policy, reward=8.110! [2023-10-08 00:02:36,468][52059] Updated weights for policy 1, policy_version 4662 (0.0007) [2023-10-08 00:02:36,832][52059] Updated weights for policy 1, policy_version 4672 (0.0007) [2023-10-08 00:02:37,655][52060] Updated weights for policy 0, policy_version 4610 (0.0011) [2023-10-08 00:02:38,029][52060] Updated weights for policy 0, policy_version 4620 (0.0011) [2023-10-08 00:02:38,396][52060] Updated weights for policy 0, policy_version 4630 (0.0009) [2023-10-08 00:02:38,764][52060] Updated weights for policy 0, policy_version 4640 (0.0008) [2023-10-08 00:02:40,654][52059] Updated weights for policy 1, policy_version 4682 (0.0008) [2023-10-08 00:02:41,030][52059] Updated weights for policy 1, policy_version 4692 (0.0009) [2023-10-08 00:02:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13551.5). Total num frames: 9535488. Throughput: 0: 1705.6, 1: 1740.5. Samples: 2396702. Policy #0 lag: (min: 10.0, avg: 11.5, max: 31.0) [2023-10-08 00:02:41,211][50642] Avg episode reward: [(0, '8.300'), (1, '10.520')] [2023-10-08 00:02:41,211][51605] Saving new best policy, reward=8.300! [2023-10-08 00:02:41,398][52059] Updated weights for policy 1, policy_version 4702 (0.0007) [2023-10-08 00:02:42,624][52060] Updated weights for policy 0, policy_version 4650 (0.0011) [2023-10-08 00:02:43,008][52060] Updated weights for policy 0, policy_version 4660 (0.0012) [2023-10-08 00:02:43,374][52060] Updated weights for policy 0, policy_version 4670 (0.0009) [2023-10-08 00:02:45,332][52059] Updated weights for policy 1, policy_version 4712 (0.0008) [2023-10-08 00:02:45,697][52059] Updated weights for policy 1, policy_version 4722 (0.0010) [2023-10-08 00:02:46,060][52059] Updated weights for policy 1, policy_version 4732 (0.0009) [2023-10-08 00:02:46,216][50642] Fps is (10 sec: 16375.4, 60 sec: 13652.2, 300 sec: 13662.3). Total num frames: 9633792. Throughput: 0: 1721.7, 1: 1713.1. Samples: 2416670. Policy #0 lag: (min: 31.0, avg: 31.4, max: 46.0) [2023-10-08 00:02:46,217][50642] Avg episode reward: [(0, '8.290'), (1, '10.590')] [2023-10-08 00:02:46,227][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000004736_4849664.pth... [2023-10-08 00:02:46,227][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000004672_4784128.pth... [2023-10-08 00:02:46,258][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000003104_3178496.pth [2023-10-08 00:02:46,264][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000003072_3145728.pth [2023-10-08 00:02:47,456][52060] Updated weights for policy 0, policy_version 4680 (0.0007) [2023-10-08 00:02:47,833][52060] Updated weights for policy 0, policy_version 4690 (0.0009) [2023-10-08 00:02:48,205][52060] Updated weights for policy 0, policy_version 4700 (0.0008) [2023-10-08 00:02:50,024][52059] Updated weights for policy 1, policy_version 4742 (0.0008) [2023-10-08 00:02:50,383][52059] Updated weights for policy 1, policy_version 4752 (0.0008) [2023-10-08 00:02:50,745][52059] Updated weights for policy 1, policy_version 4762 (0.0007) [2023-10-08 00:02:51,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 9699328. Throughput: 0: 1681.1, 1: 1735.6. Samples: 2426840. Policy #0 lag: (min: 31.0, avg: 31.4, max: 46.0) [2023-10-08 00:02:51,211][50642] Avg episode reward: [(0, '8.250'), (1, '10.720')] [2023-10-08 00:02:52,219][52060] Updated weights for policy 0, policy_version 4710 (0.0008) [2023-10-08 00:02:52,586][52060] Updated weights for policy 0, policy_version 4720 (0.0008) [2023-10-08 00:02:52,956][52060] Updated weights for policy 0, policy_version 4730 (0.0007) [2023-10-08 00:02:54,650][52059] Updated weights for policy 1, policy_version 4772 (0.0007) [2023-10-08 00:02:55,016][52059] Updated weights for policy 1, policy_version 4782 (0.0007) [2023-10-08 00:02:55,380][52059] Updated weights for policy 1, policy_version 4792 (0.0008) [2023-10-08 00:02:56,210][50642] Fps is (10 sec: 13114.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 9764864. Throughput: 0: 1701.8, 1: 1727.3. Samples: 2447742. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) [2023-10-08 00:02:56,211][50642] Avg episode reward: [(0, '8.170'), (1, '10.070')] [2023-10-08 00:02:56,998][52060] Updated weights for policy 0, policy_version 4740 (0.0008) [2023-10-08 00:02:57,366][52060] Updated weights for policy 0, policy_version 4750 (0.0008) [2023-10-08 00:02:57,748][52060] Updated weights for policy 0, policy_version 4760 (0.0008) [2023-10-08 00:02:59,390][52059] Updated weights for policy 1, policy_version 4802 (0.0009) [2023-10-08 00:02:59,760][52059] Updated weights for policy 1, policy_version 4812 (0.0007) [2023-10-08 00:03:00,132][52059] Updated weights for policy 1, policy_version 4822 (0.0008) [2023-10-08 00:03:00,498][52059] Updated weights for policy 1, policy_version 4832 (0.0009) [2023-10-08 00:03:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 9830400. Throughput: 0: 1706.0, 1: 1710.4. Samples: 2467898. Policy #0 lag: (min: 31.0, avg: 38.4, max: 63.0) [2023-10-08 00:03:01,211][50642] Avg episode reward: [(0, '7.960'), (1, '10.910')] [2023-10-08 00:03:01,817][52060] Updated weights for policy 0, policy_version 4770 (0.0009) [2023-10-08 00:03:02,215][52060] Updated weights for policy 0, policy_version 4780 (0.0011) [2023-10-08 00:03:02,587][52060] Updated weights for policy 0, policy_version 4790 (0.0009) [2023-10-08 00:03:02,956][52060] Updated weights for policy 0, policy_version 4800 (0.0009) [2023-10-08 00:03:04,429][52059] Updated weights for policy 1, policy_version 4842 (0.0008) [2023-10-08 00:03:04,797][52059] Updated weights for policy 1, policy_version 4852 (0.0007) [2023-10-08 00:03:05,170][52059] Updated weights for policy 1, policy_version 4862 (0.0011) [2023-10-08 00:03:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 9895936. Throughput: 0: 1684.8, 1: 1741.1. Samples: 2478320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 00:03:06,211][50642] Avg episode reward: [(0, '8.270'), (1, '10.280')] [2023-10-08 00:03:06,977][52060] Updated weights for policy 0, policy_version 4810 (0.0008) [2023-10-08 00:03:07,345][52060] Updated weights for policy 0, policy_version 4820 (0.0010) [2023-10-08 00:03:07,713][52060] Updated weights for policy 0, policy_version 4830 (0.0007) [2023-10-08 00:03:09,057][52059] Updated weights for policy 1, policy_version 4872 (0.0011) [2023-10-08 00:03:09,427][52059] Updated weights for policy 1, policy_version 4882 (0.0009) [2023-10-08 00:03:09,792][52059] Updated weights for policy 1, policy_version 4892 (0.0009) [2023-10-08 00:03:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 9961472. Throughput: 0: 1703.2, 1: 1717.1. Samples: 2498586. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 00:03:11,211][50642] Avg episode reward: [(0, '8.620'), (1, '11.390')] [2023-10-08 00:03:11,212][51710] Saving new best policy, reward=11.390! [2023-10-08 00:03:11,565][52060] Updated weights for policy 0, policy_version 4840 (0.0008) [2023-10-08 00:03:11,950][52060] Updated weights for policy 0, policy_version 4850 (0.0010) [2023-10-08 00:03:12,329][52060] Updated weights for policy 0, policy_version 4860 (0.0008) [2023-10-08 00:03:12,471][51605] Saving new best policy, reward=8.620! [2023-10-08 00:03:13,655][52059] Updated weights for policy 1, policy_version 4902 (0.0007) [2023-10-08 00:03:14,019][52059] Updated weights for policy 1, policy_version 4912 (0.0011) [2023-10-08 00:03:14,392][52059] Updated weights for policy 1, policy_version 4922 (0.0009) [2023-10-08 00:03:16,169][52060] Updated weights for policy 0, policy_version 4870 (0.0009) [2023-10-08 00:03:16,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 10027008. Throughput: 0: 1708.5, 1: 1710.4. Samples: 2519690. Policy #0 lag: (min: 12.0, avg: 15.0, max: 44.0) [2023-10-08 00:03:16,211][50642] Avg episode reward: [(0, '8.760'), (1, '10.330')] [2023-10-08 00:03:16,547][52060] Updated weights for policy 0, policy_version 4880 (0.0009) [2023-10-08 00:03:16,917][52060] Updated weights for policy 0, policy_version 4890 (0.0009) [2023-10-08 00:03:17,134][51605] Saving new best policy, reward=8.760! [2023-10-08 00:03:18,432][52059] Updated weights for policy 1, policy_version 4932 (0.0010) [2023-10-08 00:03:18,794][52059] Updated weights for policy 1, policy_version 4942 (0.0008) [2023-10-08 00:03:19,159][52059] Updated weights for policy 1, policy_version 4952 (0.0010) [2023-10-08 00:03:20,930][52060] Updated weights for policy 0, policy_version 4900 (0.0010) [2023-10-08 00:03:21,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.2, 300 sec: 13662.6). Total num frames: 10092544. Throughput: 0: 1700.8, 1: 1727.3. Samples: 2529682. Policy #0 lag: (min: 12.0, avg: 15.0, max: 44.0) [2023-10-08 00:03:21,211][50642] Avg episode reward: [(0, '8.590'), (1, '10.880')] [2023-10-08 00:03:21,302][52060] Updated weights for policy 0, policy_version 4910 (0.0007) [2023-10-08 00:03:21,663][52060] Updated weights for policy 0, policy_version 4920 (0.0009) [2023-10-08 00:03:23,241][52059] Updated weights for policy 1, policy_version 4962 (0.0009) [2023-10-08 00:03:23,611][52059] Updated weights for policy 1, policy_version 4972 (0.0007) [2023-10-08 00:03:23,975][52059] Updated weights for policy 1, policy_version 4982 (0.0008) [2023-10-08 00:03:24,344][52059] Updated weights for policy 1, policy_version 4992 (0.0007) [2023-10-08 00:03:25,681][52060] Updated weights for policy 0, policy_version 4930 (0.0009) [2023-10-08 00:03:26,051][52060] Updated weights for policy 0, policy_version 4940 (0.0008) [2023-10-08 00:03:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 10158080. Throughput: 0: 1706.3, 1: 1705.6. Samples: 2550236. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:03:26,211][50642] Avg episode reward: [(0, '8.860'), (1, '10.470')] [2023-10-08 00:03:26,422][52060] Updated weights for policy 0, policy_version 4950 (0.0010) [2023-10-08 00:03:26,789][51605] Saving new best policy, reward=8.860! [2023-10-08 00:03:26,790][52060] Updated weights for policy 0, policy_version 4960 (0.0008) [2023-10-08 00:03:28,235][52059] Updated weights for policy 1, policy_version 5002 (0.0008) [2023-10-08 00:03:28,598][52059] Updated weights for policy 1, policy_version 5012 (0.0007) [2023-10-08 00:03:28,965][52059] Updated weights for policy 1, policy_version 5022 (0.0007) [2023-10-08 00:03:30,616][52060] Updated weights for policy 0, policy_version 4970 (0.0009) [2023-10-08 00:03:30,989][52060] Updated weights for policy 0, policy_version 4980 (0.0008) [2023-10-08 00:03:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 10223616. Throughput: 0: 1700.8, 1: 1733.4. Samples: 2571188. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:03:31,211][50642] Avg episode reward: [(0, '8.790'), (1, '10.960')] [2023-10-08 00:03:31,350][52060] Updated weights for policy 0, policy_version 4990 (0.0010) [2023-10-08 00:03:32,829][52059] Updated weights for policy 1, policy_version 5032 (0.0010) [2023-10-08 00:03:33,210][52059] Updated weights for policy 1, policy_version 5042 (0.0010) [2023-10-08 00:03:33,585][52059] Updated weights for policy 1, policy_version 5052 (0.0010) [2023-10-08 00:03:35,328][52060] Updated weights for policy 0, policy_version 5000 (0.0009) [2023-10-08 00:03:35,705][52060] Updated weights for policy 0, policy_version 5010 (0.0009) [2023-10-08 00:03:36,072][52060] Updated weights for policy 0, policy_version 5020 (0.0008) [2023-10-08 00:03:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 10289152. Throughput: 0: 1718.0, 1: 1709.3. Samples: 2581072. Policy #0 lag: (min: 8.0, avg: 31.0, max: 40.0) [2023-10-08 00:03:36,211][50642] Avg episode reward: [(0, '8.960'), (1, '10.760')] [2023-10-08 00:03:36,221][51605] Saving new best policy, reward=8.960! [2023-10-08 00:03:37,484][52059] Updated weights for policy 1, policy_version 5062 (0.0008) [2023-10-08 00:03:37,847][52059] Updated weights for policy 1, policy_version 5072 (0.0008) [2023-10-08 00:03:38,213][52059] Updated weights for policy 1, policy_version 5082 (0.0011) [2023-10-08 00:03:40,110][52060] Updated weights for policy 0, policy_version 5030 (0.0008) [2023-10-08 00:03:40,474][52060] Updated weights for policy 0, policy_version 5040 (0.0007) [2023-10-08 00:03:40,850][52060] Updated weights for policy 0, policy_version 5050 (0.0008) [2023-10-08 00:03:41,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 10387456. Throughput: 0: 1725.2, 1: 1717.3. Samples: 2602656. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 00:03:41,211][50642] Avg episode reward: [(0, '8.740'), (1, '11.040')] [2023-10-08 00:03:42,211][52059] Updated weights for policy 1, policy_version 5092 (0.0008) [2023-10-08 00:03:42,574][52059] Updated weights for policy 1, policy_version 5102 (0.0008) [2023-10-08 00:03:42,940][52059] Updated weights for policy 1, policy_version 5112 (0.0008) [2023-10-08 00:03:44,693][52060] Updated weights for policy 0, policy_version 5060 (0.0009) [2023-10-08 00:03:45,062][52060] Updated weights for policy 0, policy_version 5070 (0.0009) [2023-10-08 00:03:45,432][52060] Updated weights for policy 0, policy_version 5080 (0.0010) [2023-10-08 00:03:46,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13654.5, 300 sec: 13773.7). Total num frames: 10452992. Throughput: 0: 1694.1, 1: 1746.5. Samples: 2622724. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 00:03:46,211][50642] Avg episode reward: [(0, '8.810'), (1, '10.880')] [2023-10-08 00:03:46,829][52059] Updated weights for policy 1, policy_version 5122 (0.0008) [2023-10-08 00:03:47,236][52059] Updated weights for policy 1, policy_version 5132 (0.0010) [2023-10-08 00:03:47,595][52059] Updated weights for policy 1, policy_version 5142 (0.0008) [2023-10-08 00:03:47,964][52059] Updated weights for policy 1, policy_version 5152 (0.0008) [2023-10-08 00:03:49,606][52060] Updated weights for policy 0, policy_version 5090 (0.0009) [2023-10-08 00:03:49,986][52060] Updated weights for policy 0, policy_version 5100 (0.0009) [2023-10-08 00:03:50,364][52060] Updated weights for policy 0, policy_version 5110 (0.0011) [2023-10-08 00:03:50,731][52060] Updated weights for policy 0, policy_version 5120 (0.0010) [2023-10-08 00:03:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 10518528. Throughput: 0: 1728.6, 1: 1710.6. Samples: 2633084. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:03:51,211][50642] Avg episode reward: [(0, '8.990'), (1, '11.300')] [2023-10-08 00:03:51,212][51605] Saving new best policy, reward=8.990! [2023-10-08 00:03:51,781][52059] Updated weights for policy 1, policy_version 5162 (0.0009) [2023-10-08 00:03:52,145][52059] Updated weights for policy 1, policy_version 5172 (0.0009) [2023-10-08 00:03:52,519][52059] Updated weights for policy 1, policy_version 5182 (0.0008) [2023-10-08 00:03:54,625][52060] Updated weights for policy 0, policy_version 5130 (0.0009) [2023-10-08 00:03:54,993][52060] Updated weights for policy 0, policy_version 5140 (0.0008) [2023-10-08 00:03:55,360][52060] Updated weights for policy 0, policy_version 5150 (0.0008) [2023-10-08 00:03:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 10584064. Throughput: 0: 1711.6, 1: 1736.1. Samples: 2653736. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:03:56,211][50642] Avg episode reward: [(0, '8.850'), (1, '10.310')] [2023-10-08 00:03:56,371][52059] Updated weights for policy 1, policy_version 5192 (0.0010) [2023-10-08 00:03:56,738][52059] Updated weights for policy 1, policy_version 5202 (0.0007) [2023-10-08 00:03:57,116][52059] Updated weights for policy 1, policy_version 5212 (0.0008) [2023-10-08 00:03:59,316][52060] Updated weights for policy 0, policy_version 5160 (0.0009) [2023-10-08 00:03:59,688][52060] Updated weights for policy 0, policy_version 5170 (0.0007) [2023-10-08 00:04:00,066][52060] Updated weights for policy 0, policy_version 5180 (0.0008) [2023-10-08 00:04:01,059][52059] Updated weights for policy 1, policy_version 5222 (0.0011) [2023-10-08 00:04:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 10649600. Throughput: 0: 1696.0, 1: 1742.3. Samples: 2674414. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) [2023-10-08 00:04:01,211][50642] Avg episode reward: [(0, '8.980'), (1, '10.810')] [2023-10-08 00:04:01,425][52059] Updated weights for policy 1, policy_version 5232 (0.0010) [2023-10-08 00:04:01,793][52059] Updated weights for policy 1, policy_version 5242 (0.0008) [2023-10-08 00:04:04,234][52060] Updated weights for policy 0, policy_version 5190 (0.0010) [2023-10-08 00:04:04,607][52060] Updated weights for policy 0, policy_version 5200 (0.0008) [2023-10-08 00:04:04,990][52060] Updated weights for policy 0, policy_version 5210 (0.0011) [2023-10-08 00:04:05,670][52059] Updated weights for policy 1, policy_version 5252 (0.0009) [2023-10-08 00:04:06,039][52059] Updated weights for policy 1, policy_version 5262 (0.0010) [2023-10-08 00:04:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 10715136. Throughput: 0: 1727.2, 1: 1729.3. Samples: 2685224. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) [2023-10-08 00:04:06,211][50642] Avg episode reward: [(0, '9.650'), (1, '10.530')] [2023-10-08 00:04:06,211][51605] Saving new best policy, reward=9.650! [2023-10-08 00:04:06,393][52059] Updated weights for policy 1, policy_version 5272 (0.0009) [2023-10-08 00:04:08,969][52060] Updated weights for policy 0, policy_version 5220 (0.0009) [2023-10-08 00:04:09,343][52060] Updated weights for policy 0, policy_version 5230 (0.0007) [2023-10-08 00:04:09,713][52060] Updated weights for policy 0, policy_version 5240 (0.0009) [2023-10-08 00:04:10,346][52059] Updated weights for policy 1, policy_version 5282 (0.0010) [2023-10-08 00:04:10,705][52059] Updated weights for policy 1, policy_version 5292 (0.0008) [2023-10-08 00:04:11,070][52059] Updated weights for policy 1, policy_version 5302 (0.0009) [2023-10-08 00:04:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 10780672. Throughput: 0: 1695.4, 1: 1750.6. Samples: 2705306. Policy #0 lag: (min: 1.0, avg: 19.5, max: 33.0) [2023-10-08 00:04:11,211][50642] Avg episode reward: [(0, '8.520'), (1, '10.930')] [2023-10-08 00:04:11,441][52059] Updated weights for policy 1, policy_version 5312 (0.0011) [2023-10-08 00:04:13,587][52060] Updated weights for policy 0, policy_version 5250 (0.0008) [2023-10-08 00:04:13,962][52060] Updated weights for policy 0, policy_version 5260 (0.0007) [2023-10-08 00:04:14,333][52060] Updated weights for policy 0, policy_version 5270 (0.0007) [2023-10-08 00:04:14,702][52060] Updated weights for policy 0, policy_version 5280 (0.0007) [2023-10-08 00:04:15,260][52059] Updated weights for policy 1, policy_version 5322 (0.0009) [2023-10-08 00:04:15,629][52059] Updated weights for policy 1, policy_version 5332 (0.0010) [2023-10-08 00:04:15,993][52059] Updated weights for policy 1, policy_version 5342 (0.0010) [2023-10-08 00:04:16,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 10878976. Throughput: 0: 1701.7, 1: 1729.0. Samples: 2725570. Policy #0 lag: (min: 1.0, avg: 19.5, max: 33.0) [2023-10-08 00:04:16,211][50642] Avg episode reward: [(0, '8.860'), (1, '10.580')] [2023-10-08 00:04:18,699][52060] Updated weights for policy 0, policy_version 5290 (0.0008) [2023-10-08 00:04:19,066][52060] Updated weights for policy 0, policy_version 5300 (0.0009) [2023-10-08 00:04:19,442][52060] Updated weights for policy 0, policy_version 5310 (0.0007) [2023-10-08 00:04:20,053][52059] Updated weights for policy 1, policy_version 5352 (0.0008) [2023-10-08 00:04:20,419][52059] Updated weights for policy 1, policy_version 5362 (0.0008) [2023-10-08 00:04:20,789][52059] Updated weights for policy 1, policy_version 5372 (0.0010) [2023-10-08 00:04:21,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 10944512. Throughput: 0: 1704.7, 1: 1751.5. Samples: 2736602. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:04:21,211][50642] Avg episode reward: [(0, '8.550'), (1, '10.760')] [2023-10-08 00:04:23,460][52060] Updated weights for policy 0, policy_version 5320 (0.0009) [2023-10-08 00:04:23,824][52060] Updated weights for policy 0, policy_version 5330 (0.0008) [2023-10-08 00:04:24,198][52060] Updated weights for policy 0, policy_version 5340 (0.0007) [2023-10-08 00:04:24,715][52059] Updated weights for policy 1, policy_version 5382 (0.0009) [2023-10-08 00:04:25,086][52059] Updated weights for policy 1, policy_version 5392 (0.0009) [2023-10-08 00:04:25,448][52059] Updated weights for policy 1, policy_version 5402 (0.0009) [2023-10-08 00:04:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 11010048. Throughput: 0: 1680.3, 1: 1743.2. Samples: 2756716. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:04:26,211][50642] Avg episode reward: [(0, '9.220'), (1, '11.160')] [2023-10-08 00:04:28,204][52060] Updated weights for policy 0, policy_version 5350 (0.0008) [2023-10-08 00:04:28,573][52060] Updated weights for policy 0, policy_version 5360 (0.0007) [2023-10-08 00:04:28,945][52060] Updated weights for policy 0, policy_version 5370 (0.0009) [2023-10-08 00:04:29,392][52059] Updated weights for policy 1, policy_version 5412 (0.0008) [2023-10-08 00:04:29,760][52059] Updated weights for policy 1, policy_version 5422 (0.0007) [2023-10-08 00:04:30,122][52059] Updated weights for policy 1, policy_version 5432 (0.0009) [2023-10-08 00:04:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 11075584. Throughput: 0: 1715.2, 1: 1720.3. Samples: 2777324. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-08 00:04:31,211][50642] Avg episode reward: [(0, '9.160'), (1, '10.890')] [2023-10-08 00:04:33,061][52060] Updated weights for policy 0, policy_version 5380 (0.0008) [2023-10-08 00:04:33,427][52060] Updated weights for policy 0, policy_version 5390 (0.0009) [2023-10-08 00:04:33,811][52060] Updated weights for policy 0, policy_version 5400 (0.0008) [2023-10-08 00:04:34,059][52059] Updated weights for policy 1, policy_version 5442 (0.0008) [2023-10-08 00:04:34,436][52059] Updated weights for policy 1, policy_version 5452 (0.0010) [2023-10-08 00:04:34,801][52059] Updated weights for policy 1, policy_version 5462 (0.0009) [2023-10-08 00:04:35,169][52059] Updated weights for policy 1, policy_version 5472 (0.0007) [2023-10-08 00:04:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 11141120. Throughput: 0: 1694.3, 1: 1756.9. Samples: 2788390. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-08 00:04:36,211][50642] Avg episode reward: [(0, '9.500'), (1, '11.150')] [2023-10-08 00:04:37,770][52060] Updated weights for policy 0, policy_version 5410 (0.0009) [2023-10-08 00:04:38,167][52060] Updated weights for policy 0, policy_version 5420 (0.0009) [2023-10-08 00:04:38,532][52060] Updated weights for policy 0, policy_version 5430 (0.0007) [2023-10-08 00:04:38,907][52060] Updated weights for policy 0, policy_version 5440 (0.0007) [2023-10-08 00:04:38,932][52059] Updated weights for policy 1, policy_version 5482 (0.0008) [2023-10-08 00:04:39,296][52059] Updated weights for policy 1, policy_version 5492 (0.0010) [2023-10-08 00:04:39,666][52059] Updated weights for policy 1, policy_version 5502 (0.0007) [2023-10-08 00:04:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 11206656. Throughput: 0: 1701.9, 1: 1728.4. Samples: 2808100. Policy #0 lag: (min: 2.0, avg: 2.4, max: 12.0) [2023-10-08 00:04:41,211][50642] Avg episode reward: [(0, '9.230'), (1, '11.220')] [2023-10-08 00:04:42,751][52060] Updated weights for policy 0, policy_version 5450 (0.0007) [2023-10-08 00:04:43,125][52060] Updated weights for policy 0, policy_version 5460 (0.0008) [2023-10-08 00:04:43,493][52060] Updated weights for policy 0, policy_version 5470 (0.0007) [2023-10-08 00:04:43,682][52059] Updated weights for policy 1, policy_version 5512 (0.0009) [2023-10-08 00:04:44,045][52059] Updated weights for policy 1, policy_version 5522 (0.0007) [2023-10-08 00:04:44,405][52059] Updated weights for policy 1, policy_version 5532 (0.0009) [2023-10-08 00:04:46,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 11272192. Throughput: 0: 1717.1, 1: 1726.3. Samples: 2829366. Policy #0 lag: (min: 2.0, avg: 2.4, max: 12.0) [2023-10-08 00:04:46,212][50642] Avg episode reward: [(0, '9.080'), (1, '10.830')] [2023-10-08 00:04:46,224][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000005472_5603328.pth... [2023-10-08 00:04:46,224][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000005536_5668864.pth... [2023-10-08 00:04:46,258][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000003936_4030464.pth [2023-10-08 00:04:46,263][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000003872_3964928.pth [2023-10-08 00:04:47,562][52060] Updated weights for policy 0, policy_version 5480 (0.0010) [2023-10-08 00:04:47,937][52060] Updated weights for policy 0, policy_version 5490 (0.0009) [2023-10-08 00:04:48,304][52060] Updated weights for policy 0, policy_version 5500 (0.0010) [2023-10-08 00:04:48,385][52059] Updated weights for policy 1, policy_version 5542 (0.0009) [2023-10-08 00:04:48,756][52059] Updated weights for policy 1, policy_version 5552 (0.0009) [2023-10-08 00:04:49,121][52059] Updated weights for policy 1, policy_version 5562 (0.0009) [2023-10-08 00:04:51,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 11337728. Throughput: 0: 1685.7, 1: 1735.6. Samples: 2839182. Policy #0 lag: (min: 17.0, avg: 32.8, max: 49.0) [2023-10-08 00:04:51,211][50642] Avg episode reward: [(0, '9.460'), (1, '10.950')] [2023-10-08 00:04:52,419][52060] Updated weights for policy 0, policy_version 5510 (0.0009) [2023-10-08 00:04:52,792][52060] Updated weights for policy 0, policy_version 5520 (0.0009) [2023-10-08 00:04:53,095][52059] Updated weights for policy 1, policy_version 5572 (0.0008) [2023-10-08 00:04:53,166][52060] Updated weights for policy 0, policy_version 5530 (0.0008) [2023-10-08 00:04:53,469][52059] Updated weights for policy 1, policy_version 5582 (0.0008) [2023-10-08 00:04:53,834][52059] Updated weights for policy 1, policy_version 5592 (0.0008) [2023-10-08 00:04:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 11403264. Throughput: 0: 1711.5, 1: 1721.0. Samples: 2859766. Policy #0 lag: (min: 17.0, avg: 32.8, max: 49.0) [2023-10-08 00:04:56,211][50642] Avg episode reward: [(0, '9.280'), (1, '11.650')] [2023-10-08 00:04:56,213][51710] Saving new best policy, reward=11.650! [2023-10-08 00:04:57,136][52060] Updated weights for policy 0, policy_version 5540 (0.0009) [2023-10-08 00:04:57,502][52060] Updated weights for policy 0, policy_version 5550 (0.0009) [2023-10-08 00:04:57,587][52059] Updated weights for policy 1, policy_version 5602 (0.0009) [2023-10-08 00:04:57,873][52060] Updated weights for policy 0, policy_version 5560 (0.0007) [2023-10-08 00:04:57,945][52059] Updated weights for policy 1, policy_version 5612 (0.0009) [2023-10-08 00:04:58,311][52059] Updated weights for policy 1, policy_version 5622 (0.0010) [2023-10-08 00:04:58,679][52059] Updated weights for policy 1, policy_version 5632 (0.0009) [2023-10-08 00:05:01,211][50642] Fps is (10 sec: 13106.7, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 11468800. Throughput: 0: 1714.7, 1: 1741.0. Samples: 2881078. Policy #0 lag: (min: 17.0, avg: 21.6, max: 49.0) [2023-10-08 00:05:01,212][50642] Avg episode reward: [(0, '9.650'), (1, '10.960')] [2023-10-08 00:05:01,819][52060] Updated weights for policy 0, policy_version 5570 (0.0009) [2023-10-08 00:05:02,174][52060] Updated weights for policy 0, policy_version 5580 (0.0009) [2023-10-08 00:05:02,550][52060] Updated weights for policy 0, policy_version 5590 (0.0009) [2023-10-08 00:05:02,605][52059] Updated weights for policy 1, policy_version 5642 (0.0007) [2023-10-08 00:05:02,921][52060] Updated weights for policy 0, policy_version 5600 (0.0008) [2023-10-08 00:05:02,968][52059] Updated weights for policy 1, policy_version 5652 (0.0007) [2023-10-08 00:05:03,333][52059] Updated weights for policy 1, policy_version 5662 (0.0010) [2023-10-08 00:05:06,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 11534336. Throughput: 0: 1700.3, 1: 1718.8. Samples: 2890460. Policy #0 lag: (min: 17.0, avg: 21.6, max: 49.0) [2023-10-08 00:05:06,211][50642] Avg episode reward: [(0, '9.840'), (1, '12.010')] [2023-10-08 00:05:06,212][51605] Saving new best policy, reward=9.840! [2023-10-08 00:05:06,212][51710] Saving new best policy, reward=12.010! [2023-10-08 00:05:06,853][52060] Updated weights for policy 0, policy_version 5610 (0.0007) [2023-10-08 00:05:07,200][52059] Updated weights for policy 1, policy_version 5672 (0.0008) [2023-10-08 00:05:07,223][52060] Updated weights for policy 0, policy_version 5620 (0.0008) [2023-10-08 00:05:07,563][52059] Updated weights for policy 1, policy_version 5682 (0.0007) [2023-10-08 00:05:07,590][52060] Updated weights for policy 0, policy_version 5630 (0.0007) [2023-10-08 00:05:07,936][52059] Updated weights for policy 1, policy_version 5692 (0.0008) [2023-10-08 00:05:11,210][50642] Fps is (10 sec: 13107.8, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 11599872. Throughput: 0: 1719.2, 1: 1727.9. Samples: 2911834. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) [2023-10-08 00:05:11,211][50642] Avg episode reward: [(0, '9.000'), (1, '11.140')] [2023-10-08 00:05:11,558][52060] Updated weights for policy 0, policy_version 5640 (0.0009) [2023-10-08 00:05:11,932][52060] Updated weights for policy 0, policy_version 5650 (0.0007) [2023-10-08 00:05:11,956][52059] Updated weights for policy 1, policy_version 5702 (0.0010) [2023-10-08 00:05:12,289][52060] Updated weights for policy 0, policy_version 5660 (0.0007) [2023-10-08 00:05:12,318][52059] Updated weights for policy 1, policy_version 5712 (0.0008) [2023-10-08 00:05:12,690][52059] Updated weights for policy 1, policy_version 5722 (0.0009) [2023-10-08 00:05:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 11665408. Throughput: 0: 1711.5, 1: 1747.5. Samples: 2932978. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) [2023-10-08 00:05:16,211][50642] Avg episode reward: [(0, '9.910'), (1, '11.330')] [2023-10-08 00:05:16,380][52060] Updated weights for policy 0, policy_version 5670 (0.0010) [2023-10-08 00:05:16,585][52059] Updated weights for policy 1, policy_version 5732 (0.0009) [2023-10-08 00:05:16,756][52060] Updated weights for policy 0, policy_version 5680 (0.0010) [2023-10-08 00:05:16,955][52059] Updated weights for policy 1, policy_version 5742 (0.0009) [2023-10-08 00:05:17,113][52060] Updated weights for policy 0, policy_version 5690 (0.0009) [2023-10-08 00:05:17,326][52059] Updated weights for policy 1, policy_version 5752 (0.0009) [2023-10-08 00:05:17,338][51605] Saving new best policy, reward=9.910! [2023-10-08 00:05:21,100][52060] Updated weights for policy 0, policy_version 5700 (0.0008) [2023-10-08 00:05:21,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 11730944. Throughput: 0: 1705.9, 1: 1711.9. Samples: 2942188. Policy #0 lag: (min: 31.0, avg: 31.6, max: 48.0) [2023-10-08 00:05:21,211][50642] Avg episode reward: [(0, '9.450'), (1, '11.100')] [2023-10-08 00:05:21,313][52059] Updated weights for policy 1, policy_version 5762 (0.0008) [2023-10-08 00:05:21,467][52060] Updated weights for policy 0, policy_version 5710 (0.0008) [2023-10-08 00:05:21,684][52059] Updated weights for policy 1, policy_version 5772 (0.0007) [2023-10-08 00:05:21,845][52060] Updated weights for policy 0, policy_version 5720 (0.0008) [2023-10-08 00:05:22,042][52059] Updated weights for policy 1, policy_version 5782 (0.0007) [2023-10-08 00:05:22,407][52059] Updated weights for policy 1, policy_version 5792 (0.0007) [2023-10-08 00:05:25,898][52060] Updated weights for policy 0, policy_version 5730 (0.0007) [2023-10-08 00:05:26,132][52059] Updated weights for policy 1, policy_version 5802 (0.0007) [2023-10-08 00:05:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 11796480. Throughput: 0: 1709.9, 1: 1743.5. Samples: 2963502. Policy #0 lag: (min: 31.0, avg: 31.6, max: 48.0) [2023-10-08 00:05:26,211][50642] Avg episode reward: [(0, '9.500'), (1, '11.340')] [2023-10-08 00:05:26,277][52060] Updated weights for policy 0, policy_version 5740 (0.0009) [2023-10-08 00:05:26,499][52059] Updated weights for policy 1, policy_version 5812 (0.0007) [2023-10-08 00:05:26,644][52060] Updated weights for policy 0, policy_version 5750 (0.0008) [2023-10-08 00:05:26,861][52059] Updated weights for policy 1, policy_version 5822 (0.0008) [2023-10-08 00:05:27,009][52060] Updated weights for policy 0, policy_version 5760 (0.0007) [2023-10-08 00:05:30,763][52059] Updated weights for policy 1, policy_version 5832 (0.0007) [2023-10-08 00:05:31,114][52060] Updated weights for policy 0, policy_version 5770 (0.0008) [2023-10-08 00:05:31,124][52059] Updated weights for policy 1, policy_version 5842 (0.0007) [2023-10-08 00:05:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 11862016. Throughput: 0: 1698.4, 1: 1737.0. Samples: 2983956. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:05:31,211][50642] Avg episode reward: [(0, '9.200'), (1, '11.520')] [2023-10-08 00:05:31,487][52059] Updated weights for policy 1, policy_version 5852 (0.0007) [2023-10-08 00:05:31,491][52060] Updated weights for policy 0, policy_version 5780 (0.0008) [2023-10-08 00:05:31,855][52060] Updated weights for policy 0, policy_version 5790 (0.0009) [2023-10-08 00:05:35,488][52059] Updated weights for policy 1, policy_version 5862 (0.0008) [2023-10-08 00:05:35,799][52060] Updated weights for policy 0, policy_version 5800 (0.0008) [2023-10-08 00:05:35,854][52059] Updated weights for policy 1, policy_version 5872 (0.0007) [2023-10-08 00:05:36,171][52060] Updated weights for policy 0, policy_version 5810 (0.0007) [2023-10-08 00:05:36,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 11927552. Throughput: 0: 1704.8, 1: 1732.7. Samples: 2993870. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:05:36,211][50642] Avg episode reward: [(0, '9.200'), (1, '10.880')] [2023-10-08 00:05:36,212][52059] Updated weights for policy 1, policy_version 5882 (0.0010) [2023-10-08 00:05:36,535][52060] Updated weights for policy 0, policy_version 5820 (0.0008) [2023-10-08 00:05:40,263][52059] Updated weights for policy 1, policy_version 5892 (0.0009) [2023-10-08 00:05:40,633][52059] Updated weights for policy 1, policy_version 5902 (0.0010) [2023-10-08 00:05:40,695][52060] Updated weights for policy 0, policy_version 5830 (0.0008) [2023-10-08 00:05:40,989][52059] Updated weights for policy 1, policy_version 5912 (0.0007) [2023-10-08 00:05:41,063][52060] Updated weights for policy 0, policy_version 5840 (0.0008) [2023-10-08 00:05:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 11993088. Throughput: 0: 1705.3, 1: 1743.8. Samples: 3014974. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:05:41,211][50642] Avg episode reward: [(0, '9.360'), (1, '12.190')] [2023-10-08 00:05:41,278][51710] Saving new best policy, reward=12.190! [2023-10-08 00:05:41,439][52060] Updated weights for policy 0, policy_version 5850 (0.0008) [2023-10-08 00:05:44,888][52059] Updated weights for policy 1, policy_version 5922 (0.0008) [2023-10-08 00:05:45,254][52059] Updated weights for policy 1, policy_version 5932 (0.0008) [2023-10-08 00:05:45,346][52060] Updated weights for policy 0, policy_version 5860 (0.0009) [2023-10-08 00:05:45,639][52059] Updated weights for policy 1, policy_version 5942 (0.0009) [2023-10-08 00:05:45,714][52060] Updated weights for policy 0, policy_version 5870 (0.0008) [2023-10-08 00:05:45,998][52059] Updated weights for policy 1, policy_version 5952 (0.0007) [2023-10-08 00:05:46,083][52060] Updated weights for policy 0, policy_version 5880 (0.0008) [2023-10-08 00:05:46,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 12091392. Throughput: 0: 1689.5, 1: 1719.3. Samples: 3034472. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:05:46,211][50642] Avg episode reward: [(0, '9.690'), (1, '11.160')] [2023-10-08 00:05:49,944][52059] Updated weights for policy 1, policy_version 5962 (0.0009) [2023-10-08 00:05:50,143][52060] Updated weights for policy 0, policy_version 5890 (0.0008) [2023-10-08 00:05:50,298][52059] Updated weights for policy 1, policy_version 5972 (0.0009) [2023-10-08 00:05:50,518][52060] Updated weights for policy 0, policy_version 5900 (0.0010) [2023-10-08 00:05:50,666][52059] Updated weights for policy 1, policy_version 5982 (0.0008) [2023-10-08 00:05:50,891][52060] Updated weights for policy 0, policy_version 5910 (0.0009) [2023-10-08 00:05:51,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 12156928. Throughput: 0: 1700.2, 1: 1747.5. Samples: 3045608. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:05:51,211][50642] Avg episode reward: [(0, '9.010'), (1, '12.130')] [2023-10-08 00:05:51,259][52060] Updated weights for policy 0, policy_version 5920 (0.0010) [2023-10-08 00:05:54,731][52059] Updated weights for policy 1, policy_version 5992 (0.0008) [2023-10-08 00:05:55,091][52059] Updated weights for policy 1, policy_version 6002 (0.0008) [2023-10-08 00:05:55,232][52060] Updated weights for policy 0, policy_version 5930 (0.0007) [2023-10-08 00:05:55,454][52059] Updated weights for policy 1, policy_version 6012 (0.0009) [2023-10-08 00:05:55,612][52060] Updated weights for policy 0, policy_version 5940 (0.0008) [2023-10-08 00:05:55,975][52060] Updated weights for policy 0, policy_version 5950 (0.0007) [2023-10-08 00:05:56,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 12255232. Throughput: 0: 1706.6, 1: 1731.2. Samples: 3066534. Policy #0 lag: (min: 17.0, avg: 20.2, max: 49.0) [2023-10-08 00:05:56,211][50642] Avg episode reward: [(0, '9.700'), (1, '11.820')] [2023-10-08 00:05:59,436][52059] Updated weights for policy 1, policy_version 6022 (0.0007) [2023-10-08 00:05:59,800][52059] Updated weights for policy 1, policy_version 6032 (0.0008) [2023-10-08 00:05:59,963][52060] Updated weights for policy 0, policy_version 5960 (0.0008) [2023-10-08 00:06:00,173][52059] Updated weights for policy 1, policy_version 6042 (0.0010) [2023-10-08 00:06:00,329][52060] Updated weights for policy 0, policy_version 5970 (0.0008) [2023-10-08 00:06:00,703][52060] Updated weights for policy 0, policy_version 5980 (0.0008) [2023-10-08 00:06:01,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 12320768. Throughput: 0: 1684.0, 1: 1709.3. Samples: 3085676. Policy #0 lag: (min: 17.0, avg: 20.2, max: 49.0) [2023-10-08 00:06:01,211][50642] Avg episode reward: [(0, '9.910'), (1, '11.950')] [2023-10-08 00:06:04,050][52059] Updated weights for policy 1, policy_version 6052 (0.0010) [2023-10-08 00:06:04,364][52060] Updated weights for policy 0, policy_version 5990 (0.0007) [2023-10-08 00:06:04,414][52059] Updated weights for policy 1, policy_version 6062 (0.0008) [2023-10-08 00:06:04,736][52060] Updated weights for policy 0, policy_version 6000 (0.0008) [2023-10-08 00:06:04,792][52059] Updated weights for policy 1, policy_version 6072 (0.0009) [2023-10-08 00:06:05,103][52060] Updated weights for policy 0, policy_version 6010 (0.0007) [2023-10-08 00:06:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 12386304. Throughput: 0: 1715.8, 1: 1740.2. Samples: 3097708. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 00:06:06,211][50642] Avg episode reward: [(0, '9.760'), (1, '11.770')] [2023-10-08 00:06:08,779][52059] Updated weights for policy 1, policy_version 6082 (0.0009) [2023-10-08 00:06:09,198][52059] Updated weights for policy 1, policy_version 6092 (0.0008) [2023-10-08 00:06:09,299][52060] Updated weights for policy 0, policy_version 6020 (0.0008) [2023-10-08 00:06:09,555][52059] Updated weights for policy 1, policy_version 6102 (0.0008) [2023-10-08 00:06:09,667][52060] Updated weights for policy 0, policy_version 6030 (0.0009) [2023-10-08 00:06:09,923][52059] Updated weights for policy 1, policy_version 6112 (0.0008) [2023-10-08 00:06:10,038][52060] Updated weights for policy 0, policy_version 6040 (0.0009) [2023-10-08 00:06:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 12451840. Throughput: 0: 1700.0, 1: 1707.9. Samples: 3116856. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 00:06:11,211][50642] Avg episode reward: [(0, '10.100'), (1, '11.440')] [2023-10-08 00:06:11,211][51605] Saving new best policy, reward=10.100! [2023-10-08 00:06:13,966][52059] Updated weights for policy 1, policy_version 6122 (0.0008) [2023-10-08 00:06:14,092][52060] Updated weights for policy 0, policy_version 6050 (0.0009) [2023-10-08 00:06:14,324][52059] Updated weights for policy 1, policy_version 6132 (0.0008) [2023-10-08 00:06:14,496][52060] Updated weights for policy 0, policy_version 6060 (0.0008) [2023-10-08 00:06:14,704][52059] Updated weights for policy 1, policy_version 6142 (0.0009) [2023-10-08 00:06:14,862][52060] Updated weights for policy 0, policy_version 6070 (0.0007) [2023-10-08 00:06:15,239][52060] Updated weights for policy 0, policy_version 6080 (0.0007) [2023-10-08 00:06:16,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 12517376. Throughput: 0: 1695.3, 1: 1711.5. Samples: 3137264. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:06:16,211][50642] Avg episode reward: [(0, '9.280'), (1, '11.570')] [2023-10-08 00:06:18,559][52059] Updated weights for policy 1, policy_version 6152 (0.0008) [2023-10-08 00:06:18,912][52059] Updated weights for policy 1, policy_version 6162 (0.0008) [2023-10-08 00:06:19,251][52060] Updated weights for policy 0, policy_version 6090 (0.0007) [2023-10-08 00:06:19,275][52059] Updated weights for policy 1, policy_version 6172 (0.0008) [2023-10-08 00:06:19,621][52060] Updated weights for policy 0, policy_version 6100 (0.0007) [2023-10-08 00:06:19,997][52060] Updated weights for policy 0, policy_version 6110 (0.0008) [2023-10-08 00:06:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 12582912. Throughput: 0: 1716.8, 1: 1718.8. Samples: 3148470. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:06:21,211][50642] Avg episode reward: [(0, '9.960'), (1, '11.590')] [2023-10-08 00:06:23,209][52059] Updated weights for policy 1, policy_version 6182 (0.0009) [2023-10-08 00:06:23,577][52059] Updated weights for policy 1, policy_version 6192 (0.0009) [2023-10-08 00:06:23,949][52059] Updated weights for policy 1, policy_version 6202 (0.0008) [2023-10-08 00:06:23,985][52060] Updated weights for policy 0, policy_version 6120 (0.0010) [2023-10-08 00:06:24,354][52060] Updated weights for policy 0, policy_version 6130 (0.0008) [2023-10-08 00:06:24,726][52060] Updated weights for policy 0, policy_version 6140 (0.0009) [2023-10-08 00:06:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 12648448. Throughput: 0: 1689.5, 1: 1710.2. Samples: 3167960. Policy #0 lag: (min: 10.0, avg: 14.4, max: 37.0) [2023-10-08 00:06:26,211][50642] Avg episode reward: [(0, '9.850'), (1, '12.280')] [2023-10-08 00:06:26,212][51710] Saving new best policy, reward=12.280! [2023-10-08 00:06:27,834][52059] Updated weights for policy 1, policy_version 6212 (0.0009) [2023-10-08 00:06:28,208][52059] Updated weights for policy 1, policy_version 6222 (0.0011) [2023-10-08 00:06:28,573][52059] Updated weights for policy 1, policy_version 6232 (0.0007) [2023-10-08 00:06:28,790][52060] Updated weights for policy 0, policy_version 6150 (0.0009) [2023-10-08 00:06:29,161][52060] Updated weights for policy 0, policy_version 6160 (0.0010) [2023-10-08 00:06:29,537][52060] Updated weights for policy 0, policy_version 6170 (0.0011) [2023-10-08 00:06:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 12713984. Throughput: 0: 1700.6, 1: 1735.5. Samples: 3189096. Policy #0 lag: (min: 10.0, avg: 14.4, max: 37.0) [2023-10-08 00:06:31,211][50642] Avg episode reward: [(0, '9.350'), (1, '12.090')] [2023-10-08 00:06:32,459][52059] Updated weights for policy 1, policy_version 6242 (0.0009) [2023-10-08 00:06:32,825][52059] Updated weights for policy 1, policy_version 6252 (0.0007) [2023-10-08 00:06:33,191][52059] Updated weights for policy 1, policy_version 6262 (0.0007) [2023-10-08 00:06:33,431][52060] Updated weights for policy 0, policy_version 6180 (0.0009) [2023-10-08 00:06:33,552][52059] Updated weights for policy 1, policy_version 6272 (0.0007) [2023-10-08 00:06:33,805][52060] Updated weights for policy 0, policy_version 6190 (0.0009) [2023-10-08 00:06:34,178][52060] Updated weights for policy 0, policy_version 6200 (0.0007) [2023-10-08 00:06:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13662.6). Total num frames: 12779520. Throughput: 0: 1707.8, 1: 1707.9. Samples: 3199312. Policy #0 lag: (min: 30.0, avg: 33.6, max: 62.0) [2023-10-08 00:06:36,211][50642] Avg episode reward: [(0, '9.550'), (1, '11.990')] [2023-10-08 00:06:37,264][52059] Updated weights for policy 1, policy_version 6282 (0.0007) [2023-10-08 00:06:37,628][52059] Updated weights for policy 1, policy_version 6292 (0.0008) [2023-10-08 00:06:37,997][52059] Updated weights for policy 1, policy_version 6302 (0.0008) [2023-10-08 00:06:38,127][52060] Updated weights for policy 0, policy_version 6210 (0.0009) [2023-10-08 00:06:38,502][52060] Updated weights for policy 0, policy_version 6220 (0.0008) [2023-10-08 00:06:38,871][52060] Updated weights for policy 0, policy_version 6230 (0.0009) [2023-10-08 00:06:39,249][52060] Updated weights for policy 0, policy_version 6240 (0.0007) [2023-10-08 00:06:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13662.6). Total num frames: 12845056. Throughput: 0: 1686.1, 1: 1724.8. Samples: 3220026. Policy #0 lag: (min: 30.0, avg: 33.6, max: 62.0) [2023-10-08 00:06:41,211][50642] Avg episode reward: [(0, '9.610'), (1, '11.700')] [2023-10-08 00:06:42,006][52059] Updated weights for policy 1, policy_version 6312 (0.0009) [2023-10-08 00:06:42,368][52059] Updated weights for policy 1, policy_version 6322 (0.0010) [2023-10-08 00:06:42,739][52059] Updated weights for policy 1, policy_version 6332 (0.0008) [2023-10-08 00:06:43,126][52060] Updated weights for policy 0, policy_version 6250 (0.0008) [2023-10-08 00:06:43,499][52060] Updated weights for policy 0, policy_version 6260 (0.0008) [2023-10-08 00:06:43,875][52060] Updated weights for policy 0, policy_version 6270 (0.0010) [2023-10-08 00:06:46,211][50642] Fps is (10 sec: 13106.3, 60 sec: 13653.2, 300 sec: 13662.6). Total num frames: 12910592. Throughput: 0: 1712.7, 1: 1743.1. Samples: 3241190. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:06:46,212][50642] Avg episode reward: [(0, '9.260'), (1, '11.560')] [2023-10-08 00:06:46,222][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000006336_6488064.pth... [2023-10-08 00:06:46,222][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000006272_6422528.pth... [2023-10-08 00:06:46,275][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000004672_4784128.pth [2023-10-08 00:06:46,275][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000004736_4849664.pth [2023-10-08 00:06:46,749][52059] Updated weights for policy 1, policy_version 6342 (0.0007) [2023-10-08 00:06:47,118][52059] Updated weights for policy 1, policy_version 6352 (0.0007) [2023-10-08 00:06:47,485][52059] Updated weights for policy 1, policy_version 6362 (0.0007) [2023-10-08 00:06:47,858][52060] Updated weights for policy 0, policy_version 6280 (0.0008) [2023-10-08 00:06:48,223][52060] Updated weights for policy 0, policy_version 6290 (0.0008) [2023-10-08 00:06:48,588][52060] Updated weights for policy 0, policy_version 6300 (0.0009) [2023-10-08 00:06:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 12976128. Throughput: 0: 1682.1, 1: 1716.3. Samples: 3250638. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:06:51,211][50642] Avg episode reward: [(0, '10.910'), (1, '11.410')] [2023-10-08 00:06:51,211][51605] Saving new best policy, reward=10.910! [2023-10-08 00:06:51,351][52059] Updated weights for policy 1, policy_version 6372 (0.0008) [2023-10-08 00:06:51,711][52059] Updated weights for policy 1, policy_version 6382 (0.0007) [2023-10-08 00:06:52,086][52059] Updated weights for policy 1, policy_version 6392 (0.0009) [2023-10-08 00:06:52,589][52060] Updated weights for policy 0, policy_version 6310 (0.0007) [2023-10-08 00:06:52,956][52060] Updated weights for policy 0, policy_version 6320 (0.0007) [2023-10-08 00:06:53,328][52060] Updated weights for policy 0, policy_version 6330 (0.0007) [2023-10-08 00:06:56,006][52059] Updated weights for policy 1, policy_version 6402 (0.0008) [2023-10-08 00:06:56,210][50642] Fps is (10 sec: 13107.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 13041664. Throughput: 0: 1699.5, 1: 1752.5. Samples: 3272194. Policy #0 lag: (min: 31.0, avg: 32.6, max: 58.0) [2023-10-08 00:06:56,211][50642] Avg episode reward: [(0, '9.190'), (1, '12.180')] [2023-10-08 00:06:56,398][52059] Updated weights for policy 1, policy_version 6412 (0.0007) [2023-10-08 00:06:56,765][52059] Updated weights for policy 1, policy_version 6422 (0.0007) [2023-10-08 00:06:57,126][52059] Updated weights for policy 1, policy_version 6432 (0.0008) [2023-10-08 00:06:57,240][52060] Updated weights for policy 0, policy_version 6340 (0.0007) [2023-10-08 00:06:57,608][52060] Updated weights for policy 0, policy_version 6350 (0.0007) [2023-10-08 00:06:57,987][52060] Updated weights for policy 0, policy_version 6360 (0.0009) [2023-10-08 00:07:01,018][52059] Updated weights for policy 1, policy_version 6442 (0.0010) [2023-10-08 00:07:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 13107200. Throughput: 0: 1714.2, 1: 1748.8. Samples: 3293096. Policy #0 lag: (min: 31.0, avg: 32.6, max: 58.0) [2023-10-08 00:07:01,211][50642] Avg episode reward: [(0, '10.510'), (1, '11.810')] [2023-10-08 00:07:01,378][52059] Updated weights for policy 1, policy_version 6452 (0.0009) [2023-10-08 00:07:01,747][52059] Updated weights for policy 1, policy_version 6462 (0.0009) [2023-10-08 00:07:02,049][52060] Updated weights for policy 0, policy_version 6370 (0.0008) [2023-10-08 00:07:02,455][52060] Updated weights for policy 0, policy_version 6380 (0.0009) [2023-10-08 00:07:02,835][52060] Updated weights for policy 0, policy_version 6390 (0.0010) [2023-10-08 00:07:03,211][52060] Updated weights for policy 0, policy_version 6400 (0.0008) [2023-10-08 00:07:05,702][52059] Updated weights for policy 1, policy_version 6472 (0.0009) [2023-10-08 00:07:06,073][52059] Updated weights for policy 1, policy_version 6482 (0.0007) [2023-10-08 00:07:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 13172736. Throughput: 0: 1688.6, 1: 1737.8. Samples: 3302656. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:07:06,211][50642] Avg episode reward: [(0, '9.740'), (1, '12.080')] [2023-10-08 00:07:06,443][52059] Updated weights for policy 1, policy_version 6492 (0.0007) [2023-10-08 00:07:07,153][52060] Updated weights for policy 0, policy_version 6410 (0.0010) [2023-10-08 00:07:07,528][52060] Updated weights for policy 0, policy_version 6420 (0.0008) [2023-10-08 00:07:07,892][52060] Updated weights for policy 0, policy_version 6430 (0.0007) [2023-10-08 00:07:10,275][52059] Updated weights for policy 1, policy_version 6502 (0.0007) [2023-10-08 00:07:10,639][52059] Updated weights for policy 1, policy_version 6512 (0.0008) [2023-10-08 00:07:11,006][52059] Updated weights for policy 1, policy_version 6522 (0.0008) [2023-10-08 00:07:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 13238272. Throughput: 0: 1717.5, 1: 1754.7. Samples: 3324210. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:07:11,211][50642] Avg episode reward: [(0, '9.980'), (1, '12.380')] [2023-10-08 00:07:11,216][51710] Saving new best policy, reward=12.380! [2023-10-08 00:07:11,954][52060] Updated weights for policy 0, policy_version 6440 (0.0009) [2023-10-08 00:07:12,314][52060] Updated weights for policy 0, policy_version 6450 (0.0009) [2023-10-08 00:07:12,691][52060] Updated weights for policy 0, policy_version 6460 (0.0010) [2023-10-08 00:07:14,916][52059] Updated weights for policy 1, policy_version 6532 (0.0008) [2023-10-08 00:07:15,287][52059] Updated weights for policy 1, policy_version 6542 (0.0008) [2023-10-08 00:07:15,644][52059] Updated weights for policy 1, policy_version 6552 (0.0008) [2023-10-08 00:07:16,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 13336576. Throughput: 0: 1722.0, 1: 1728.5. Samples: 3344368. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:07:16,211][50642] Avg episode reward: [(0, '10.350'), (1, '12.420')] [2023-10-08 00:07:16,217][51710] Saving new best policy, reward=12.420! [2023-10-08 00:07:16,665][52060] Updated weights for policy 0, policy_version 6470 (0.0009) [2023-10-08 00:07:17,051][52060] Updated weights for policy 0, policy_version 6480 (0.0011) [2023-10-08 00:07:17,405][52060] Updated weights for policy 0, policy_version 6490 (0.0011) [2023-10-08 00:07:19,567][52059] Updated weights for policy 1, policy_version 6562 (0.0010) [2023-10-08 00:07:19,929][52059] Updated weights for policy 1, policy_version 6572 (0.0010) [2023-10-08 00:07:20,303][52059] Updated weights for policy 1, policy_version 6582 (0.0008) [2023-10-08 00:07:20,660][52059] Updated weights for policy 1, policy_version 6592 (0.0009) [2023-10-08 00:07:21,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 13402112. Throughput: 0: 1702.1, 1: 1757.0. Samples: 3354974. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:07:21,211][50642] Avg episode reward: [(0, '9.690'), (1, '12.450')] [2023-10-08 00:07:21,213][51710] Saving new best policy, reward=12.450! [2023-10-08 00:07:21,221][52060] Updated weights for policy 0, policy_version 6500 (0.0010) [2023-10-08 00:07:21,586][52060] Updated weights for policy 0, policy_version 6510 (0.0010) [2023-10-08 00:07:21,963][52060] Updated weights for policy 0, policy_version 6520 (0.0009) [2023-10-08 00:07:24,634][52059] Updated weights for policy 1, policy_version 6602 (0.0008) [2023-10-08 00:07:24,990][52059] Updated weights for policy 1, policy_version 6612 (0.0008) [2023-10-08 00:07:25,352][52059] Updated weights for policy 1, policy_version 6622 (0.0009) [2023-10-08 00:07:25,838][52060] Updated weights for policy 0, policy_version 6530 (0.0009) [2023-10-08 00:07:26,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 13467648. Throughput: 0: 1720.0, 1: 1737.5. Samples: 3375616. Policy #0 lag: (min: 15.0, avg: 31.4, max: 32.0) [2023-10-08 00:07:26,211][50642] Avg episode reward: [(0, '10.470'), (1, '11.260')] [2023-10-08 00:07:26,217][52060] Updated weights for policy 0, policy_version 6540 (0.0008) [2023-10-08 00:07:26,593][52060] Updated weights for policy 0, policy_version 6550 (0.0007) [2023-10-08 00:07:26,975][52060] Updated weights for policy 0, policy_version 6560 (0.0008) [2023-10-08 00:07:29,371][52059] Updated weights for policy 1, policy_version 6632 (0.0007) [2023-10-08 00:07:29,732][52059] Updated weights for policy 1, policy_version 6642 (0.0007) [2023-10-08 00:07:30,095][52059] Updated weights for policy 1, policy_version 6652 (0.0007) [2023-10-08 00:07:30,846][52060] Updated weights for policy 0, policy_version 6570 (0.0010) [2023-10-08 00:07:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 13533184. Throughput: 0: 1714.5, 1: 1723.8. Samples: 3395910. Policy #0 lag: (min: 15.0, avg: 31.4, max: 32.0) [2023-10-08 00:07:31,211][50642] Avg episode reward: [(0, '10.330'), (1, '12.900')] [2023-10-08 00:07:31,216][52060] Updated weights for policy 0, policy_version 6580 (0.0011) [2023-10-08 00:07:31,217][51710] Saving new best policy, reward=12.900! [2023-10-08 00:07:31,583][52060] Updated weights for policy 0, policy_version 6590 (0.0007) [2023-10-08 00:07:33,812][52059] Updated weights for policy 1, policy_version 6662 (0.0009) [2023-10-08 00:07:34,179][52059] Updated weights for policy 1, policy_version 6672 (0.0007) [2023-10-08 00:07:34,553][52059] Updated weights for policy 1, policy_version 6682 (0.0008) [2023-10-08 00:07:35,633][52060] Updated weights for policy 0, policy_version 6600 (0.0008) [2023-10-08 00:07:36,006][52060] Updated weights for policy 0, policy_version 6610 (0.0007) [2023-10-08 00:07:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.2, 300 sec: 13773.7). Total num frames: 13598720. Throughput: 0: 1720.2, 1: 1752.2. Samples: 3406894. Policy #0 lag: (min: 31.0, avg: 31.1, max: 36.0) [2023-10-08 00:07:36,212][50642] Avg episode reward: [(0, '10.310'), (1, '12.010')] [2023-10-08 00:07:36,383][52060] Updated weights for policy 0, policy_version 6620 (0.0008) [2023-10-08 00:07:38,537][52059] Updated weights for policy 1, policy_version 6692 (0.0008) [2023-10-08 00:07:38,898][52059] Updated weights for policy 1, policy_version 6702 (0.0009) [2023-10-08 00:07:39,272][52059] Updated weights for policy 1, policy_version 6712 (0.0008) [2023-10-08 00:07:40,541][52060] Updated weights for policy 0, policy_version 6630 (0.0008) [2023-10-08 00:07:40,905][52060] Updated weights for policy 0, policy_version 6640 (0.0008) [2023-10-08 00:07:41,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13662.8). Total num frames: 13664256. Throughput: 0: 1723.3, 1: 1720.8. Samples: 3427180. Policy #0 lag: (min: 31.0, avg: 31.1, max: 36.0) [2023-10-08 00:07:41,211][50642] Avg episode reward: [(0, '10.530'), (1, '12.540')] [2023-10-08 00:07:41,286][52060] Updated weights for policy 0, policy_version 6650 (0.0007) [2023-10-08 00:07:43,255][52059] Updated weights for policy 1, policy_version 6722 (0.0008) [2023-10-08 00:07:43,649][52059] Updated weights for policy 1, policy_version 6732 (0.0009) [2023-10-08 00:07:44,015][52059] Updated weights for policy 1, policy_version 6742 (0.0008) [2023-10-08 00:07:44,375][52059] Updated weights for policy 1, policy_version 6752 (0.0008) [2023-10-08 00:07:45,294][52060] Updated weights for policy 0, policy_version 6660 (0.0008) [2023-10-08 00:07:45,659][52060] Updated weights for policy 0, policy_version 6670 (0.0011) [2023-10-08 00:07:46,020][52060] Updated weights for policy 0, policy_version 6680 (0.0009) [2023-10-08 00:07:46,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 13729792. Throughput: 0: 1709.4, 1: 1727.3. Samples: 3447750. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:07:46,211][50642] Avg episode reward: [(0, '10.270'), (1, '11.520')] [2023-10-08 00:07:48,321][52059] Updated weights for policy 1, policy_version 6762 (0.0009) [2023-10-08 00:07:48,685][52059] Updated weights for policy 1, policy_version 6772 (0.0007) [2023-10-08 00:07:49,058][52059] Updated weights for policy 1, policy_version 6782 (0.0009) [2023-10-08 00:07:50,051][52060] Updated weights for policy 0, policy_version 6690 (0.0011) [2023-10-08 00:07:50,438][52060] Updated weights for policy 0, policy_version 6700 (0.0007) [2023-10-08 00:07:50,807][52060] Updated weights for policy 0, policy_version 6710 (0.0009) [2023-10-08 00:07:51,178][52060] Updated weights for policy 0, policy_version 6720 (0.0009) [2023-10-08 00:07:51,210][50642] Fps is (10 sec: 16384.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 13828096. Throughput: 0: 1724.8, 1: 1729.2. Samples: 3458086. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) [2023-10-08 00:07:51,211][50642] Avg episode reward: [(0, '10.730'), (1, '12.700')] [2023-10-08 00:07:53,023][52059] Updated weights for policy 1, policy_version 6792 (0.0011) [2023-10-08 00:07:53,386][52059] Updated weights for policy 1, policy_version 6802 (0.0010) [2023-10-08 00:07:53,748][52059] Updated weights for policy 1, policy_version 6812 (0.0008) [2023-10-08 00:07:55,123][52060] Updated weights for policy 0, policy_version 6730 (0.0008) [2023-10-08 00:07:55,490][52060] Updated weights for policy 0, policy_version 6740 (0.0009) [2023-10-08 00:07:55,855][52060] Updated weights for policy 0, policy_version 6750 (0.0009) [2023-10-08 00:07:56,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 13893632. Throughput: 0: 1717.3, 1: 1716.5. Samples: 3478730. Policy #0 lag: (min: 31.0, avg: 31.0, max: 33.0) [2023-10-08 00:07:56,211][50642] Avg episode reward: [(0, '10.440'), (1, '12.420')] [2023-10-08 00:07:57,681][52059] Updated weights for policy 1, policy_version 6822 (0.0007) [2023-10-08 00:07:58,050][52059] Updated weights for policy 1, policy_version 6832 (0.0009) [2023-10-08 00:07:58,413][52059] Updated weights for policy 1, policy_version 6842 (0.0007) [2023-10-08 00:07:59,900][52060] Updated weights for policy 0, policy_version 6760 (0.0007) [2023-10-08 00:08:00,270][52060] Updated weights for policy 0, policy_version 6770 (0.0009) [2023-10-08 00:08:00,641][52060] Updated weights for policy 0, policy_version 6780 (0.0008) [2023-10-08 00:08:01,210][50642] Fps is (10 sec: 13106.7, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 13959168. Throughput: 0: 1691.1, 1: 1745.4. Samples: 3499012. Policy #0 lag: (min: 31.0, avg: 32.4, max: 56.0) [2023-10-08 00:08:01,211][50642] Avg episode reward: [(0, '10.210'), (1, '13.000')] [2023-10-08 00:08:01,222][51710] Saving new best policy, reward=13.000! [2023-10-08 00:08:02,452][52059] Updated weights for policy 1, policy_version 6852 (0.0007) [2023-10-08 00:08:02,814][52059] Updated weights for policy 1, policy_version 6862 (0.0007) [2023-10-08 00:08:03,186][52059] Updated weights for policy 1, policy_version 6872 (0.0008) [2023-10-08 00:08:04,719][52060] Updated weights for policy 0, policy_version 6790 (0.0008) [2023-10-08 00:08:05,084][52060] Updated weights for policy 0, policy_version 6800 (0.0009) [2023-10-08 00:08:05,450][52060] Updated weights for policy 0, policy_version 6810 (0.0008) [2023-10-08 00:08:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 14024704. Throughput: 0: 1721.0, 1: 1714.0. Samples: 3509548. Policy #0 lag: (min: 31.0, avg: 32.4, max: 56.0) [2023-10-08 00:08:06,211][50642] Avg episode reward: [(0, '10.500'), (1, '11.850')] [2023-10-08 00:08:06,911][52059] Updated weights for policy 1, policy_version 6882 (0.0009) [2023-10-08 00:08:07,284][52059] Updated weights for policy 1, policy_version 6892 (0.0007) [2023-10-08 00:08:07,651][52059] Updated weights for policy 1, policy_version 6902 (0.0008) [2023-10-08 00:08:08,020][52059] Updated weights for policy 1, policy_version 6912 (0.0008) [2023-10-08 00:08:09,395][52060] Updated weights for policy 0, policy_version 6820 (0.0011) [2023-10-08 00:08:09,770][52060] Updated weights for policy 0, policy_version 6830 (0.0011) [2023-10-08 00:08:10,138][52060] Updated weights for policy 0, policy_version 6840 (0.0010) [2023-10-08 00:08:11,210][50642] Fps is (10 sec: 13107.7, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 14090240. Throughput: 0: 1705.9, 1: 1735.4. Samples: 3530474. Policy #0 lag: (min: 14.0, avg: 24.3, max: 46.0) [2023-10-08 00:08:11,211][50642] Avg episode reward: [(0, '9.350'), (1, '13.160')] [2023-10-08 00:08:11,211][51710] Saving new best policy, reward=13.160! [2023-10-08 00:08:11,827][52059] Updated weights for policy 1, policy_version 6922 (0.0008) [2023-10-08 00:08:12,197][52059] Updated weights for policy 1, policy_version 6932 (0.0010) [2023-10-08 00:08:12,557][52059] Updated weights for policy 1, policy_version 6942 (0.0009) [2023-10-08 00:08:13,911][52060] Updated weights for policy 0, policy_version 6850 (0.0010) [2023-10-08 00:08:14,286][52060] Updated weights for policy 0, policy_version 6860 (0.0010) [2023-10-08 00:08:14,644][52060] Updated weights for policy 0, policy_version 6870 (0.0007) [2023-10-08 00:08:15,009][52060] Updated weights for policy 0, policy_version 6880 (0.0009) [2023-10-08 00:08:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 14155776. Throughput: 0: 1698.6, 1: 1751.3. Samples: 3551158. Policy #0 lag: (min: 14.0, avg: 24.3, max: 46.0) [2023-10-08 00:08:16,211][50642] Avg episode reward: [(0, '10.670'), (1, '11.140')] [2023-10-08 00:08:16,586][52059] Updated weights for policy 1, policy_version 6952 (0.0009) [2023-10-08 00:08:16,946][52059] Updated weights for policy 1, policy_version 6962 (0.0009) [2023-10-08 00:08:17,314][52059] Updated weights for policy 1, policy_version 6972 (0.0008) [2023-10-08 00:08:18,968][52060] Updated weights for policy 0, policy_version 6890 (0.0009) [2023-10-08 00:08:19,346][52060] Updated weights for policy 0, policy_version 6900 (0.0007) [2023-10-08 00:08:19,721][52060] Updated weights for policy 0, policy_version 6910 (0.0007) [2023-10-08 00:08:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 14221312. Throughput: 0: 1719.1, 1: 1722.0. Samples: 3561742. Policy #0 lag: (min: 24.0, avg: 47.1, max: 56.0) [2023-10-08 00:08:21,211][50642] Avg episode reward: [(0, '10.340'), (1, '12.310')] [2023-10-08 00:08:21,331][52059] Updated weights for policy 1, policy_version 6982 (0.0007) [2023-10-08 00:08:21,692][52059] Updated weights for policy 1, policy_version 6992 (0.0007) [2023-10-08 00:08:22,061][52059] Updated weights for policy 1, policy_version 7002 (0.0008) [2023-10-08 00:08:23,532][52060] Updated weights for policy 0, policy_version 6920 (0.0010) [2023-10-08 00:08:23,904][52060] Updated weights for policy 0, policy_version 6930 (0.0009) [2023-10-08 00:08:24,276][52060] Updated weights for policy 0, policy_version 6940 (0.0011) [2023-10-08 00:08:26,045][52059] Updated weights for policy 1, policy_version 7012 (0.0007) [2023-10-08 00:08:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 14286848. Throughput: 0: 1691.9, 1: 1743.8. Samples: 3581786. Policy #0 lag: (min: 24.0, avg: 47.1, max: 56.0) [2023-10-08 00:08:26,211][50642] Avg episode reward: [(0, '11.170'), (1, '11.070')] [2023-10-08 00:08:26,211][51605] Saving new best policy, reward=11.170! [2023-10-08 00:08:26,404][52059] Updated weights for policy 1, policy_version 7022 (0.0007) [2023-10-08 00:08:26,774][52059] Updated weights for policy 1, policy_version 7032 (0.0007) [2023-10-08 00:08:28,332][52060] Updated weights for policy 0, policy_version 6950 (0.0009) [2023-10-08 00:08:28,708][52060] Updated weights for policy 0, policy_version 6960 (0.0007) [2023-10-08 00:08:29,076][52060] Updated weights for policy 0, policy_version 6970 (0.0007) [2023-10-08 00:08:30,589][52059] Updated weights for policy 1, policy_version 7042 (0.0010) [2023-10-08 00:08:30,992][52059] Updated weights for policy 1, policy_version 7052 (0.0010) [2023-10-08 00:08:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 14352384. Throughput: 0: 1703.5, 1: 1740.4. Samples: 3602724. Policy #0 lag: (min: 31.0, avg: 31.2, max: 42.0) [2023-10-08 00:08:31,211][50642] Avg episode reward: [(0, '10.870'), (1, '12.500')] [2023-10-08 00:08:31,357][52059] Updated weights for policy 1, policy_version 7062 (0.0010) [2023-10-08 00:08:31,725][52059] Updated weights for policy 1, policy_version 7072 (0.0007) [2023-10-08 00:08:33,063][52060] Updated weights for policy 0, policy_version 6980 (0.0008) [2023-10-08 00:08:33,439][52060] Updated weights for policy 0, policy_version 6990 (0.0007) [2023-10-08 00:08:33,798][52060] Updated weights for policy 0, policy_version 7000 (0.0007) [2023-10-08 00:08:35,679][52059] Updated weights for policy 1, policy_version 7082 (0.0009) [2023-10-08 00:08:36,048][52059] Updated weights for policy 1, policy_version 7092 (0.0008) [2023-10-08 00:08:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 14417920. Throughput: 0: 1698.8, 1: 1737.5. Samples: 3612718. Policy #0 lag: (min: 31.0, avg: 31.2, max: 42.0) [2023-10-08 00:08:36,211][50642] Avg episode reward: [(0, '10.590'), (1, '12.710')] [2023-10-08 00:08:36,404][52059] Updated weights for policy 1, policy_version 7102 (0.0009) [2023-10-08 00:08:37,903][52060] Updated weights for policy 0, policy_version 7010 (0.0009) [2023-10-08 00:08:38,299][52060] Updated weights for policy 0, policy_version 7020 (0.0009) [2023-10-08 00:08:38,671][52060] Updated weights for policy 0, policy_version 7030 (0.0008) [2023-10-08 00:08:39,043][52060] Updated weights for policy 0, policy_version 7040 (0.0009) [2023-10-08 00:08:40,328][52059] Updated weights for policy 1, policy_version 7112 (0.0008) [2023-10-08 00:08:40,705][52059] Updated weights for policy 1, policy_version 7122 (0.0008) [2023-10-08 00:08:41,075][52059] Updated weights for policy 1, policy_version 7132 (0.0010) [2023-10-08 00:08:41,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 14483456. Throughput: 0: 1693.5, 1: 1751.5. Samples: 3633756. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) [2023-10-08 00:08:41,211][50642] Avg episode reward: [(0, '11.130'), (1, '12.570')] [2023-10-08 00:08:43,140][52060] Updated weights for policy 0, policy_version 7050 (0.0009) [2023-10-08 00:08:43,512][52060] Updated weights for policy 0, policy_version 7060 (0.0011) [2023-10-08 00:08:43,886][52060] Updated weights for policy 0, policy_version 7070 (0.0011) [2023-10-08 00:08:44,943][52059] Updated weights for policy 1, policy_version 7142 (0.0007) [2023-10-08 00:08:45,308][52059] Updated weights for policy 1, policy_version 7152 (0.0009) [2023-10-08 00:08:45,664][52059] Updated weights for policy 1, policy_version 7162 (0.0009) [2023-10-08 00:08:46,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 14581760. Throughput: 0: 1716.7, 1: 1719.8. Samples: 3653654. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) [2023-10-08 00:08:46,211][50642] Avg episode reward: [(0, '10.260'), (1, '12.360')] [2023-10-08 00:08:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000007072_7241728.pth... [2023-10-08 00:08:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000007168_7340032.pth... [2023-10-08 00:08:46,255][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000005536_5668864.pth [2023-10-08 00:08:46,258][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000005472_5603328.pth [2023-10-08 00:08:47,854][52060] Updated weights for policy 0, policy_version 7080 (0.0009) [2023-10-08 00:08:48,221][52060] Updated weights for policy 0, policy_version 7090 (0.0008) [2023-10-08 00:08:48,591][52060] Updated weights for policy 0, policy_version 7100 (0.0008) [2023-10-08 00:08:49,673][52059] Updated weights for policy 1, policy_version 7172 (0.0008) [2023-10-08 00:08:50,038][52059] Updated weights for policy 1, policy_version 7182 (0.0007) [2023-10-08 00:08:50,394][52059] Updated weights for policy 1, policy_version 7192 (0.0008) [2023-10-08 00:08:51,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 14647296. Throughput: 0: 1686.7, 1: 1750.9. Samples: 3664242. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 00:08:51,211][50642] Avg episode reward: [(0, '10.910'), (1, '12.260')] [2023-10-08 00:08:52,626][52060] Updated weights for policy 0, policy_version 7110 (0.0008) [2023-10-08 00:08:53,000][52060] Updated weights for policy 0, policy_version 7120 (0.0008) [2023-10-08 00:08:53,374][52060] Updated weights for policy 0, policy_version 7130 (0.0008) [2023-10-08 00:08:54,326][52059] Updated weights for policy 1, policy_version 7202 (0.0008) [2023-10-08 00:08:54,698][52059] Updated weights for policy 1, policy_version 7212 (0.0009) [2023-10-08 00:08:55,056][52059] Updated weights for policy 1, policy_version 7222 (0.0008) [2023-10-08 00:08:55,416][52059] Updated weights for policy 1, policy_version 7232 (0.0009) [2023-10-08 00:08:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 14712832. Throughput: 0: 1700.4, 1: 1730.3. Samples: 3684856. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 00:08:56,211][50642] Avg episode reward: [(0, '9.560'), (1, '11.660')] [2023-10-08 00:08:57,256][52060] Updated weights for policy 0, policy_version 7140 (0.0009) [2023-10-08 00:08:57,637][52060] Updated weights for policy 0, policy_version 7150 (0.0007) [2023-10-08 00:08:58,007][52060] Updated weights for policy 0, policy_version 7160 (0.0008) [2023-10-08 00:08:59,240][52059] Updated weights for policy 1, policy_version 7242 (0.0010) [2023-10-08 00:08:59,608][52059] Updated weights for policy 1, policy_version 7252 (0.0009) [2023-10-08 00:08:59,973][52059] Updated weights for policy 1, policy_version 7262 (0.0008) [2023-10-08 00:09:01,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 14778368. Throughput: 0: 1720.7, 1: 1718.6. Samples: 3705924. Policy #0 lag: (min: 31.0, avg: 32.3, max: 57.0) [2023-10-08 00:09:01,211][50642] Avg episode reward: [(0, '11.700'), (1, '11.800')] [2023-10-08 00:09:01,217][51605] Saving new best policy, reward=11.700! [2023-10-08 00:09:01,716][52060] Updated weights for policy 0, policy_version 7170 (0.0008) [2023-10-08 00:09:02,086][52060] Updated weights for policy 0, policy_version 7180 (0.0009) [2023-10-08 00:09:02,462][52060] Updated weights for policy 0, policy_version 7190 (0.0008) [2023-10-08 00:09:02,827][52060] Updated weights for policy 0, policy_version 7200 (0.0009) [2023-10-08 00:09:03,950][52059] Updated weights for policy 1, policy_version 7272 (0.0010) [2023-10-08 00:09:04,315][52059] Updated weights for policy 1, policy_version 7282 (0.0011) [2023-10-08 00:09:04,691][52059] Updated weights for policy 1, policy_version 7292 (0.0008) [2023-10-08 00:09:06,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 14843904. Throughput: 0: 1691.2, 1: 1743.3. Samples: 3716296. Policy #0 lag: (min: 31.0, avg: 32.3, max: 57.0) [2023-10-08 00:09:06,211][50642] Avg episode reward: [(0, '10.130'), (1, '12.720')] [2023-10-08 00:09:06,853][52060] Updated weights for policy 0, policy_version 7210 (0.0008) [2023-10-08 00:09:07,229][52060] Updated weights for policy 0, policy_version 7220 (0.0009) [2023-10-08 00:09:07,601][52060] Updated weights for policy 0, policy_version 7230 (0.0009) [2023-10-08 00:09:08,385][52059] Updated weights for policy 1, policy_version 7302 (0.0009) [2023-10-08 00:09:08,749][52059] Updated weights for policy 1, policy_version 7312 (0.0011) [2023-10-08 00:09:09,111][52059] Updated weights for policy 1, policy_version 7322 (0.0009) [2023-10-08 00:09:11,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 14909440. Throughput: 0: 1719.7, 1: 1727.0. Samples: 3736888. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 00:09:11,211][50642] Avg episode reward: [(0, '11.620'), (1, '13.220')] [2023-10-08 00:09:11,213][51710] Saving new best policy, reward=13.220! [2023-10-08 00:09:11,372][52060] Updated weights for policy 0, policy_version 7240 (0.0011) [2023-10-08 00:09:11,735][52060] Updated weights for policy 0, policy_version 7250 (0.0009) [2023-10-08 00:09:12,109][52060] Updated weights for policy 0, policy_version 7260 (0.0009) [2023-10-08 00:09:13,105][52059] Updated weights for policy 1, policy_version 7332 (0.0007) [2023-10-08 00:09:13,474][52059] Updated weights for policy 1, policy_version 7342 (0.0008) [2023-10-08 00:09:13,841][52059] Updated weights for policy 1, policy_version 7352 (0.0008) [2023-10-08 00:09:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 14974976. Throughput: 0: 1721.6, 1: 1731.3. Samples: 3758104. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 00:09:16,211][50642] Avg episode reward: [(0, '10.260'), (1, '11.480')] [2023-10-08 00:09:16,323][52060] Updated weights for policy 0, policy_version 7270 (0.0008) [2023-10-08 00:09:16,684][52060] Updated weights for policy 0, policy_version 7280 (0.0008) [2023-10-08 00:09:17,053][52060] Updated weights for policy 0, policy_version 7290 (0.0008) [2023-10-08 00:09:17,771][52059] Updated weights for policy 1, policy_version 7362 (0.0007) [2023-10-08 00:09:18,191][52059] Updated weights for policy 1, policy_version 7372 (0.0007) [2023-10-08 00:09:18,552][52059] Updated weights for policy 1, policy_version 7382 (0.0008) [2023-10-08 00:09:18,921][52059] Updated weights for policy 1, policy_version 7392 (0.0009) [2023-10-08 00:09:20,965][52060] Updated weights for policy 0, policy_version 7300 (0.0009) [2023-10-08 00:09:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 15040512. Throughput: 0: 1710.7, 1: 1729.2. Samples: 3767512. Policy #0 lag: (min: 1.0, avg: 7.1, max: 33.0) [2023-10-08 00:09:21,211][50642] Avg episode reward: [(0, '11.590'), (1, '13.190')] [2023-10-08 00:09:21,347][52060] Updated weights for policy 0, policy_version 7310 (0.0007) [2023-10-08 00:09:21,712][52060] Updated weights for policy 0, policy_version 7320 (0.0007) [2023-10-08 00:09:22,783][52059] Updated weights for policy 1, policy_version 7402 (0.0009) [2023-10-08 00:09:23,140][52059] Updated weights for policy 1, policy_version 7412 (0.0009) [2023-10-08 00:09:23,503][52059] Updated weights for policy 1, policy_version 7422 (0.0008) [2023-10-08 00:09:25,822][52060] Updated weights for policy 0, policy_version 7330 (0.0008) [2023-10-08 00:09:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 15106048. Throughput: 0: 1721.2, 1: 1720.5. Samples: 3788628. Policy #0 lag: (min: 1.0, avg: 7.1, max: 33.0) [2023-10-08 00:09:26,211][50642] Avg episode reward: [(0, '11.140'), (1, '12.110')] [2023-10-08 00:09:26,214][52060] Updated weights for policy 0, policy_version 7340 (0.0008) [2023-10-08 00:09:26,580][52060] Updated weights for policy 0, policy_version 7350 (0.0007) [2023-10-08 00:09:26,949][52060] Updated weights for policy 0, policy_version 7360 (0.0009) [2023-10-08 00:09:27,474][52059] Updated weights for policy 1, policy_version 7432 (0.0009) [2023-10-08 00:09:27,835][52059] Updated weights for policy 1, policy_version 7442 (0.0007) [2023-10-08 00:09:28,197][52059] Updated weights for policy 1, policy_version 7452 (0.0007) [2023-10-08 00:09:30,964][52060] Updated weights for policy 0, policy_version 7370 (0.0007) [2023-10-08 00:09:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 15171584. Throughput: 0: 1717.0, 1: 1747.9. Samples: 3809574. Policy #0 lag: (min: 13.0, avg: 14.0, max: 35.0) [2023-10-08 00:09:31,211][50642] Avg episode reward: [(0, '11.320'), (1, '12.160')] [2023-10-08 00:09:31,336][52060] Updated weights for policy 0, policy_version 7380 (0.0009) [2023-10-08 00:09:31,707][52060] Updated weights for policy 0, policy_version 7390 (0.0009) [2023-10-08 00:09:32,076][52059] Updated weights for policy 1, policy_version 7462 (0.0008) [2023-10-08 00:09:32,442][52059] Updated weights for policy 1, policy_version 7472 (0.0007) [2023-10-08 00:09:32,795][52059] Updated weights for policy 1, policy_version 7482 (0.0008) [2023-10-08 00:09:35,627][52060] Updated weights for policy 0, policy_version 7400 (0.0007) [2023-10-08 00:09:36,002][52060] Updated weights for policy 0, policy_version 7410 (0.0007) [2023-10-08 00:09:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 15237120. Throughput: 0: 1722.5, 1: 1720.5. Samples: 3819176. Policy #0 lag: (min: 13.0, avg: 14.0, max: 35.0) [2023-10-08 00:09:36,211][50642] Avg episode reward: [(0, '10.780'), (1, '12.590')] [2023-10-08 00:09:36,378][52060] Updated weights for policy 0, policy_version 7420 (0.0008) [2023-10-08 00:09:36,722][52059] Updated weights for policy 1, policy_version 7492 (0.0009) [2023-10-08 00:09:37,092][52059] Updated weights for policy 1, policy_version 7502 (0.0008) [2023-10-08 00:09:37,454][52059] Updated weights for policy 1, policy_version 7512 (0.0009) [2023-10-08 00:09:40,426][52060] Updated weights for policy 0, policy_version 7430 (0.0012) [2023-10-08 00:09:40,800][52060] Updated weights for policy 0, policy_version 7440 (0.0009) [2023-10-08 00:09:41,167][52060] Updated weights for policy 0, policy_version 7450 (0.0009) [2023-10-08 00:09:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 15302656. Throughput: 0: 1727.3, 1: 1732.4. Samples: 3840540. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:09:41,211][50642] Avg episode reward: [(0, '10.240'), (1, '11.810')] [2023-10-08 00:09:41,457][52059] Updated weights for policy 1, policy_version 7522 (0.0008) [2023-10-08 00:09:41,816][52059] Updated weights for policy 1, policy_version 7532 (0.0007) [2023-10-08 00:09:42,186][52059] Updated weights for policy 1, policy_version 7542 (0.0008) [2023-10-08 00:09:42,548][52059] Updated weights for policy 1, policy_version 7552 (0.0008) [2023-10-08 00:09:45,114][52060] Updated weights for policy 0, policy_version 7460 (0.0008) [2023-10-08 00:09:45,477][52060] Updated weights for policy 0, policy_version 7470 (0.0008) [2023-10-08 00:09:45,846][52060] Updated weights for policy 0, policy_version 7480 (0.0008) [2023-10-08 00:09:46,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 15400960. Throughput: 0: 1699.7, 1: 1748.0. Samples: 3861072. Policy #0 lag: (min: 31.0, avg: 44.2, max: 63.0) [2023-10-08 00:09:46,211][50642] Avg episode reward: [(0, '10.810'), (1, '13.330')] [2023-10-08 00:09:46,485][52059] Updated weights for policy 1, policy_version 7562 (0.0008) [2023-10-08 00:09:46,845][52059] Updated weights for policy 1, policy_version 7572 (0.0009) [2023-10-08 00:09:47,212][52059] Updated weights for policy 1, policy_version 7582 (0.0007) [2023-10-08 00:09:47,287][51710] Saving new best policy, reward=13.330! [2023-10-08 00:09:49,597][52060] Updated weights for policy 0, policy_version 7490 (0.0009) [2023-10-08 00:09:49,974][52060] Updated weights for policy 0, policy_version 7500 (0.0008) [2023-10-08 00:09:50,347][52060] Updated weights for policy 0, policy_version 7510 (0.0008) [2023-10-08 00:09:50,711][52060] Updated weights for policy 0, policy_version 7520 (0.0009) [2023-10-08 00:09:51,088][52059] Updated weights for policy 1, policy_version 7592 (0.0010) [2023-10-08 00:09:51,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 15466496. Throughput: 0: 1723.6, 1: 1721.5. Samples: 3871322. Policy #0 lag: (min: 31.0, avg: 44.2, max: 63.0) [2023-10-08 00:09:51,211][50642] Avg episode reward: [(0, '11.120'), (1, '12.030')] [2023-10-08 00:09:51,448][52059] Updated weights for policy 1, policy_version 7602 (0.0007) [2023-10-08 00:09:51,812][52059] Updated weights for policy 1, policy_version 7612 (0.0009) [2023-10-08 00:09:54,674][52060] Updated weights for policy 0, policy_version 7530 (0.0007) [2023-10-08 00:09:55,050][52060] Updated weights for policy 0, policy_version 7540 (0.0008) [2023-10-08 00:09:55,422][52060] Updated weights for policy 0, policy_version 7550 (0.0009) [2023-10-08 00:09:55,552][52059] Updated weights for policy 1, policy_version 7622 (0.0009) [2023-10-08 00:09:55,931][52059] Updated weights for policy 1, policy_version 7632 (0.0008) [2023-10-08 00:09:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 15532032. Throughput: 0: 1710.7, 1: 1748.1. Samples: 3892530. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:09:56,211][50642] Avg episode reward: [(0, '11.180'), (1, '12.740')] [2023-10-08 00:09:56,292][52059] Updated weights for policy 1, policy_version 7642 (0.0008) [2023-10-08 00:09:59,406][52060] Updated weights for policy 0, policy_version 7560 (0.0008) [2023-10-08 00:09:59,775][52060] Updated weights for policy 0, policy_version 7570 (0.0009) [2023-10-08 00:10:00,141][52060] Updated weights for policy 0, policy_version 7580 (0.0008) [2023-10-08 00:10:00,407][52059] Updated weights for policy 1, policy_version 7652 (0.0008) [2023-10-08 00:10:00,767][52059] Updated weights for policy 1, policy_version 7662 (0.0010) [2023-10-08 00:10:01,129][52059] Updated weights for policy 1, policy_version 7672 (0.0010) [2023-10-08 00:10:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 15597568. Throughput: 0: 1695.1, 1: 1731.7. Samples: 3912312. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:10:01,211][50642] Avg episode reward: [(0, '10.490'), (1, '11.940')] [2023-10-08 00:10:04,197][52060] Updated weights for policy 0, policy_version 7590 (0.0008) [2023-10-08 00:10:04,555][52060] Updated weights for policy 0, policy_version 7600 (0.0009) [2023-10-08 00:10:04,920][52060] Updated weights for policy 0, policy_version 7610 (0.0008) [2023-10-08 00:10:05,177][52059] Updated weights for policy 1, policy_version 7682 (0.0008) [2023-10-08 00:10:05,590][52059] Updated weights for policy 1, policy_version 7692 (0.0009) [2023-10-08 00:10:05,953][52059] Updated weights for policy 1, policy_version 7702 (0.0008) [2023-10-08 00:10:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 15663104. Throughput: 0: 1729.3, 1: 1743.1. Samples: 3923768. Policy #0 lag: (min: 28.0, avg: 35.0, max: 60.0) [2023-10-08 00:10:06,211][50642] Avg episode reward: [(0, '11.060'), (1, '12.390')] [2023-10-08 00:10:06,317][52059] Updated weights for policy 1, policy_version 7712 (0.0007) [2023-10-08 00:10:08,875][52060] Updated weights for policy 0, policy_version 7620 (0.0008) [2023-10-08 00:10:09,246][52060] Updated weights for policy 0, policy_version 7630 (0.0008) [2023-10-08 00:10:09,619][52060] Updated weights for policy 0, policy_version 7640 (0.0009) [2023-10-08 00:10:10,252][52059] Updated weights for policy 1, policy_version 7722 (0.0009) [2023-10-08 00:10:10,613][52059] Updated weights for policy 1, policy_version 7732 (0.0008) [2023-10-08 00:10:10,965][52059] Updated weights for policy 1, policy_version 7742 (0.0008) [2023-10-08 00:10:11,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 15761408. Throughput: 0: 1705.0, 1: 1741.6. Samples: 3943726. Policy #0 lag: (min: 28.0, avg: 35.0, max: 60.0) [2023-10-08 00:10:11,211][50642] Avg episode reward: [(0, '10.270'), (1, '12.450')] [2023-10-08 00:10:13,641][52060] Updated weights for policy 0, policy_version 7650 (0.0007) [2023-10-08 00:10:14,038][52060] Updated weights for policy 0, policy_version 7660 (0.0009) [2023-10-08 00:10:14,414][52060] Updated weights for policy 0, policy_version 7670 (0.0010) [2023-10-08 00:10:14,795][52060] Updated weights for policy 0, policy_version 7680 (0.0007) [2023-10-08 00:10:14,898][52059] Updated weights for policy 1, policy_version 7752 (0.0007) [2023-10-08 00:10:15,260][52059] Updated weights for policy 1, policy_version 7762 (0.0007) [2023-10-08 00:10:15,627][52059] Updated weights for policy 1, policy_version 7772 (0.0007) [2023-10-08 00:10:16,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 15826944. Throughput: 0: 1703.4, 1: 1712.9. Samples: 3963310. Policy #0 lag: (min: 25.0, avg: 35.0, max: 57.0) [2023-10-08 00:10:16,211][50642] Avg episode reward: [(0, '10.930'), (1, '12.940')] [2023-10-08 00:10:18,919][52060] Updated weights for policy 0, policy_version 7690 (0.0008) [2023-10-08 00:10:19,278][52060] Updated weights for policy 0, policy_version 7700 (0.0010) [2023-10-08 00:10:19,315][52059] Updated weights for policy 1, policy_version 7782 (0.0009) [2023-10-08 00:10:19,647][52060] Updated weights for policy 0, policy_version 7710 (0.0008) [2023-10-08 00:10:19,679][52059] Updated weights for policy 1, policy_version 7792 (0.0009) [2023-10-08 00:10:20,048][52059] Updated weights for policy 1, policy_version 7802 (0.0008) [2023-10-08 00:10:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 15892480. Throughput: 0: 1717.7, 1: 1747.9. Samples: 3975130. Policy #0 lag: (min: 25.0, avg: 35.0, max: 57.0) [2023-10-08 00:10:21,211][50642] Avg episode reward: [(0, '10.860'), (1, '11.840')] [2023-10-08 00:10:23,824][52060] Updated weights for policy 0, policy_version 7720 (0.0009) [2023-10-08 00:10:23,876][52059] Updated weights for policy 1, policy_version 7812 (0.0007) [2023-10-08 00:10:24,188][52060] Updated weights for policy 0, policy_version 7730 (0.0009) [2023-10-08 00:10:24,233][52059] Updated weights for policy 1, policy_version 7822 (0.0007) [2023-10-08 00:10:24,572][52060] Updated weights for policy 0, policy_version 7740 (0.0009) [2023-10-08 00:10:24,604][52059] Updated weights for policy 1, policy_version 7832 (0.0008) [2023-10-08 00:10:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 15958016. Throughput: 0: 1686.0, 1: 1725.2. Samples: 3994044. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:10:26,211][50642] Avg episode reward: [(0, '10.870'), (1, '13.230')] [2023-10-08 00:10:28,528][52060] Updated weights for policy 0, policy_version 7750 (0.0009) [2023-10-08 00:10:28,727][52059] Updated weights for policy 1, policy_version 7842 (0.0007) [2023-10-08 00:10:28,909][52060] Updated weights for policy 0, policy_version 7760 (0.0009) [2023-10-08 00:10:29,099][52059] Updated weights for policy 1, policy_version 7852 (0.0008) [2023-10-08 00:10:29,281][52060] Updated weights for policy 0, policy_version 7770 (0.0008) [2023-10-08 00:10:29,465][52059] Updated weights for policy 1, policy_version 7862 (0.0008) [2023-10-08 00:10:29,840][52059] Updated weights for policy 1, policy_version 7872 (0.0008) [2023-10-08 00:10:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 16023552. Throughput: 0: 1701.5, 1: 1713.2. Samples: 4014730. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:10:31,211][50642] Avg episode reward: [(0, '10.920'), (1, '12.430')] [2023-10-08 00:10:33,340][52060] Updated weights for policy 0, policy_version 7780 (0.0008) [2023-10-08 00:10:33,719][52060] Updated weights for policy 0, policy_version 7790 (0.0007) [2023-10-08 00:10:33,781][52059] Updated weights for policy 1, policy_version 7882 (0.0007) [2023-10-08 00:10:34,088][52060] Updated weights for policy 0, policy_version 7800 (0.0009) [2023-10-08 00:10:34,158][52059] Updated weights for policy 1, policy_version 7892 (0.0007) [2023-10-08 00:10:34,520][52059] Updated weights for policy 1, policy_version 7902 (0.0007) [2023-10-08 00:10:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 16089088. Throughput: 0: 1693.1, 1: 1734.5. Samples: 4025566. Policy #0 lag: (min: 31.0, avg: 36.7, max: 63.0) [2023-10-08 00:10:36,211][50642] Avg episode reward: [(0, '11.200'), (1, '13.160')] [2023-10-08 00:10:38,124][52060] Updated weights for policy 0, policy_version 7810 (0.0009) [2023-10-08 00:10:38,287][52059] Updated weights for policy 1, policy_version 7912 (0.0007) [2023-10-08 00:10:38,490][52060] Updated weights for policy 0, policy_version 7820 (0.0008) [2023-10-08 00:10:38,651][52059] Updated weights for policy 1, policy_version 7922 (0.0007) [2023-10-08 00:10:38,866][52060] Updated weights for policy 0, policy_version 7830 (0.0010) [2023-10-08 00:10:39,017][52059] Updated weights for policy 1, policy_version 7932 (0.0007) [2023-10-08 00:10:39,226][52060] Updated weights for policy 0, policy_version 7840 (0.0009) [2023-10-08 00:10:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 16154624. Throughput: 0: 1685.7, 1: 1712.5. Samples: 4045450. Policy #0 lag: (min: 31.0, avg: 36.7, max: 63.0) [2023-10-08 00:10:41,211][50642] Avg episode reward: [(0, '10.860'), (1, '12.040')] [2023-10-08 00:10:43,145][52059] Updated weights for policy 1, policy_version 7942 (0.0008) [2023-10-08 00:10:43,268][52060] Updated weights for policy 0, policy_version 7850 (0.0009) [2023-10-08 00:10:43,511][52059] Updated weights for policy 1, policy_version 7952 (0.0008) [2023-10-08 00:10:43,638][52060] Updated weights for policy 0, policy_version 7860 (0.0008) [2023-10-08 00:10:43,869][52059] Updated weights for policy 1, policy_version 7962 (0.0008) [2023-10-08 00:10:44,008][52060] Updated weights for policy 0, policy_version 7870 (0.0009) [2023-10-08 00:10:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 16220160. Throughput: 0: 1701.2, 1: 1726.7. Samples: 4066564. Policy #0 lag: (min: 31.0, avg: 37.0, max: 63.0) [2023-10-08 00:10:46,211][50642] Avg episode reward: [(0, '11.050'), (1, '13.670')] [2023-10-08 00:10:46,220][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000007968_8159232.pth... [2023-10-08 00:10:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000007872_8060928.pth... [2023-10-08 00:10:46,250][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000006336_6488064.pth [2023-10-08 00:10:46,253][51710] Saving new best policy, reward=13.670! [2023-10-08 00:10:46,262][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000006272_6422528.pth [2023-10-08 00:10:46,267][51605] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p0/milestones/checkpoint_000007872_8060928.pth [2023-10-08 00:10:46,286][51710] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p1/milestones/checkpoint_000007968_8159232.pth [2023-10-08 00:10:47,757][52059] Updated weights for policy 1, policy_version 7972 (0.0008) [2023-10-08 00:10:48,064][52060] Updated weights for policy 0, policy_version 7880 (0.0008) [2023-10-08 00:10:48,124][52059] Updated weights for policy 1, policy_version 7982 (0.0008) [2023-10-08 00:10:48,430][52060] Updated weights for policy 0, policy_version 7890 (0.0010) [2023-10-08 00:10:48,493][52059] Updated weights for policy 1, policy_version 7992 (0.0008) [2023-10-08 00:10:48,803][52060] Updated weights for policy 0, policy_version 7900 (0.0010) [2023-10-08 00:10:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 16285696. Throughput: 0: 1669.1, 1: 1710.1. Samples: 4075834. Policy #0 lag: (min: 31.0, avg: 37.0, max: 63.0) [2023-10-08 00:10:51,211][50642] Avg episode reward: [(0, '10.340'), (1, '12.350')] [2023-10-08 00:10:52,531][52059] Updated weights for policy 1, policy_version 8002 (0.0007) [2023-10-08 00:10:52,752][52060] Updated weights for policy 0, policy_version 7910 (0.0007) [2023-10-08 00:10:52,900][52059] Updated weights for policy 1, policy_version 8012 (0.0008) [2023-10-08 00:10:53,133][52060] Updated weights for policy 0, policy_version 7920 (0.0008) [2023-10-08 00:10:53,269][52059] Updated weights for policy 1, policy_version 8022 (0.0008) [2023-10-08 00:10:53,504][52060] Updated weights for policy 0, policy_version 7930 (0.0009) [2023-10-08 00:10:53,628][52059] Updated weights for policy 1, policy_version 8032 (0.0008) [2023-10-08 00:10:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 16351232. Throughput: 0: 1690.9, 1: 1713.2. Samples: 4096912. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:10:56,211][50642] Avg episode reward: [(0, '11.030'), (1, '13.370')] [2023-10-08 00:10:57,428][52060] Updated weights for policy 0, policy_version 7940 (0.0008) [2023-10-08 00:10:57,435][52059] Updated weights for policy 1, policy_version 8042 (0.0008) [2023-10-08 00:10:57,802][52059] Updated weights for policy 1, policy_version 8052 (0.0007) [2023-10-08 00:10:57,804][52060] Updated weights for policy 0, policy_version 7950 (0.0008) [2023-10-08 00:10:58,164][52059] Updated weights for policy 1, policy_version 8062 (0.0009) [2023-10-08 00:10:58,164][52060] Updated weights for policy 0, policy_version 7960 (0.0010) [2023-10-08 00:11:01,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 16416768. Throughput: 0: 1705.1, 1: 1739.7. Samples: 4118326. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:11:01,210][50642] Avg episode reward: [(0, '9.940'), (1, '12.320')] [2023-10-08 00:11:02,106][52060] Updated weights for policy 0, policy_version 7970 (0.0008) [2023-10-08 00:11:02,244][52059] Updated weights for policy 1, policy_version 8072 (0.0008) [2023-10-08 00:11:02,481][52060] Updated weights for policy 0, policy_version 7980 (0.0007) [2023-10-08 00:11:02,614][52059] Updated weights for policy 1, policy_version 8082 (0.0008) [2023-10-08 00:11:02,846][52060] Updated weights for policy 0, policy_version 7990 (0.0007) [2023-10-08 00:11:02,973][52059] Updated weights for policy 1, policy_version 8092 (0.0007) [2023-10-08 00:11:03,218][52060] Updated weights for policy 0, policy_version 8000 (0.0007) [2023-10-08 00:11:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 16482304. Throughput: 0: 1682.1, 1: 1703.7. Samples: 4127490. Policy #0 lag: (min: 1.0, avg: 12.8, max: 33.0) [2023-10-08 00:11:06,211][50642] Avg episode reward: [(0, '11.670'), (1, '13.670')] [2023-10-08 00:11:06,936][52059] Updated weights for policy 1, policy_version 8102 (0.0007) [2023-10-08 00:11:07,228][52060] Updated weights for policy 0, policy_version 8010 (0.0007) [2023-10-08 00:11:07,298][52059] Updated weights for policy 1, policy_version 8112 (0.0007) [2023-10-08 00:11:07,593][52060] Updated weights for policy 0, policy_version 8020 (0.0008) [2023-10-08 00:11:07,656][52059] Updated weights for policy 1, policy_version 8122 (0.0007) [2023-10-08 00:11:07,966][52060] Updated weights for policy 0, policy_version 8030 (0.0008) [2023-10-08 00:11:11,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 16547840. Throughput: 0: 1709.8, 1: 1722.3. Samples: 4148490. Policy #0 lag: (min: 1.0, avg: 12.8, max: 33.0) [2023-10-08 00:11:11,211][50642] Avg episode reward: [(0, '10.880'), (1, '10.750')] [2023-10-08 00:11:11,750][52059] Updated weights for policy 1, policy_version 8132 (0.0007) [2023-10-08 00:11:11,768][52060] Updated weights for policy 0, policy_version 8040 (0.0008) [2023-10-08 00:11:12,111][52059] Updated weights for policy 1, policy_version 8142 (0.0008) [2023-10-08 00:11:12,136][52060] Updated weights for policy 0, policy_version 8050 (0.0007) [2023-10-08 00:11:12,472][52059] Updated weights for policy 1, policy_version 8152 (0.0007) [2023-10-08 00:11:12,516][52060] Updated weights for policy 0, policy_version 8060 (0.0008) [2023-10-08 00:11:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 16613376. Throughput: 0: 1717.4, 1: 1733.9. Samples: 4170038. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 00:11:16,210][50642] Avg episode reward: [(0, '11.550'), (1, '14.090')] [2023-10-08 00:11:16,307][52059] Updated weights for policy 1, policy_version 8162 (0.0007) [2023-10-08 00:11:16,671][52059] Updated weights for policy 1, policy_version 8172 (0.0009) [2023-10-08 00:11:16,677][52060] Updated weights for policy 0, policy_version 8070 (0.0007) [2023-10-08 00:11:17,036][52059] Updated weights for policy 1, policy_version 8182 (0.0008) [2023-10-08 00:11:17,045][52060] Updated weights for policy 0, policy_version 8080 (0.0007) [2023-10-08 00:11:17,398][51710] Saving new best policy, reward=14.090! [2023-10-08 00:11:17,403][52059] Updated weights for policy 1, policy_version 8192 (0.0007) [2023-10-08 00:11:17,421][52060] Updated weights for policy 0, policy_version 8090 (0.0009) [2023-10-08 00:11:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 16678912. Throughput: 0: 1700.7, 1: 1715.2. Samples: 4179282. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 00:11:21,211][50642] Avg episode reward: [(0, '11.110'), (1, '11.780')] [2023-10-08 00:11:21,473][52060] Updated weights for policy 0, policy_version 8100 (0.0008) [2023-10-08 00:11:21,483][52059] Updated weights for policy 1, policy_version 8202 (0.0007) [2023-10-08 00:11:21,848][52060] Updated weights for policy 0, policy_version 8110 (0.0008) [2023-10-08 00:11:21,852][52059] Updated weights for policy 1, policy_version 8212 (0.0009) [2023-10-08 00:11:22,216][52060] Updated weights for policy 0, policy_version 8120 (0.0007) [2023-10-08 00:11:22,224][52059] Updated weights for policy 1, policy_version 8222 (0.0008) [2023-10-08 00:11:26,099][52060] Updated weights for policy 0, policy_version 8130 (0.0007) [2023-10-08 00:11:26,147][52059] Updated weights for policy 1, policy_version 8232 (0.0010) [2023-10-08 00:11:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 16744448. Throughput: 0: 1717.5, 1: 1725.9. Samples: 4200400. Policy #0 lag: (min: 2.0, avg: 3.7, max: 31.0) [2023-10-08 00:11:26,211][50642] Avg episode reward: [(0, '11.660'), (1, '13.030')] [2023-10-08 00:11:26,472][52060] Updated weights for policy 0, policy_version 8140 (0.0009) [2023-10-08 00:11:26,512][52059] Updated weights for policy 1, policy_version 8242 (0.0007) [2023-10-08 00:11:26,845][52060] Updated weights for policy 0, policy_version 8150 (0.0008) [2023-10-08 00:11:26,881][52059] Updated weights for policy 1, policy_version 8252 (0.0008) [2023-10-08 00:11:27,224][52060] Updated weights for policy 0, policy_version 8160 (0.0009) [2023-10-08 00:11:30,830][52059] Updated weights for policy 1, policy_version 8262 (0.0010) [2023-10-08 00:11:31,199][52059] Updated weights for policy 1, policy_version 8272 (0.0010) [2023-10-08 00:11:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 16809984. Throughput: 0: 1715.9, 1: 1720.6. Samples: 4221208. Policy #0 lag: (min: 2.0, avg: 3.7, max: 31.0) [2023-10-08 00:11:31,211][50642] Avg episode reward: [(0, '11.590'), (1, '13.010')] [2023-10-08 00:11:31,236][52060] Updated weights for policy 0, policy_version 8170 (0.0009) [2023-10-08 00:11:31,562][52059] Updated weights for policy 1, policy_version 8282 (0.0009) [2023-10-08 00:11:31,607][52060] Updated weights for policy 0, policy_version 8180 (0.0007) [2023-10-08 00:11:31,984][52060] Updated weights for policy 0, policy_version 8190 (0.0008) [2023-10-08 00:11:35,536][52059] Updated weights for policy 1, policy_version 8292 (0.0007) [2023-10-08 00:11:35,878][52060] Updated weights for policy 0, policy_version 8200 (0.0008) [2023-10-08 00:11:35,897][52059] Updated weights for policy 1, policy_version 8302 (0.0009) [2023-10-08 00:11:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 16875520. Throughput: 0: 1712.2, 1: 1732.0. Samples: 4230820. Policy #0 lag: (min: 15.0, avg: 21.6, max: 47.0) [2023-10-08 00:11:36,211][50642] Avg episode reward: [(0, '11.140'), (1, '12.420')] [2023-10-08 00:11:36,251][52060] Updated weights for policy 0, policy_version 8210 (0.0008) [2023-10-08 00:11:36,256][52059] Updated weights for policy 1, policy_version 8312 (0.0007) [2023-10-08 00:11:36,617][52060] Updated weights for policy 0, policy_version 8220 (0.0008) [2023-10-08 00:11:40,198][52059] Updated weights for policy 1, policy_version 8322 (0.0009) [2023-10-08 00:11:40,540][52060] Updated weights for policy 0, policy_version 8230 (0.0007) [2023-10-08 00:11:40,563][52059] Updated weights for policy 1, policy_version 8332 (0.0008) [2023-10-08 00:11:40,905][52060] Updated weights for policy 0, policy_version 8240 (0.0008) [2023-10-08 00:11:40,925][52059] Updated weights for policy 1, policy_version 8342 (0.0009) [2023-10-08 00:11:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 16941056. Throughput: 0: 1718.3, 1: 1734.6. Samples: 4252290. Policy #0 lag: (min: 15.0, avg: 21.6, max: 47.0) [2023-10-08 00:11:41,211][50642] Avg episode reward: [(0, '11.850'), (1, '13.470')] [2023-10-08 00:11:41,275][52060] Updated weights for policy 0, policy_version 8250 (0.0007) [2023-10-08 00:11:41,292][52059] Updated weights for policy 1, policy_version 8352 (0.0008) [2023-10-08 00:11:41,497][51605] Saving new best policy, reward=11.850! [2023-10-08 00:11:45,283][52059] Updated weights for policy 1, policy_version 8362 (0.0009) [2023-10-08 00:11:45,376][52060] Updated weights for policy 0, policy_version 8260 (0.0007) [2023-10-08 00:11:45,641][52059] Updated weights for policy 1, policy_version 8372 (0.0009) [2023-10-08 00:11:45,733][52060] Updated weights for policy 0, policy_version 8270 (0.0008) [2023-10-08 00:11:46,009][52059] Updated weights for policy 1, policy_version 8382 (0.0009) [2023-10-08 00:11:46,112][52060] Updated weights for policy 0, policy_version 8280 (0.0007) [2023-10-08 00:11:46,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 17039360. Throughput: 0: 1698.1, 1: 1708.0. Samples: 4271604. Policy #0 lag: (min: 30.0, avg: 37.9, max: 62.0) [2023-10-08 00:11:46,211][50642] Avg episode reward: [(0, '10.920'), (1, '12.450')] [2023-10-08 00:11:49,913][52059] Updated weights for policy 1, policy_version 8392 (0.0008) [2023-10-08 00:11:50,135][52060] Updated weights for policy 0, policy_version 8290 (0.0008) [2023-10-08 00:11:50,276][52059] Updated weights for policy 1, policy_version 8402 (0.0008) [2023-10-08 00:11:50,526][52060] Updated weights for policy 0, policy_version 8300 (0.0008) [2023-10-08 00:11:50,654][52059] Updated weights for policy 1, policy_version 8412 (0.0008) [2023-10-08 00:11:50,899][52060] Updated weights for policy 0, policy_version 8310 (0.0008) [2023-10-08 00:11:51,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 17104896. Throughput: 0: 1716.2, 1: 1732.7. Samples: 4282690. Policy #0 lag: (min: 30.0, avg: 37.9, max: 62.0) [2023-10-08 00:11:51,212][50642] Avg episode reward: [(0, '11.530'), (1, '12.800')] [2023-10-08 00:11:51,260][52060] Updated weights for policy 0, policy_version 8320 (0.0008) [2023-10-08 00:11:54,626][52059] Updated weights for policy 1, policy_version 8422 (0.0009) [2023-10-08 00:11:54,997][52059] Updated weights for policy 1, policy_version 8432 (0.0007) [2023-10-08 00:11:55,230][52060] Updated weights for policy 0, policy_version 8330 (0.0007) [2023-10-08 00:11:55,370][52059] Updated weights for policy 1, policy_version 8442 (0.0008) [2023-10-08 00:11:55,605][52060] Updated weights for policy 0, policy_version 8340 (0.0008) [2023-10-08 00:11:55,964][52060] Updated weights for policy 0, policy_version 8350 (0.0010) [2023-10-08 00:11:56,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 17203200. Throughput: 0: 1715.2, 1: 1729.3. Samples: 4303488. Policy #0 lag: (min: 31.0, avg: 33.6, max: 63.0) [2023-10-08 00:11:56,211][50642] Avg episode reward: [(0, '10.740'), (1, '12.280')] [2023-10-08 00:11:59,208][52059] Updated weights for policy 1, policy_version 8452 (0.0008) [2023-10-08 00:11:59,573][52059] Updated weights for policy 1, policy_version 8462 (0.0008) [2023-10-08 00:11:59,835][52060] Updated weights for policy 0, policy_version 8360 (0.0008) [2023-10-08 00:11:59,936][52059] Updated weights for policy 1, policy_version 8472 (0.0007) [2023-10-08 00:12:00,206][52060] Updated weights for policy 0, policy_version 8370 (0.0008) [2023-10-08 00:12:00,583][52060] Updated weights for policy 0, policy_version 8380 (0.0008) [2023-10-08 00:12:01,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 17268736. Throughput: 0: 1686.1, 1: 1708.3. Samples: 4322788. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:12:01,211][50642] Avg episode reward: [(0, '11.860'), (1, '13.060')] [2023-10-08 00:12:01,219][51605] Saving new best policy, reward=11.860! [2023-10-08 00:12:04,027][52059] Updated weights for policy 1, policy_version 8482 (0.0008) [2023-10-08 00:12:04,394][52059] Updated weights for policy 1, policy_version 8492 (0.0007) [2023-10-08 00:12:04,569][52060] Updated weights for policy 0, policy_version 8390 (0.0007) [2023-10-08 00:12:04,771][52059] Updated weights for policy 1, policy_version 8502 (0.0008) [2023-10-08 00:12:04,930][52060] Updated weights for policy 0, policy_version 8400 (0.0007) [2023-10-08 00:12:05,138][52059] Updated weights for policy 1, policy_version 8512 (0.0008) [2023-10-08 00:12:05,302][52060] Updated weights for policy 0, policy_version 8410 (0.0010) [2023-10-08 00:12:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 17334272. Throughput: 0: 1720.5, 1: 1735.7. Samples: 4334814. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:12:06,211][50642] Avg episode reward: [(0, '11.110'), (1, '12.900')] [2023-10-08 00:12:09,043][52059] Updated weights for policy 1, policy_version 8522 (0.0010) [2023-10-08 00:12:09,281][52060] Updated weights for policy 0, policy_version 8420 (0.0010) [2023-10-08 00:12:09,414][52059] Updated weights for policy 1, policy_version 8532 (0.0007) [2023-10-08 00:12:09,648][52060] Updated weights for policy 0, policy_version 8430 (0.0007) [2023-10-08 00:12:09,792][52059] Updated weights for policy 1, policy_version 8542 (0.0008) [2023-10-08 00:12:10,017][52060] Updated weights for policy 0, policy_version 8440 (0.0009) [2023-10-08 00:12:11,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 17399808. Throughput: 0: 1703.8, 1: 1709.6. Samples: 4354004. Policy #0 lag: (min: 21.0, avg: 28.9, max: 53.0) [2023-10-08 00:12:11,211][50642] Avg episode reward: [(0, '12.060'), (1, '12.690')] [2023-10-08 00:12:11,212][51605] Saving new best policy, reward=12.060! [2023-10-08 00:12:13,828][52059] Updated weights for policy 1, policy_version 8552 (0.0009) [2023-10-08 00:12:13,986][52060] Updated weights for policy 0, policy_version 8450 (0.0011) [2023-10-08 00:12:14,195][52059] Updated weights for policy 1, policy_version 8562 (0.0010) [2023-10-08 00:12:14,356][52060] Updated weights for policy 0, policy_version 8460 (0.0009) [2023-10-08 00:12:14,564][52059] Updated weights for policy 1, policy_version 8572 (0.0009) [2023-10-08 00:12:14,719][52060] Updated weights for policy 0, policy_version 8470 (0.0007) [2023-10-08 00:12:15,088][52060] Updated weights for policy 0, policy_version 8480 (0.0008) [2023-10-08 00:12:16,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 17465344. Throughput: 0: 1689.1, 1: 1718.0. Samples: 4374526. Policy #0 lag: (min: 21.0, avg: 28.9, max: 53.0) [2023-10-08 00:12:16,211][50642] Avg episode reward: [(0, '11.350'), (1, '12.770')] [2023-10-08 00:12:18,448][52059] Updated weights for policy 1, policy_version 8582 (0.0009) [2023-10-08 00:12:18,810][52059] Updated weights for policy 1, policy_version 8592 (0.0007) [2023-10-08 00:12:19,092][52060] Updated weights for policy 0, policy_version 8490 (0.0008) [2023-10-08 00:12:19,177][52059] Updated weights for policy 1, policy_version 8602 (0.0010) [2023-10-08 00:12:19,466][52060] Updated weights for policy 0, policy_version 8500 (0.0008) [2023-10-08 00:12:19,851][52060] Updated weights for policy 0, policy_version 8510 (0.0011) [2023-10-08 00:12:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 17530880. Throughput: 0: 1717.8, 1: 1727.6. Samples: 4385864. Policy #0 lag: (min: 26.0, avg: 30.5, max: 58.0) [2023-10-08 00:12:21,211][50642] Avg episode reward: [(0, '12.050'), (1, '12.570')] [2023-10-08 00:12:22,986][52059] Updated weights for policy 1, policy_version 8612 (0.0008) [2023-10-08 00:12:23,345][52059] Updated weights for policy 1, policy_version 8622 (0.0007) [2023-10-08 00:12:23,708][52059] Updated weights for policy 1, policy_version 8632 (0.0007) [2023-10-08 00:12:23,938][52060] Updated weights for policy 0, policy_version 8520 (0.0008) [2023-10-08 00:12:24,302][52060] Updated weights for policy 0, policy_version 8530 (0.0011) [2023-10-08 00:12:24,679][52060] Updated weights for policy 0, policy_version 8540 (0.0009) [2023-10-08 00:12:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 17596416. Throughput: 0: 1686.2, 1: 1711.2. Samples: 4405170. Policy #0 lag: (min: 26.0, avg: 30.5, max: 58.0) [2023-10-08 00:12:26,211][50642] Avg episode reward: [(0, '11.460'), (1, '13.600')] [2023-10-08 00:12:27,577][52059] Updated weights for policy 1, policy_version 8642 (0.0007) [2023-10-08 00:12:27,943][52059] Updated weights for policy 1, policy_version 8652 (0.0008) [2023-10-08 00:12:28,308][52059] Updated weights for policy 1, policy_version 8662 (0.0010) [2023-10-08 00:12:28,679][52059] Updated weights for policy 1, policy_version 8672 (0.0008) [2023-10-08 00:12:28,733][52060] Updated weights for policy 0, policy_version 8550 (0.0008) [2023-10-08 00:12:29,100][52060] Updated weights for policy 0, policy_version 8560 (0.0010) [2023-10-08 00:12:29,476][52060] Updated weights for policy 0, policy_version 8570 (0.0008) [2023-10-08 00:12:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 17661952. Throughput: 0: 1692.4, 1: 1741.8. Samples: 4426142. Policy #0 lag: (min: 31.0, avg: 32.9, max: 61.0) [2023-10-08 00:12:31,211][50642] Avg episode reward: [(0, '11.280'), (1, '13.200')] [2023-10-08 00:12:32,795][52059] Updated weights for policy 1, policy_version 8682 (0.0007) [2023-10-08 00:12:33,171][52059] Updated weights for policy 1, policy_version 8692 (0.0009) [2023-10-08 00:12:33,473][52060] Updated weights for policy 0, policy_version 8580 (0.0009) [2023-10-08 00:12:33,535][52059] Updated weights for policy 1, policy_version 8702 (0.0008) [2023-10-08 00:12:33,832][52060] Updated weights for policy 0, policy_version 8590 (0.0009) [2023-10-08 00:12:34,205][52060] Updated weights for policy 0, policy_version 8600 (0.0010) [2023-10-08 00:12:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 17727488. Throughput: 0: 1698.2, 1: 1711.7. Samples: 4436134. Policy #0 lag: (min: 31.0, avg: 32.9, max: 61.0) [2023-10-08 00:12:36,211][50642] Avg episode reward: [(0, '12.460'), (1, '13.160')] [2023-10-08 00:12:36,211][51605] Saving new best policy, reward=12.460! [2023-10-08 00:12:37,374][52059] Updated weights for policy 1, policy_version 8712 (0.0008) [2023-10-08 00:12:37,745][52059] Updated weights for policy 1, policy_version 8722 (0.0007) [2023-10-08 00:12:38,101][52059] Updated weights for policy 1, policy_version 8732 (0.0009) [2023-10-08 00:12:38,303][52060] Updated weights for policy 0, policy_version 8610 (0.0008) [2023-10-08 00:12:38,709][52060] Updated weights for policy 0, policy_version 8620 (0.0009) [2023-10-08 00:12:39,070][52060] Updated weights for policy 0, policy_version 8630 (0.0009) [2023-10-08 00:12:39,451][52060] Updated weights for policy 0, policy_version 8640 (0.0009) [2023-10-08 00:12:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 17793024. Throughput: 0: 1680.4, 1: 1720.3. Samples: 4456516. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:12:41,211][50642] Avg episode reward: [(0, '10.850'), (1, '13.720')] [2023-10-08 00:12:41,944][52059] Updated weights for policy 1, policy_version 8742 (0.0007) [2023-10-08 00:12:42,312][52059] Updated weights for policy 1, policy_version 8752 (0.0009) [2023-10-08 00:12:42,679][52059] Updated weights for policy 1, policy_version 8762 (0.0008) [2023-10-08 00:12:43,411][52060] Updated weights for policy 0, policy_version 8650 (0.0008) [2023-10-08 00:12:43,787][52060] Updated weights for policy 0, policy_version 8660 (0.0009) [2023-10-08 00:12:44,150][52060] Updated weights for policy 0, policy_version 8670 (0.0008) [2023-10-08 00:12:46,211][50642] Fps is (10 sec: 13106.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 17858560. Throughput: 0: 1708.7, 1: 1740.4. Samples: 4477998. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:12:46,212][50642] Avg episode reward: [(0, '13.120'), (1, '12.470')] [2023-10-08 00:12:46,220][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000008768_8978432.pth... [2023-10-08 00:12:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000008672_8880128.pth... [2023-10-08 00:12:46,250][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000007168_7340032.pth [2023-10-08 00:12:46,253][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000007072_7241728.pth [2023-10-08 00:12:46,257][51605] Saving new best policy, reward=13.120! [2023-10-08 00:12:46,732][52059] Updated weights for policy 1, policy_version 8772 (0.0008) [2023-10-08 00:12:47,109][52059] Updated weights for policy 1, policy_version 8782 (0.0007) [2023-10-08 00:12:47,471][52059] Updated weights for policy 1, policy_version 8792 (0.0007) [2023-10-08 00:12:48,049][52060] Updated weights for policy 0, policy_version 8680 (0.0008) [2023-10-08 00:12:48,418][52060] Updated weights for policy 0, policy_version 8690 (0.0007) [2023-10-08 00:12:48,787][52060] Updated weights for policy 0, policy_version 8700 (0.0008) [2023-10-08 00:12:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 17924096. Throughput: 0: 1682.6, 1: 1711.6. Samples: 4487552. Policy #0 lag: (min: 15.0, avg: 20.2, max: 47.0) [2023-10-08 00:12:51,211][50642] Avg episode reward: [(0, '10.590'), (1, '13.330')] [2023-10-08 00:12:51,242][52059] Updated weights for policy 1, policy_version 8802 (0.0008) [2023-10-08 00:12:51,611][52059] Updated weights for policy 1, policy_version 8812 (0.0007) [2023-10-08 00:12:51,967][52059] Updated weights for policy 1, policy_version 8822 (0.0007) [2023-10-08 00:12:52,337][52059] Updated weights for policy 1, policy_version 8832 (0.0007) [2023-10-08 00:12:52,730][52060] Updated weights for policy 0, policy_version 8710 (0.0010) [2023-10-08 00:12:53,102][52060] Updated weights for policy 0, policy_version 8720 (0.0007) [2023-10-08 00:12:53,469][52060] Updated weights for policy 0, policy_version 8730 (0.0008) [2023-10-08 00:12:56,130][52059] Updated weights for policy 1, policy_version 8842 (0.0007) [2023-10-08 00:12:56,210][50642] Fps is (10 sec: 13107.8, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 17989632. Throughput: 0: 1696.1, 1: 1748.5. Samples: 4509012. Policy #0 lag: (min: 15.0, avg: 20.2, max: 47.0) [2023-10-08 00:12:56,211][50642] Avg episode reward: [(0, '13.040'), (1, '12.890')] [2023-10-08 00:12:56,497][52059] Updated weights for policy 1, policy_version 8852 (0.0007) [2023-10-08 00:12:56,862][52059] Updated weights for policy 1, policy_version 8862 (0.0009) [2023-10-08 00:12:57,539][52060] Updated weights for policy 0, policy_version 8740 (0.0008) [2023-10-08 00:12:57,917][52060] Updated weights for policy 0, policy_version 8750 (0.0007) [2023-10-08 00:12:58,287][52060] Updated weights for policy 0, policy_version 8760 (0.0009) [2023-10-08 00:13:00,853][52059] Updated weights for policy 1, policy_version 8872 (0.0010) [2023-10-08 00:13:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 18055168. Throughput: 0: 1706.7, 1: 1742.1. Samples: 4529720. Policy #0 lag: (min: 9.0, avg: 24.0, max: 41.0) [2023-10-08 00:13:01,211][50642] Avg episode reward: [(0, '10.650'), (1, '13.490')] [2023-10-08 00:13:01,218][52059] Updated weights for policy 1, policy_version 8882 (0.0010) [2023-10-08 00:13:01,580][52059] Updated weights for policy 1, policy_version 8892 (0.0007) [2023-10-08 00:13:02,249][52060] Updated weights for policy 0, policy_version 8770 (0.0007) [2023-10-08 00:13:02,622][52060] Updated weights for policy 0, policy_version 8780 (0.0007) [2023-10-08 00:13:02,987][52060] Updated weights for policy 0, policy_version 8790 (0.0008) [2023-10-08 00:13:03,359][52060] Updated weights for policy 0, policy_version 8800 (0.0008) [2023-10-08 00:13:05,444][52059] Updated weights for policy 1, policy_version 8902 (0.0008) [2023-10-08 00:13:05,818][52059] Updated weights for policy 1, policy_version 8912 (0.0009) [2023-10-08 00:13:06,172][52059] Updated weights for policy 1, policy_version 8922 (0.0008) [2023-10-08 00:13:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 18120704. Throughput: 0: 1680.4, 1: 1736.2. Samples: 4539612. Policy #0 lag: (min: 9.0, avg: 24.0, max: 41.0) [2023-10-08 00:13:06,211][50642] Avg episode reward: [(0, '12.700'), (1, '13.460')] [2023-10-08 00:13:07,363][52060] Updated weights for policy 0, policy_version 8810 (0.0009) [2023-10-08 00:13:07,734][52060] Updated weights for policy 0, policy_version 8820 (0.0008) [2023-10-08 00:13:08,096][52060] Updated weights for policy 0, policy_version 8830 (0.0009) [2023-10-08 00:13:10,042][52059] Updated weights for policy 1, policy_version 8932 (0.0008) [2023-10-08 00:13:10,406][52059] Updated weights for policy 1, policy_version 8942 (0.0008) [2023-10-08 00:13:10,776][52059] Updated weights for policy 1, policy_version 8952 (0.0009) [2023-10-08 00:13:11,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 18219008. Throughput: 0: 1712.5, 1: 1752.8. Samples: 4561110. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) [2023-10-08 00:13:11,211][50642] Avg episode reward: [(0, '10.480'), (1, '13.400')] [2023-10-08 00:13:11,987][52060] Updated weights for policy 0, policy_version 8840 (0.0011) [2023-10-08 00:13:12,366][52060] Updated weights for policy 0, policy_version 8850 (0.0011) [2023-10-08 00:13:12,733][52060] Updated weights for policy 0, policy_version 8860 (0.0012) [2023-10-08 00:13:14,662][52059] Updated weights for policy 1, policy_version 8962 (0.0008) [2023-10-08 00:13:15,024][52059] Updated weights for policy 1, policy_version 8972 (0.0010) [2023-10-08 00:13:15,382][52059] Updated weights for policy 1, policy_version 8982 (0.0008) [2023-10-08 00:13:15,750][52059] Updated weights for policy 1, policy_version 8992 (0.0011) [2023-10-08 00:13:16,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 18284544. Throughput: 0: 1721.6, 1: 1721.9. Samples: 4581098. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) [2023-10-08 00:13:16,210][50642] Avg episode reward: [(0, '12.370'), (1, '13.970')] [2023-10-08 00:13:16,699][52060] Updated weights for policy 0, policy_version 8870 (0.0008) [2023-10-08 00:13:17,071][52060] Updated weights for policy 0, policy_version 8880 (0.0007) [2023-10-08 00:13:17,437][52060] Updated weights for policy 0, policy_version 8890 (0.0010) [2023-10-08 00:13:19,689][52059] Updated weights for policy 1, policy_version 9002 (0.0009) [2023-10-08 00:13:20,054][52059] Updated weights for policy 1, policy_version 9012 (0.0009) [2023-10-08 00:13:20,418][52059] Updated weights for policy 1, policy_version 9022 (0.0010) [2023-10-08 00:13:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 18350080. Throughput: 0: 1698.2, 1: 1761.3. Samples: 4591812. Policy #0 lag: (min: 17.0, avg: 32.7, max: 49.0) [2023-10-08 00:13:21,211][50642] Avg episode reward: [(0, '10.560'), (1, '13.320')] [2023-10-08 00:13:21,412][52060] Updated weights for policy 0, policy_version 8900 (0.0008) [2023-10-08 00:13:21,780][52060] Updated weights for policy 0, policy_version 8910 (0.0009) [2023-10-08 00:13:22,148][52060] Updated weights for policy 0, policy_version 8920 (0.0008) [2023-10-08 00:13:24,341][52059] Updated weights for policy 1, policy_version 9032 (0.0011) [2023-10-08 00:13:24,713][52059] Updated weights for policy 1, policy_version 9042 (0.0007) [2023-10-08 00:13:25,078][52059] Updated weights for policy 1, policy_version 9052 (0.0007) [2023-10-08 00:13:26,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 18415616. Throughput: 0: 1712.1, 1: 1738.7. Samples: 4611804. Policy #0 lag: (min: 17.0, avg: 32.7, max: 49.0) [2023-10-08 00:13:26,211][50642] Avg episode reward: [(0, '12.530'), (1, '13.770')] [2023-10-08 00:13:26,340][52060] Updated weights for policy 0, policy_version 8930 (0.0008) [2023-10-08 00:13:26,742][52060] Updated weights for policy 0, policy_version 8940 (0.0007) [2023-10-08 00:13:27,103][52060] Updated weights for policy 0, policy_version 8950 (0.0008) [2023-10-08 00:13:27,476][52060] Updated weights for policy 0, policy_version 8960 (0.0010) [2023-10-08 00:13:29,064][52059] Updated weights for policy 1, policy_version 9062 (0.0008) [2023-10-08 00:13:29,433][52059] Updated weights for policy 1, policy_version 9072 (0.0008) [2023-10-08 00:13:29,796][52059] Updated weights for policy 1, policy_version 9082 (0.0010) [2023-10-08 00:13:31,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 18481152. Throughput: 0: 1705.6, 1: 1729.1. Samples: 4632556. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:13:31,212][50642] Avg episode reward: [(0, '10.650'), (1, '12.780')] [2023-10-08 00:13:31,627][52060] Updated weights for policy 0, policy_version 8970 (0.0009) [2023-10-08 00:13:31,998][52060] Updated weights for policy 0, policy_version 8980 (0.0008) [2023-10-08 00:13:32,367][52060] Updated weights for policy 0, policy_version 8990 (0.0007) [2023-10-08 00:13:33,595][52059] Updated weights for policy 1, policy_version 9092 (0.0008) [2023-10-08 00:13:33,961][52059] Updated weights for policy 1, policy_version 9102 (0.0010) [2023-10-08 00:13:34,323][52059] Updated weights for policy 1, policy_version 9112 (0.0008) [2023-10-08 00:13:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 18546688. Throughput: 0: 1705.4, 1: 1749.6. Samples: 4643028. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:13:36,211][50642] Avg episode reward: [(0, '12.200'), (1, '14.180')] [2023-10-08 00:13:36,213][51710] Saving new best policy, reward=14.180! [2023-10-08 00:13:36,296][52060] Updated weights for policy 0, policy_version 9000 (0.0008) [2023-10-08 00:13:36,658][52060] Updated weights for policy 0, policy_version 9010 (0.0007) [2023-10-08 00:13:37,031][52060] Updated weights for policy 0, policy_version 9020 (0.0009) [2023-10-08 00:13:38,401][52059] Updated weights for policy 1, policy_version 9122 (0.0008) [2023-10-08 00:13:38,770][52059] Updated weights for policy 1, policy_version 9132 (0.0007) [2023-10-08 00:13:39,127][52059] Updated weights for policy 1, policy_version 9142 (0.0009) [2023-10-08 00:13:39,492][52059] Updated weights for policy 1, policy_version 9152 (0.0008) [2023-10-08 00:13:40,963][52060] Updated weights for policy 0, policy_version 9030 (0.0008) [2023-10-08 00:13:41,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 18612224. Throughput: 0: 1712.3, 1: 1720.9. Samples: 4663506. Policy #0 lag: (min: 31.0, avg: 32.0, max: 52.0) [2023-10-08 00:13:41,211][50642] Avg episode reward: [(0, '10.720'), (1, '13.340')] [2023-10-08 00:13:41,338][52060] Updated weights for policy 0, policy_version 9040 (0.0008) [2023-10-08 00:13:41,700][52060] Updated weights for policy 0, policy_version 9050 (0.0009) [2023-10-08 00:13:43,346][52059] Updated weights for policy 1, policy_version 9162 (0.0009) [2023-10-08 00:13:43,713][52059] Updated weights for policy 1, policy_version 9172 (0.0007) [2023-10-08 00:13:44,080][52059] Updated weights for policy 1, policy_version 9182 (0.0008) [2023-10-08 00:13:45,705][52060] Updated weights for policy 0, policy_version 9060 (0.0011) [2023-10-08 00:13:46,074][52060] Updated weights for policy 0, policy_version 9070 (0.0010) [2023-10-08 00:13:46,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.5, 300 sec: 13662.6). Total num frames: 18677760. Throughput: 0: 1710.7, 1: 1726.2. Samples: 4684380. Policy #0 lag: (min: 31.0, avg: 32.0, max: 52.0) [2023-10-08 00:13:46,210][50642] Avg episode reward: [(0, '11.450'), (1, '12.710')] [2023-10-08 00:13:46,460][52060] Updated weights for policy 0, policy_version 9080 (0.0010) [2023-10-08 00:13:48,123][52059] Updated weights for policy 1, policy_version 9192 (0.0008) [2023-10-08 00:13:48,487][52059] Updated weights for policy 1, policy_version 9202 (0.0007) [2023-10-08 00:13:48,852][52059] Updated weights for policy 1, policy_version 9212 (0.0007) [2023-10-08 00:13:50,461][52060] Updated weights for policy 0, policy_version 9090 (0.0009) [2023-10-08 00:13:50,830][52060] Updated weights for policy 0, policy_version 9100 (0.0009) [2023-10-08 00:13:51,185][52060] Updated weights for policy 0, policy_version 9110 (0.0007) [2023-10-08 00:13:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 18743296. Throughput: 0: 1713.9, 1: 1721.6. Samples: 4694206. Policy #0 lag: (min: 26.0, avg: 36.1, max: 58.0) [2023-10-08 00:13:51,211][50642] Avg episode reward: [(0, '11.170'), (1, '14.490')] [2023-10-08 00:13:51,211][51710] Saving new best policy, reward=14.490! [2023-10-08 00:13:51,553][52060] Updated weights for policy 0, policy_version 9120 (0.0008) [2023-10-08 00:13:52,625][52059] Updated weights for policy 1, policy_version 9222 (0.0009) [2023-10-08 00:13:52,993][52059] Updated weights for policy 1, policy_version 9232 (0.0008) [2023-10-08 00:13:53,357][52059] Updated weights for policy 1, policy_version 9242 (0.0007) [2023-10-08 00:13:55,463][52060] Updated weights for policy 0, policy_version 9130 (0.0011) [2023-10-08 00:13:55,826][52060] Updated weights for policy 0, policy_version 9140 (0.0009) [2023-10-08 00:13:56,199][52060] Updated weights for policy 0, policy_version 9150 (0.0010) [2023-10-08 00:13:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 18808832. Throughput: 0: 1717.4, 1: 1717.5. Samples: 4715684. Policy #0 lag: (min: 26.0, avg: 36.1, max: 58.0) [2023-10-08 00:13:56,211][50642] Avg episode reward: [(0, '12.400'), (1, '12.180')] [2023-10-08 00:13:57,323][52059] Updated weights for policy 1, policy_version 9252 (0.0008) [2023-10-08 00:13:57,694][52059] Updated weights for policy 1, policy_version 9262 (0.0011) [2023-10-08 00:13:58,052][52059] Updated weights for policy 1, policy_version 9272 (0.0011) [2023-10-08 00:14:00,020][52060] Updated weights for policy 0, policy_version 9160 (0.0008) [2023-10-08 00:14:00,385][52060] Updated weights for policy 0, policy_version 9170 (0.0009) [2023-10-08 00:14:00,760][52060] Updated weights for policy 0, policy_version 9180 (0.0010) [2023-10-08 00:14:01,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 18907136. Throughput: 0: 1691.2, 1: 1748.3. Samples: 4735874. Policy #0 lag: (min: 31.0, avg: 31.7, max: 48.0) [2023-10-08 00:14:01,211][50642] Avg episode reward: [(0, '11.570'), (1, '14.610')] [2023-10-08 00:14:01,218][51710] Saving new best policy, reward=14.610! [2023-10-08 00:14:02,150][52059] Updated weights for policy 1, policy_version 9282 (0.0010) [2023-10-08 00:14:02,512][52059] Updated weights for policy 1, policy_version 9292 (0.0010) [2023-10-08 00:14:02,872][52059] Updated weights for policy 1, policy_version 9302 (0.0009) [2023-10-08 00:14:03,242][52059] Updated weights for policy 1, policy_version 9312 (0.0012) [2023-10-08 00:14:04,657][52060] Updated weights for policy 0, policy_version 9190 (0.0009) [2023-10-08 00:14:05,039][52060] Updated weights for policy 0, policy_version 9200 (0.0010) [2023-10-08 00:14:05,412][52060] Updated weights for policy 0, policy_version 9210 (0.0010) [2023-10-08 00:14:06,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 18972672. Throughput: 0: 1721.2, 1: 1712.1. Samples: 4746314. Policy #0 lag: (min: 31.0, avg: 36.2, max: 63.0) [2023-10-08 00:14:06,211][50642] Avg episode reward: [(0, '12.220'), (1, '12.040')] [2023-10-08 00:14:07,113][52059] Updated weights for policy 1, policy_version 9322 (0.0007) [2023-10-08 00:14:07,480][52059] Updated weights for policy 1, policy_version 9332 (0.0009) [2023-10-08 00:14:07,842][52059] Updated weights for policy 1, policy_version 9342 (0.0010) [2023-10-08 00:14:09,427][52060] Updated weights for policy 0, policy_version 9220 (0.0008) [2023-10-08 00:14:09,793][52060] Updated weights for policy 0, policy_version 9230 (0.0009) [2023-10-08 00:14:10,166][52060] Updated weights for policy 0, policy_version 9240 (0.0010) [2023-10-08 00:14:11,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 19038208. Throughput: 0: 1710.2, 1: 1734.7. Samples: 4766824. Policy #0 lag: (min: 31.0, avg: 36.2, max: 63.0) [2023-10-08 00:14:11,211][50642] Avg episode reward: [(0, '12.080'), (1, '12.800')] [2023-10-08 00:14:11,729][52059] Updated weights for policy 1, policy_version 9352 (0.0008) [2023-10-08 00:14:12,094][52059] Updated weights for policy 1, policy_version 9362 (0.0010) [2023-10-08 00:14:12,470][52059] Updated weights for policy 1, policy_version 9372 (0.0008) [2023-10-08 00:14:14,183][52060] Updated weights for policy 0, policy_version 9250 (0.0008) [2023-10-08 00:14:14,588][52060] Updated weights for policy 0, policy_version 9260 (0.0008) [2023-10-08 00:14:14,962][52060] Updated weights for policy 0, policy_version 9270 (0.0008) [2023-10-08 00:14:15,335][52060] Updated weights for policy 0, policy_version 9280 (0.0008) [2023-10-08 00:14:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 19103744. Throughput: 0: 1696.1, 1: 1743.5. Samples: 4787336. Policy #0 lag: (min: 31.0, avg: 37.1, max: 63.0) [2023-10-08 00:14:16,211][50642] Avg episode reward: [(0, '11.860'), (1, '15.180')] [2023-10-08 00:14:16,430][52059] Updated weights for policy 1, policy_version 9382 (0.0007) [2023-10-08 00:14:16,796][52059] Updated weights for policy 1, policy_version 9392 (0.0008) [2023-10-08 00:14:17,166][52059] Updated weights for policy 1, policy_version 9402 (0.0009) [2023-10-08 00:14:17,388][51710] Saving new best policy, reward=15.180! [2023-10-08 00:14:19,470][52060] Updated weights for policy 0, policy_version 9290 (0.0007) [2023-10-08 00:14:19,837][52060] Updated weights for policy 0, policy_version 9300 (0.0010) [2023-10-08 00:14:20,207][52060] Updated weights for policy 0, policy_version 9310 (0.0010) [2023-10-08 00:14:21,164][52059] Updated weights for policy 1, policy_version 9412 (0.0008) [2023-10-08 00:14:21,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 19169280. Throughput: 0: 1716.2, 1: 1725.1. Samples: 4797884. Policy #0 lag: (min: 31.0, avg: 37.1, max: 63.0) [2023-10-08 00:14:21,211][50642] Avg episode reward: [(0, '12.760'), (1, '12.090')] [2023-10-08 00:14:21,527][52059] Updated weights for policy 1, policy_version 9422 (0.0008) [2023-10-08 00:14:21,883][52059] Updated weights for policy 1, policy_version 9432 (0.0009) [2023-10-08 00:14:24,270][52060] Updated weights for policy 0, policy_version 9320 (0.0009) [2023-10-08 00:14:24,640][52060] Updated weights for policy 0, policy_version 9330 (0.0008) [2023-10-08 00:14:25,012][52060] Updated weights for policy 0, policy_version 9340 (0.0007) [2023-10-08 00:14:25,517][52059] Updated weights for policy 1, policy_version 9442 (0.0007) [2023-10-08 00:14:25,883][52059] Updated weights for policy 1, policy_version 9452 (0.0011) [2023-10-08 00:14:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 19234816. Throughput: 0: 1689.9, 1: 1752.6. Samples: 4818416. Policy #0 lag: (min: 31.0, avg: 37.2, max: 63.0) [2023-10-08 00:14:26,211][50642] Avg episode reward: [(0, '11.480'), (1, '14.600')] [2023-10-08 00:14:26,258][52059] Updated weights for policy 1, policy_version 9462 (0.0009) [2023-10-08 00:14:26,625][52059] Updated weights for policy 1, policy_version 9472 (0.0007) [2023-10-08 00:14:28,994][52060] Updated weights for policy 0, policy_version 9350 (0.0007) [2023-10-08 00:14:29,364][52060] Updated weights for policy 0, policy_version 9360 (0.0010) [2023-10-08 00:14:29,733][52060] Updated weights for policy 0, policy_version 9370 (0.0007) [2023-10-08 00:14:30,592][52059] Updated weights for policy 1, policy_version 9482 (0.0009) [2023-10-08 00:14:30,955][52059] Updated weights for policy 1, policy_version 9492 (0.0010) [2023-10-08 00:14:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 19300352. Throughput: 0: 1686.9, 1: 1744.4. Samples: 4838790. Policy #0 lag: (min: 31.0, avg: 37.2, max: 63.0) [2023-10-08 00:14:31,211][50642] Avg episode reward: [(0, '12.740'), (1, '13.350')] [2023-10-08 00:14:31,320][52059] Updated weights for policy 1, policy_version 9502 (0.0010) [2023-10-08 00:14:33,534][52060] Updated weights for policy 0, policy_version 9380 (0.0007) [2023-10-08 00:14:33,899][52060] Updated weights for policy 0, policy_version 9390 (0.0008) [2023-10-08 00:14:34,259][52060] Updated weights for policy 0, policy_version 9400 (0.0010) [2023-10-08 00:14:35,255][52059] Updated weights for policy 1, policy_version 9512 (0.0008) [2023-10-08 00:14:35,618][52059] Updated weights for policy 1, policy_version 9522 (0.0008) [2023-10-08 00:14:35,986][52059] Updated weights for policy 1, policy_version 9532 (0.0008) [2023-10-08 00:14:36,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 19398656. Throughput: 0: 1703.6, 1: 1750.6. Samples: 4849644. Policy #0 lag: (min: 31.0, avg: 31.1, max: 36.0) [2023-10-08 00:14:36,211][50642] Avg episode reward: [(0, '11.520'), (1, '12.840')] [2023-10-08 00:14:38,250][52060] Updated weights for policy 0, policy_version 9410 (0.0007) [2023-10-08 00:14:38,619][52060] Updated weights for policy 0, policy_version 9420 (0.0009) [2023-10-08 00:14:38,995][52060] Updated weights for policy 0, policy_version 9430 (0.0007) [2023-10-08 00:14:39,366][52060] Updated weights for policy 0, policy_version 9440 (0.0008) [2023-10-08 00:14:40,050][52059] Updated weights for policy 1, policy_version 9542 (0.0007) [2023-10-08 00:14:40,414][52059] Updated weights for policy 1, policy_version 9552 (0.0007) [2023-10-08 00:14:40,787][52059] Updated weights for policy 1, policy_version 9562 (0.0009) [2023-10-08 00:14:41,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 19464192. Throughput: 0: 1683.4, 1: 1748.4. Samples: 4870114. Policy #0 lag: (min: 31.0, avg: 31.1, max: 36.0) [2023-10-08 00:14:41,211][50642] Avg episode reward: [(0, '12.830'), (1, '14.390')] [2023-10-08 00:14:43,520][52060] Updated weights for policy 0, policy_version 9450 (0.0007) [2023-10-08 00:14:43,893][52060] Updated weights for policy 0, policy_version 9460 (0.0009) [2023-10-08 00:14:44,261][52060] Updated weights for policy 0, policy_version 9470 (0.0010) [2023-10-08 00:14:44,752][52059] Updated weights for policy 1, policy_version 9572 (0.0010) [2023-10-08 00:14:45,127][52059] Updated weights for policy 1, policy_version 9582 (0.0010) [2023-10-08 00:14:45,483][52059] Updated weights for policy 1, policy_version 9592 (0.0009) [2023-10-08 00:14:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 19529728. Throughput: 0: 1706.3, 1: 1715.1. Samples: 4889834. Policy #0 lag: (min: 31.0, avg: 36.7, max: 63.0) [2023-10-08 00:14:46,211][50642] Avg episode reward: [(0, '11.580'), (1, '11.580')] [2023-10-08 00:14:46,217][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000009472_9699328.pth... [2023-10-08 00:14:46,217][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000009600_9830400.pth... [2023-10-08 00:14:46,247][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000007968_8159232.pth [2023-10-08 00:14:46,249][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000007872_8060928.pth [2023-10-08 00:14:48,316][52060] Updated weights for policy 0, policy_version 9480 (0.0009) [2023-10-08 00:14:48,685][52060] Updated weights for policy 0, policy_version 9490 (0.0009) [2023-10-08 00:14:49,052][52060] Updated weights for policy 0, policy_version 9500 (0.0009) [2023-10-08 00:14:49,421][52059] Updated weights for policy 1, policy_version 9602 (0.0009) [2023-10-08 00:14:49,789][52059] Updated weights for policy 1, policy_version 9612 (0.0008) [2023-10-08 00:14:50,144][52059] Updated weights for policy 1, policy_version 9622 (0.0008) [2023-10-08 00:14:50,506][52059] Updated weights for policy 1, policy_version 9632 (0.0008) [2023-10-08 00:14:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 19595264. Throughput: 0: 1688.6, 1: 1749.3. Samples: 4901020. Policy #0 lag: (min: 31.0, avg: 36.7, max: 63.0) [2023-10-08 00:14:51,211][50642] Avg episode reward: [(0, '12.820'), (1, '14.960')] [2023-10-08 00:14:53,239][52060] Updated weights for policy 0, policy_version 9510 (0.0007) [2023-10-08 00:14:53,615][52060] Updated weights for policy 0, policy_version 9520 (0.0007) [2023-10-08 00:14:53,981][52060] Updated weights for policy 0, policy_version 9530 (0.0008) [2023-10-08 00:14:54,629][52059] Updated weights for policy 1, policy_version 9642 (0.0009) [2023-10-08 00:14:54,987][52059] Updated weights for policy 1, policy_version 9652 (0.0008) [2023-10-08 00:14:55,347][52059] Updated weights for policy 1, policy_version 9662 (0.0009) [2023-10-08 00:14:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 19660800. Throughput: 0: 1691.8, 1: 1730.8. Samples: 4920840. Policy #0 lag: (min: 9.0, avg: 29.5, max: 41.0) [2023-10-08 00:14:56,211][50642] Avg episode reward: [(0, '11.730'), (1, '13.680')] [2023-10-08 00:14:57,827][52060] Updated weights for policy 0, policy_version 9540 (0.0008) [2023-10-08 00:14:58,199][52060] Updated weights for policy 0, policy_version 9550 (0.0010) [2023-10-08 00:14:58,567][52060] Updated weights for policy 0, policy_version 9560 (0.0007) [2023-10-08 00:14:59,202][52059] Updated weights for policy 1, policy_version 9672 (0.0009) [2023-10-08 00:14:59,570][52059] Updated weights for policy 1, policy_version 9682 (0.0009) [2023-10-08 00:14:59,934][52059] Updated weights for policy 1, policy_version 9692 (0.0009) [2023-10-08 00:15:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 19726336. Throughput: 0: 1712.2, 1: 1716.2. Samples: 4941616. Policy #0 lag: (min: 9.0, avg: 29.5, max: 41.0) [2023-10-08 00:15:01,211][50642] Avg episode reward: [(0, '12.630'), (1, '13.270')] [2023-10-08 00:15:02,628][52060] Updated weights for policy 0, policy_version 9570 (0.0008) [2023-10-08 00:15:03,007][52060] Updated weights for policy 0, policy_version 9580 (0.0007) [2023-10-08 00:15:03,381][52060] Updated weights for policy 0, policy_version 9590 (0.0007) [2023-10-08 00:15:03,749][52060] Updated weights for policy 0, policy_version 9600 (0.0009) [2023-10-08 00:15:03,811][52059] Updated weights for policy 1, policy_version 9702 (0.0008) [2023-10-08 00:15:04,170][52059] Updated weights for policy 1, policy_version 9712 (0.0008) [2023-10-08 00:15:04,537][52059] Updated weights for policy 1, policy_version 9722 (0.0007) [2023-10-08 00:15:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 19791872. Throughput: 0: 1683.6, 1: 1741.3. Samples: 4952004. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:15:06,211][50642] Avg episode reward: [(0, '11.550'), (1, '15.080')] [2023-10-08 00:15:07,677][52060] Updated weights for policy 0, policy_version 9610 (0.0010) [2023-10-08 00:15:08,047][52060] Updated weights for policy 0, policy_version 9620 (0.0008) [2023-10-08 00:15:08,409][52060] Updated weights for policy 0, policy_version 9630 (0.0008) [2023-10-08 00:15:08,433][52059] Updated weights for policy 1, policy_version 9732 (0.0009) [2023-10-08 00:15:08,791][52059] Updated weights for policy 1, policy_version 9742 (0.0010) [2023-10-08 00:15:09,150][52059] Updated weights for policy 1, policy_version 9752 (0.0010) [2023-10-08 00:15:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 19857408. Throughput: 0: 1708.0, 1: 1710.9. Samples: 4972270. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:15:11,211][50642] Avg episode reward: [(0, '12.600'), (1, '13.390')] [2023-10-08 00:15:12,434][52060] Updated weights for policy 0, policy_version 9640 (0.0007) [2023-10-08 00:15:12,800][52060] Updated weights for policy 0, policy_version 9650 (0.0007) [2023-10-08 00:15:13,049][52059] Updated weights for policy 1, policy_version 9762 (0.0011) [2023-10-08 00:15:13,183][52060] Updated weights for policy 0, policy_version 9660 (0.0007) [2023-10-08 00:15:13,422][52059] Updated weights for policy 1, policy_version 9772 (0.0009) [2023-10-08 00:15:13,780][52059] Updated weights for policy 1, policy_version 9782 (0.0008) [2023-10-08 00:15:14,146][52059] Updated weights for policy 1, policy_version 9792 (0.0009) [2023-10-08 00:15:16,211][50642] Fps is (10 sec: 13106.7, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 19922944. Throughput: 0: 1717.9, 1: 1723.7. Samples: 4993662. Policy #0 lag: (min: 25.0, avg: 46.0, max: 57.0) [2023-10-08 00:15:16,212][50642] Avg episode reward: [(0, '11.760'), (1, '15.050')] [2023-10-08 00:15:17,238][52060] Updated weights for policy 0, policy_version 9670 (0.0007) [2023-10-08 00:15:17,607][52060] Updated weights for policy 0, policy_version 9680 (0.0009) [2023-10-08 00:15:17,976][52060] Updated weights for policy 0, policy_version 9690 (0.0009) [2023-10-08 00:15:18,137][52059] Updated weights for policy 1, policy_version 9802 (0.0009) [2023-10-08 00:15:18,498][52059] Updated weights for policy 1, policy_version 9812 (0.0008) [2023-10-08 00:15:18,863][52059] Updated weights for policy 1, policy_version 9822 (0.0007) [2023-10-08 00:15:21,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 19988480. Throughput: 0: 1697.5, 1: 1711.6. Samples: 5003052. Policy #0 lag: (min: 25.0, avg: 46.0, max: 57.0) [2023-10-08 00:15:21,211][50642] Avg episode reward: [(0, '12.240'), (1, '15.580')] [2023-10-08 00:15:21,213][51710] Saving new best policy, reward=15.580! [2023-10-08 00:15:21,977][52060] Updated weights for policy 0, policy_version 9700 (0.0008) [2023-10-08 00:15:22,358][52060] Updated weights for policy 0, policy_version 9710 (0.0007) [2023-10-08 00:15:22,718][52060] Updated weights for policy 0, policy_version 9720 (0.0007) [2023-10-08 00:15:22,794][52059] Updated weights for policy 1, policy_version 9832 (0.0007) [2023-10-08 00:15:23,150][52059] Updated weights for policy 1, policy_version 9842 (0.0007) [2023-10-08 00:15:23,512][52059] Updated weights for policy 1, policy_version 9852 (0.0010) [2023-10-08 00:15:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 20054016. Throughput: 0: 1711.4, 1: 1713.0. Samples: 5024214. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:15:26,211][50642] Avg episode reward: [(0, '12.050'), (1, '13.200')] [2023-10-08 00:15:26,627][52060] Updated weights for policy 0, policy_version 9730 (0.0008) [2023-10-08 00:15:26,997][52060] Updated weights for policy 0, policy_version 9740 (0.0008) [2023-10-08 00:15:27,367][52060] Updated weights for policy 0, policy_version 9750 (0.0007) [2023-10-08 00:15:27,471][52059] Updated weights for policy 1, policy_version 9862 (0.0009) [2023-10-08 00:15:27,731][52060] Updated weights for policy 0, policy_version 9760 (0.0007) [2023-10-08 00:15:27,840][52059] Updated weights for policy 1, policy_version 9872 (0.0009) [2023-10-08 00:15:28,195][52059] Updated weights for policy 1, policy_version 9882 (0.0008) [2023-10-08 00:15:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 20119552. Throughput: 0: 1713.7, 1: 1744.7. Samples: 5045460. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:15:31,211][50642] Avg episode reward: [(0, '12.280'), (1, '15.070')] [2023-10-08 00:15:31,642][52060] Updated weights for policy 0, policy_version 9770 (0.0010) [2023-10-08 00:15:32,012][52059] Updated weights for policy 1, policy_version 9892 (0.0010) [2023-10-08 00:15:32,015][52060] Updated weights for policy 0, policy_version 9780 (0.0008) [2023-10-08 00:15:32,379][52059] Updated weights for policy 1, policy_version 9902 (0.0008) [2023-10-08 00:15:32,379][52060] Updated weights for policy 0, policy_version 9790 (0.0009) [2023-10-08 00:15:32,738][52059] Updated weights for policy 1, policy_version 9912 (0.0010) [2023-10-08 00:15:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 20185088. Throughput: 0: 1702.3, 1: 1711.6. Samples: 5054648. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:15:36,211][50642] Avg episode reward: [(0, '12.690'), (1, '13.840')] [2023-10-08 00:15:36,217][52060] Updated weights for policy 0, policy_version 9800 (0.0007) [2023-10-08 00:15:36,577][52060] Updated weights for policy 0, policy_version 9810 (0.0009) [2023-10-08 00:15:36,707][52059] Updated weights for policy 1, policy_version 9922 (0.0009) [2023-10-08 00:15:36,943][52060] Updated weights for policy 0, policy_version 9820 (0.0010) [2023-10-08 00:15:37,082][52059] Updated weights for policy 1, policy_version 9932 (0.0009) [2023-10-08 00:15:37,453][52059] Updated weights for policy 1, policy_version 9942 (0.0010) [2023-10-08 00:15:37,821][52059] Updated weights for policy 1, policy_version 9952 (0.0008) [2023-10-08 00:15:41,080][52060] Updated weights for policy 0, policy_version 9830 (0.0009) [2023-10-08 00:15:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 20250624. Throughput: 0: 1711.7, 1: 1728.6. Samples: 5075654. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:15:41,211][50642] Avg episode reward: [(0, '12.100'), (1, '13.110')] [2023-10-08 00:15:41,453][52060] Updated weights for policy 0, policy_version 9840 (0.0010) [2023-10-08 00:15:41,811][52060] Updated weights for policy 0, policy_version 9850 (0.0009) [2023-10-08 00:15:41,965][52059] Updated weights for policy 1, policy_version 9962 (0.0009) [2023-10-08 00:15:42,340][52059] Updated weights for policy 1, policy_version 9972 (0.0009) [2023-10-08 00:15:42,698][52059] Updated weights for policy 1, policy_version 9982 (0.0009) [2023-10-08 00:15:45,819][52060] Updated weights for policy 0, policy_version 9860 (0.0008) [2023-10-08 00:15:46,194][52060] Updated weights for policy 0, policy_version 9870 (0.0008) [2023-10-08 00:15:46,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 20316160. Throughput: 0: 1700.0, 1: 1727.3. Samples: 5095844. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 00:15:46,211][50642] Avg episode reward: [(0, '13.030'), (1, '15.210')] [2023-10-08 00:15:46,563][52060] Updated weights for policy 0, policy_version 9880 (0.0009) [2023-10-08 00:15:46,824][52059] Updated weights for policy 1, policy_version 9992 (0.0008) [2023-10-08 00:15:47,191][52059] Updated weights for policy 1, policy_version 10002 (0.0007) [2023-10-08 00:15:47,559][52059] Updated weights for policy 1, policy_version 10012 (0.0008) [2023-10-08 00:15:50,807][52060] Updated weights for policy 0, policy_version 9890 (0.0008) [2023-10-08 00:15:51,209][52060] Updated weights for policy 0, policy_version 9900 (0.0008) [2023-10-08 00:15:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 20381696. Throughput: 0: 1705.4, 1: 1698.7. Samples: 5105186. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 00:15:51,211][50642] Avg episode reward: [(0, '11.770'), (1, '13.310')] [2023-10-08 00:15:51,581][52059] Updated weights for policy 1, policy_version 10022 (0.0008) [2023-10-08 00:15:51,590][52060] Updated weights for policy 0, policy_version 9910 (0.0007) [2023-10-08 00:15:51,951][52060] Updated weights for policy 0, policy_version 9920 (0.0010) [2023-10-08 00:15:51,954][52059] Updated weights for policy 1, policy_version 10032 (0.0008) [2023-10-08 00:15:52,328][52059] Updated weights for policy 1, policy_version 10042 (0.0010) [2023-10-08 00:15:55,829][52060] Updated weights for policy 0, policy_version 9930 (0.0008) [2023-10-08 00:15:56,190][52060] Updated weights for policy 0, policy_version 9940 (0.0009) [2023-10-08 00:15:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 20447232. Throughput: 0: 1704.5, 1: 1724.2. Samples: 5126562. Policy #0 lag: (min: 24.0, avg: 39.7, max: 56.0) [2023-10-08 00:15:56,211][50642] Avg episode reward: [(0, '13.130'), (1, '14.290')] [2023-10-08 00:15:56,321][52059] Updated weights for policy 1, policy_version 10052 (0.0008) [2023-10-08 00:15:56,569][52060] Updated weights for policy 0, policy_version 9950 (0.0008) [2023-10-08 00:15:56,632][51605] Saving new best policy, reward=13.130! [2023-10-08 00:15:56,676][52059] Updated weights for policy 1, policy_version 10062 (0.0008) [2023-10-08 00:15:57,041][52059] Updated weights for policy 1, policy_version 10072 (0.0008) [2023-10-08 00:16:00,562][52060] Updated weights for policy 0, policy_version 9960 (0.0009) [2023-10-08 00:16:00,939][52060] Updated weights for policy 0, policy_version 9970 (0.0010) [2023-10-08 00:16:01,020][52059] Updated weights for policy 1, policy_version 10082 (0.0008) [2023-10-08 00:16:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 20512768. Throughput: 0: 1694.1, 1: 1716.8. Samples: 5147150. Policy #0 lag: (min: 24.0, avg: 39.7, max: 56.0) [2023-10-08 00:16:01,211][50642] Avg episode reward: [(0, '11.580'), (1, '15.450')] [2023-10-08 00:16:01,310][52060] Updated weights for policy 0, policy_version 9980 (0.0009) [2023-10-08 00:16:01,380][52059] Updated weights for policy 1, policy_version 10092 (0.0007) [2023-10-08 00:16:01,753][52059] Updated weights for policy 1, policy_version 10102 (0.0010) [2023-10-08 00:16:02,113][52059] Updated weights for policy 1, policy_version 10112 (0.0009) [2023-10-08 00:16:05,236][52060] Updated weights for policy 0, policy_version 9990 (0.0008) [2023-10-08 00:16:05,602][52060] Updated weights for policy 0, policy_version 10000 (0.0007) [2023-10-08 00:16:05,962][52060] Updated weights for policy 0, policy_version 10010 (0.0007) [2023-10-08 00:16:05,999][52059] Updated weights for policy 1, policy_version 10122 (0.0008) [2023-10-08 00:16:06,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 20611072. Throughput: 0: 1709.4, 1: 1717.2. Samples: 5157248. Policy #0 lag: (min: 31.0, avg: 37.5, max: 63.0) [2023-10-08 00:16:06,211][50642] Avg episode reward: [(0, '13.010'), (1, '12.970')] [2023-10-08 00:16:06,365][52059] Updated weights for policy 1, policy_version 10132 (0.0007) [2023-10-08 00:16:06,730][52059] Updated weights for policy 1, policy_version 10142 (0.0009) [2023-10-08 00:16:09,977][52060] Updated weights for policy 0, policy_version 10020 (0.0008) [2023-10-08 00:16:10,347][52060] Updated weights for policy 0, policy_version 10030 (0.0008) [2023-10-08 00:16:10,707][52059] Updated weights for policy 1, policy_version 10152 (0.0009) [2023-10-08 00:16:10,711][52060] Updated weights for policy 0, policy_version 10040 (0.0009) [2023-10-08 00:16:11,068][52059] Updated weights for policy 1, policy_version 10162 (0.0008) [2023-10-08 00:16:11,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 20676608. Throughput: 0: 1711.7, 1: 1725.1. Samples: 5178870. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:16:11,211][50642] Avg episode reward: [(0, '11.720'), (1, '15.080')] [2023-10-08 00:16:11,443][52059] Updated weights for policy 1, policy_version 10172 (0.0007) [2023-10-08 00:16:14,626][52060] Updated weights for policy 0, policy_version 10050 (0.0008) [2023-10-08 00:16:15,006][52060] Updated weights for policy 0, policy_version 10060 (0.0008) [2023-10-08 00:16:15,375][52060] Updated weights for policy 0, policy_version 10070 (0.0009) [2023-10-08 00:16:15,401][52059] Updated weights for policy 1, policy_version 10182 (0.0007) [2023-10-08 00:16:15,739][52060] Updated weights for policy 0, policy_version 10080 (0.0008) [2023-10-08 00:16:15,764][52059] Updated weights for policy 1, policy_version 10192 (0.0007) [2023-10-08 00:16:16,120][52059] Updated weights for policy 1, policy_version 10202 (0.0008) [2023-10-08 00:16:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 20742144. Throughput: 0: 1686.2, 1: 1704.8. Samples: 5198056. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:16:16,211][50642] Avg episode reward: [(0, '12.800'), (1, '15.210')] [2023-10-08 00:16:19,725][52060] Updated weights for policy 0, policy_version 10090 (0.0007) [2023-10-08 00:16:20,035][52059] Updated weights for policy 1, policy_version 10212 (0.0010) [2023-10-08 00:16:20,095][52060] Updated weights for policy 0, policy_version 10100 (0.0008) [2023-10-08 00:16:20,395][52059] Updated weights for policy 1, policy_version 10222 (0.0009) [2023-10-08 00:16:20,464][52060] Updated weights for policy 0, policy_version 10110 (0.0007) [2023-10-08 00:16:20,770][52059] Updated weights for policy 1, policy_version 10232 (0.0008) [2023-10-08 00:16:21,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 20840448. Throughput: 0: 1714.1, 1: 1721.7. Samples: 5209258. Policy #0 lag: (min: 31.0, avg: 31.8, max: 50.0) [2023-10-08 00:16:21,211][50642] Avg episode reward: [(0, '12.580'), (1, '13.920')] [2023-10-08 00:16:24,513][52060] Updated weights for policy 0, policy_version 10120 (0.0008) [2023-10-08 00:16:24,776][52059] Updated weights for policy 1, policy_version 10242 (0.0008) [2023-10-08 00:16:24,874][52060] Updated weights for policy 0, policy_version 10130 (0.0007) [2023-10-08 00:16:25,136][52059] Updated weights for policy 1, policy_version 10252 (0.0010) [2023-10-08 00:16:25,249][52060] Updated weights for policy 0, policy_version 10140 (0.0009) [2023-10-08 00:16:25,503][52059] Updated weights for policy 1, policy_version 10262 (0.0010) [2023-10-08 00:16:25,876][52059] Updated weights for policy 1, policy_version 10272 (0.0009) [2023-10-08 00:16:26,210][50642] Fps is (10 sec: 16384.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 20905984. Throughput: 0: 1700.1, 1: 1715.9. Samples: 5229374. Policy #0 lag: (min: 31.0, avg: 31.8, max: 50.0) [2023-10-08 00:16:26,211][50642] Avg episode reward: [(0, '12.600'), (1, '16.380')] [2023-10-08 00:16:26,211][51710] Saving new best policy, reward=16.380! [2023-10-08 00:16:29,257][52060] Updated weights for policy 0, policy_version 10150 (0.0009) [2023-10-08 00:16:29,631][52060] Updated weights for policy 0, policy_version 10160 (0.0009) [2023-10-08 00:16:29,994][52060] Updated weights for policy 0, policy_version 10170 (0.0009) [2023-10-08 00:16:30,000][52059] Updated weights for policy 1, policy_version 10282 (0.0010) [2023-10-08 00:16:30,370][52059] Updated weights for policy 1, policy_version 10292 (0.0007) [2023-10-08 00:16:30,744][52059] Updated weights for policy 1, policy_version 10302 (0.0007) [2023-10-08 00:16:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 20971520. Throughput: 0: 1694.2, 1: 1700.0. Samples: 5248586. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) [2023-10-08 00:16:31,211][50642] Avg episode reward: [(0, '12.310'), (1, '13.390')] [2023-10-08 00:16:34,065][52060] Updated weights for policy 0, policy_version 10180 (0.0008) [2023-10-08 00:16:34,438][52060] Updated weights for policy 0, policy_version 10190 (0.0007) [2023-10-08 00:16:34,604][52059] Updated weights for policy 1, policy_version 10312 (0.0009) [2023-10-08 00:16:34,799][52060] Updated weights for policy 0, policy_version 10200 (0.0009) [2023-10-08 00:16:34,970][52059] Updated weights for policy 1, policy_version 10322 (0.0008) [2023-10-08 00:16:35,330][52059] Updated weights for policy 1, policy_version 10332 (0.0007) [2023-10-08 00:16:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 21037056. Throughput: 0: 1718.1, 1: 1731.4. Samples: 5260414. Policy #0 lag: (min: 31.0, avg: 34.2, max: 63.0) [2023-10-08 00:16:36,211][50642] Avg episode reward: [(0, '12.800'), (1, '14.540')] [2023-10-08 00:16:38,881][52060] Updated weights for policy 0, policy_version 10210 (0.0007) [2023-10-08 00:16:39,290][52060] Updated weights for policy 0, policy_version 10220 (0.0009) [2023-10-08 00:16:39,362][52059] Updated weights for policy 1, policy_version 10342 (0.0008) [2023-10-08 00:16:39,650][52060] Updated weights for policy 0, policy_version 10230 (0.0007) [2023-10-08 00:16:39,727][52059] Updated weights for policy 1, policy_version 10352 (0.0007) [2023-10-08 00:16:40,032][52060] Updated weights for policy 0, policy_version 10240 (0.0008) [2023-10-08 00:16:40,103][52059] Updated weights for policy 1, policy_version 10362 (0.0007) [2023-10-08 00:16:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 21102592. Throughput: 0: 1690.4, 1: 1707.6. Samples: 5279468. Policy #0 lag: (min: 5.0, avg: 9.0, max: 37.0) [2023-10-08 00:16:41,211][50642] Avg episode reward: [(0, '12.480'), (1, '15.640')] [2023-10-08 00:16:43,956][52060] Updated weights for policy 0, policy_version 10250 (0.0008) [2023-10-08 00:16:44,067][52059] Updated weights for policy 1, policy_version 10372 (0.0011) [2023-10-08 00:16:44,330][52060] Updated weights for policy 0, policy_version 10260 (0.0007) [2023-10-08 00:16:44,435][52059] Updated weights for policy 1, policy_version 10382 (0.0010) [2023-10-08 00:16:44,701][52060] Updated weights for policy 0, policy_version 10270 (0.0008) [2023-10-08 00:16:44,793][52059] Updated weights for policy 1, policy_version 10392 (0.0007) [2023-10-08 00:16:46,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 21168128. Throughput: 0: 1695.2, 1: 1698.3. Samples: 5299854. Policy #0 lag: (min: 5.0, avg: 9.0, max: 37.0) [2023-10-08 00:16:46,211][50642] Avg episode reward: [(0, '13.010'), (1, '12.420')] [2023-10-08 00:16:46,224][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000010272_10518528.pth... [2023-10-08 00:16:46,224][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000010400_10649600.pth... [2023-10-08 00:16:46,262][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000008768_8978432.pth [2023-10-08 00:16:46,264][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000008672_8880128.pth [2023-10-08 00:16:48,664][52060] Updated weights for policy 0, policy_version 10280 (0.0009) [2023-10-08 00:16:48,676][52059] Updated weights for policy 1, policy_version 10402 (0.0010) [2023-10-08 00:16:49,026][52060] Updated weights for policy 0, policy_version 10290 (0.0007) [2023-10-08 00:16:49,050][52059] Updated weights for policy 1, policy_version 10412 (0.0010) [2023-10-08 00:16:49,400][52060] Updated weights for policy 0, policy_version 10300 (0.0011) [2023-10-08 00:16:49,414][52059] Updated weights for policy 1, policy_version 10422 (0.0009) [2023-10-08 00:16:49,775][52059] Updated weights for policy 1, policy_version 10432 (0.0009) [2023-10-08 00:16:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13662.6). Total num frames: 21233664. Throughput: 0: 1696.6, 1: 1720.6. Samples: 5311024. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 00:16:51,211][50642] Avg episode reward: [(0, '12.380'), (1, '15.650')] [2023-10-08 00:16:53,301][52060] Updated weights for policy 0, policy_version 10310 (0.0008) [2023-10-08 00:16:53,660][52060] Updated weights for policy 0, policy_version 10320 (0.0008) [2023-10-08 00:16:53,851][52059] Updated weights for policy 1, policy_version 10442 (0.0007) [2023-10-08 00:16:54,035][52060] Updated weights for policy 0, policy_version 10330 (0.0008) [2023-10-08 00:16:54,212][52059] Updated weights for policy 1, policy_version 10452 (0.0009) [2023-10-08 00:16:54,579][52059] Updated weights for policy 1, policy_version 10462 (0.0009) [2023-10-08 00:16:56,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.5, 300 sec: 13662.6). Total num frames: 21299200. Throughput: 0: 1678.6, 1: 1687.3. Samples: 5330338. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 00:16:56,211][50642] Avg episode reward: [(0, '12.540'), (1, '14.400')] [2023-10-08 00:16:58,160][52060] Updated weights for policy 0, policy_version 10340 (0.0008) [2023-10-08 00:16:58,387][52059] Updated weights for policy 1, policy_version 10472 (0.0008) [2023-10-08 00:16:58,527][52060] Updated weights for policy 0, policy_version 10350 (0.0008) [2023-10-08 00:16:58,747][52059] Updated weights for policy 1, policy_version 10482 (0.0007) [2023-10-08 00:16:58,897][52060] Updated weights for policy 0, policy_version 10360 (0.0008) [2023-10-08 00:16:59,100][52059] Updated weights for policy 1, policy_version 10492 (0.0008) [2023-10-08 00:17:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13662.6). Total num frames: 21364736. Throughput: 0: 1702.0, 1: 1714.6. Samples: 5351804. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) [2023-10-08 00:17:01,211][50642] Avg episode reward: [(0, '12.320'), (1, '13.250')] [2023-10-08 00:17:02,778][52060] Updated weights for policy 0, policy_version 10370 (0.0007) [2023-10-08 00:17:02,990][52059] Updated weights for policy 1, policy_version 10502 (0.0008) [2023-10-08 00:17:03,150][52060] Updated weights for policy 0, policy_version 10380 (0.0008) [2023-10-08 00:17:03,353][52059] Updated weights for policy 1, policy_version 10512 (0.0007) [2023-10-08 00:17:03,515][52060] Updated weights for policy 0, policy_version 10390 (0.0008) [2023-10-08 00:17:03,709][52059] Updated weights for policy 1, policy_version 10522 (0.0007) [2023-10-08 00:17:03,880][52060] Updated weights for policy 0, policy_version 10400 (0.0008) [2023-10-08 00:17:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 21430272. Throughput: 0: 1679.6, 1: 1705.3. Samples: 5361578. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) [2023-10-08 00:17:06,211][50642] Avg episode reward: [(0, '11.970'), (1, '15.570')] [2023-10-08 00:17:07,744][52059] Updated weights for policy 1, policy_version 10532 (0.0010) [2023-10-08 00:17:07,993][52060] Updated weights for policy 0, policy_version 10410 (0.0007) [2023-10-08 00:17:08,103][52059] Updated weights for policy 1, policy_version 10542 (0.0009) [2023-10-08 00:17:08,357][52060] Updated weights for policy 0, policy_version 10420 (0.0008) [2023-10-08 00:17:08,473][52059] Updated weights for policy 1, policy_version 10552 (0.0010) [2023-10-08 00:17:08,726][52060] Updated weights for policy 0, policy_version 10430 (0.0008) [2023-10-08 00:17:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 21495808. Throughput: 0: 1689.9, 1: 1705.4. Samples: 5382160. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:17:11,211][50642] Avg episode reward: [(0, '12.850'), (1, '14.320')] [2023-10-08 00:17:12,468][52059] Updated weights for policy 1, policy_version 10562 (0.0009) [2023-10-08 00:17:12,831][52059] Updated weights for policy 1, policy_version 10572 (0.0009) [2023-10-08 00:17:12,867][52060] Updated weights for policy 0, policy_version 10440 (0.0008) [2023-10-08 00:17:13,195][52059] Updated weights for policy 1, policy_version 10582 (0.0007) [2023-10-08 00:17:13,236][52060] Updated weights for policy 0, policy_version 10450 (0.0009) [2023-10-08 00:17:13,561][52059] Updated weights for policy 1, policy_version 10592 (0.0009) [2023-10-08 00:17:13,610][52060] Updated weights for policy 0, policy_version 10460 (0.0008) [2023-10-08 00:17:16,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 21561344. Throughput: 0: 1701.3, 1: 1732.2. Samples: 5403094. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:17:16,211][50642] Avg episode reward: [(0, '12.000'), (1, '14.160')] [2023-10-08 00:17:17,629][52060] Updated weights for policy 0, policy_version 10470 (0.0009) [2023-10-08 00:17:17,630][52059] Updated weights for policy 1, policy_version 10602 (0.0009) [2023-10-08 00:17:17,987][52059] Updated weights for policy 1, policy_version 10612 (0.0007) [2023-10-08 00:17:17,994][52060] Updated weights for policy 0, policy_version 10480 (0.0007) [2023-10-08 00:17:18,352][52059] Updated weights for policy 1, policy_version 10622 (0.0009) [2023-10-08 00:17:18,369][52060] Updated weights for policy 0, policy_version 10490 (0.0007) [2023-10-08 00:17:21,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 21626880. Throughput: 0: 1676.6, 1: 1702.7. Samples: 5412486. Policy #0 lag: (min: 28.0, avg: 28.2, max: 37.0) [2023-10-08 00:17:21,211][50642] Avg episode reward: [(0, '13.090'), (1, '15.650')] [2023-10-08 00:17:22,118][52059] Updated weights for policy 1, policy_version 10632 (0.0007) [2023-10-08 00:17:22,391][52060] Updated weights for policy 0, policy_version 10500 (0.0008) [2023-10-08 00:17:22,480][52059] Updated weights for policy 1, policy_version 10642 (0.0008) [2023-10-08 00:17:22,751][52060] Updated weights for policy 0, policy_version 10510 (0.0009) [2023-10-08 00:17:22,849][52059] Updated weights for policy 1, policy_version 10652 (0.0009) [2023-10-08 00:17:23,129][52060] Updated weights for policy 0, policy_version 10520 (0.0010) [2023-10-08 00:17:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 21692416. Throughput: 0: 1701.2, 1: 1729.1. Samples: 5433832. Policy #0 lag: (min: 28.0, avg: 28.2, max: 37.0) [2023-10-08 00:17:26,211][50642] Avg episode reward: [(0, '12.040'), (1, '13.520')] [2023-10-08 00:17:26,680][52059] Updated weights for policy 1, policy_version 10662 (0.0008) [2023-10-08 00:17:27,042][52059] Updated weights for policy 1, policy_version 10672 (0.0009) [2023-10-08 00:17:27,209][52060] Updated weights for policy 0, policy_version 10530 (0.0009) [2023-10-08 00:17:27,414][52059] Updated weights for policy 1, policy_version 10682 (0.0008) [2023-10-08 00:17:27,625][52060] Updated weights for policy 0, policy_version 10540 (0.0009) [2023-10-08 00:17:27,993][52060] Updated weights for policy 0, policy_version 10550 (0.0008) [2023-10-08 00:17:28,367][52060] Updated weights for policy 0, policy_version 10560 (0.0007) [2023-10-08 00:17:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 21757952. Throughput: 0: 1704.1, 1: 1739.4. Samples: 5454812. Policy #0 lag: (min: 25.0, avg: 30.3, max: 57.0) [2023-10-08 00:17:31,211][50642] Avg episode reward: [(0, '12.770'), (1, '14.100')] [2023-10-08 00:17:31,442][52059] Updated weights for policy 1, policy_version 10692 (0.0008) [2023-10-08 00:17:31,810][52059] Updated weights for policy 1, policy_version 10702 (0.0009) [2023-10-08 00:17:32,185][52059] Updated weights for policy 1, policy_version 10712 (0.0009) [2023-10-08 00:17:32,375][52060] Updated weights for policy 0, policy_version 10570 (0.0008) [2023-10-08 00:17:32,750][52060] Updated weights for policy 0, policy_version 10580 (0.0009) [2023-10-08 00:17:33,118][52060] Updated weights for policy 0, policy_version 10590 (0.0009) [2023-10-08 00:17:36,004][52059] Updated weights for policy 1, policy_version 10722 (0.0007) [2023-10-08 00:17:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 21823488. Throughput: 0: 1686.1, 1: 1718.0. Samples: 5464212. Policy #0 lag: (min: 25.0, avg: 30.3, max: 57.0) [2023-10-08 00:17:36,211][50642] Avg episode reward: [(0, '12.630'), (1, '14.980')] [2023-10-08 00:17:36,374][52059] Updated weights for policy 1, policy_version 10732 (0.0010) [2023-10-08 00:17:36,742][52059] Updated weights for policy 1, policy_version 10742 (0.0010) [2023-10-08 00:17:37,027][52060] Updated weights for policy 0, policy_version 10600 (0.0009) [2023-10-08 00:17:37,110][52059] Updated weights for policy 1, policy_version 10752 (0.0008) [2023-10-08 00:17:37,387][52060] Updated weights for policy 0, policy_version 10610 (0.0010) [2023-10-08 00:17:37,765][52060] Updated weights for policy 0, policy_version 10620 (0.0010) [2023-10-08 00:17:40,971][52059] Updated weights for policy 1, policy_version 10762 (0.0009) [2023-10-08 00:17:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 21889024. Throughput: 0: 1703.4, 1: 1747.5. Samples: 5485626. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) [2023-10-08 00:17:41,211][50642] Avg episode reward: [(0, '12.570'), (1, '14.160')] [2023-10-08 00:17:41,337][52059] Updated weights for policy 1, policy_version 10772 (0.0007) [2023-10-08 00:17:41,715][52059] Updated weights for policy 1, policy_version 10782 (0.0008) [2023-10-08 00:17:41,820][52060] Updated weights for policy 0, policy_version 10630 (0.0009) [2023-10-08 00:17:42,195][52060] Updated weights for policy 0, policy_version 10640 (0.0007) [2023-10-08 00:17:42,562][52060] Updated weights for policy 0, policy_version 10650 (0.0008) [2023-10-08 00:17:45,677][52059] Updated weights for policy 1, policy_version 10792 (0.0010) [2023-10-08 00:17:46,030][52059] Updated weights for policy 1, policy_version 10802 (0.0010) [2023-10-08 00:17:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 21954560. Throughput: 0: 1703.0, 1: 1730.8. Samples: 5506322. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) [2023-10-08 00:17:46,211][50642] Avg episode reward: [(0, '11.880'), (1, '14.400')] [2023-10-08 00:17:46,392][52059] Updated weights for policy 1, policy_version 10812 (0.0010) [2023-10-08 00:17:46,557][52060] Updated weights for policy 0, policy_version 10660 (0.0008) [2023-10-08 00:17:46,937][52060] Updated weights for policy 0, policy_version 10670 (0.0009) [2023-10-08 00:17:47,307][52060] Updated weights for policy 0, policy_version 10680 (0.0009) [2023-10-08 00:17:50,348][52059] Updated weights for policy 1, policy_version 10822 (0.0008) [2023-10-08 00:17:50,714][52059] Updated weights for policy 1, policy_version 10832 (0.0010) [2023-10-08 00:17:51,091][52059] Updated weights for policy 1, policy_version 10842 (0.0009) [2023-10-08 00:17:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 22020096. Throughput: 0: 1697.1, 1: 1734.2. Samples: 5515990. Policy #0 lag: (min: 27.0, avg: 28.9, max: 57.0) [2023-10-08 00:17:51,211][50642] Avg episode reward: [(0, '12.270'), (1, '13.340')] [2023-10-08 00:17:51,332][52060] Updated weights for policy 0, policy_version 10690 (0.0008) [2023-10-08 00:17:51,706][52060] Updated weights for policy 0, policy_version 10700 (0.0010) [2023-10-08 00:17:52,077][52060] Updated weights for policy 0, policy_version 10710 (0.0008) [2023-10-08 00:17:52,441][52060] Updated weights for policy 0, policy_version 10720 (0.0008) [2023-10-08 00:17:54,933][52059] Updated weights for policy 1, policy_version 10852 (0.0009) [2023-10-08 00:17:55,307][52059] Updated weights for policy 1, policy_version 10862 (0.0011) [2023-10-08 00:17:55,669][52059] Updated weights for policy 1, policy_version 10872 (0.0011) [2023-10-08 00:17:56,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 22118400. Throughput: 0: 1701.1, 1: 1743.2. Samples: 5537152. Policy #0 lag: (min: 27.0, avg: 28.9, max: 57.0) [2023-10-08 00:17:56,211][50642] Avg episode reward: [(0, '13.000'), (1, '14.610')] [2023-10-08 00:17:56,273][52060] Updated weights for policy 0, policy_version 10730 (0.0007) [2023-10-08 00:17:56,634][52060] Updated weights for policy 0, policy_version 10740 (0.0008) [2023-10-08 00:17:57,018][52060] Updated weights for policy 0, policy_version 10750 (0.0010) [2023-10-08 00:17:59,567][52059] Updated weights for policy 1, policy_version 10882 (0.0009) [2023-10-08 00:17:59,929][52059] Updated weights for policy 1, policy_version 10892 (0.0007) [2023-10-08 00:18:00,290][52059] Updated weights for policy 1, policy_version 10902 (0.0007) [2023-10-08 00:18:00,659][52059] Updated weights for policy 1, policy_version 10912 (0.0009) [2023-10-08 00:18:00,933][52060] Updated weights for policy 0, policy_version 10760 (0.0011) [2023-10-08 00:18:01,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 22183936. Throughput: 0: 1705.5, 1: 1721.6. Samples: 5557312. Policy #0 lag: (min: 2.0, avg: 10.0, max: 34.0) [2023-10-08 00:18:01,211][50642] Avg episode reward: [(0, '12.050'), (1, '15.510')] [2023-10-08 00:18:01,310][52060] Updated weights for policy 0, policy_version 10770 (0.0008) [2023-10-08 00:18:01,680][52060] Updated weights for policy 0, policy_version 10780 (0.0008) [2023-10-08 00:18:04,716][52059] Updated weights for policy 1, policy_version 10922 (0.0008) [2023-10-08 00:18:05,089][52059] Updated weights for policy 1, policy_version 10932 (0.0009) [2023-10-08 00:18:05,452][52059] Updated weights for policy 1, policy_version 10942 (0.0007) [2023-10-08 00:18:05,847][52060] Updated weights for policy 0, policy_version 10790 (0.0008) [2023-10-08 00:18:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 22249472. Throughput: 0: 1705.5, 1: 1753.4. Samples: 5568138. Policy #0 lag: (min: 2.0, avg: 10.0, max: 34.0) [2023-10-08 00:18:06,212][50642] Avg episode reward: [(0, '13.190'), (1, '14.090')] [2023-10-08 00:18:06,212][52060] Updated weights for policy 0, policy_version 10800 (0.0007) [2023-10-08 00:18:06,589][52060] Updated weights for policy 0, policy_version 10810 (0.0010) [2023-10-08 00:18:06,810][51605] Saving new best policy, reward=13.190! [2023-10-08 00:18:09,410][52059] Updated weights for policy 1, policy_version 10952 (0.0010) [2023-10-08 00:18:09,779][52059] Updated weights for policy 1, policy_version 10962 (0.0011) [2023-10-08 00:18:10,143][52059] Updated weights for policy 1, policy_version 10972 (0.0012) [2023-10-08 00:18:10,676][52060] Updated weights for policy 0, policy_version 10820 (0.0008) [2023-10-08 00:18:11,045][52060] Updated weights for policy 0, policy_version 10830 (0.0009) [2023-10-08 00:18:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 22315008. Throughput: 0: 1706.4, 1: 1727.4. Samples: 5588352. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 00:18:11,211][50642] Avg episode reward: [(0, '11.690'), (1, '15.860')] [2023-10-08 00:18:11,412][52060] Updated weights for policy 0, policy_version 10840 (0.0007) [2023-10-08 00:18:14,125][52059] Updated weights for policy 1, policy_version 10982 (0.0007) [2023-10-08 00:18:14,490][52059] Updated weights for policy 1, policy_version 10992 (0.0008) [2023-10-08 00:18:14,875][52059] Updated weights for policy 1, policy_version 11002 (0.0009) [2023-10-08 00:18:15,288][52060] Updated weights for policy 0, policy_version 10850 (0.0008) [2023-10-08 00:18:15,682][52060] Updated weights for policy 0, policy_version 10860 (0.0011) [2023-10-08 00:18:16,061][52060] Updated weights for policy 0, policy_version 10870 (0.0009) [2023-10-08 00:18:16,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 22380544. Throughput: 0: 1696.5, 1: 1721.1. Samples: 5608602. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 00:18:16,211][50642] Avg episode reward: [(0, '12.890'), (1, '15.950')] [2023-10-08 00:18:16,432][52060] Updated weights for policy 0, policy_version 10880 (0.0008) [2023-10-08 00:18:18,771][52059] Updated weights for policy 1, policy_version 11012 (0.0009) [2023-10-08 00:18:19,136][52059] Updated weights for policy 1, policy_version 11022 (0.0008) [2023-10-08 00:18:19,500][52059] Updated weights for policy 1, policy_version 11032 (0.0008) [2023-10-08 00:18:20,357][52060] Updated weights for policy 0, policy_version 10890 (0.0009) [2023-10-08 00:18:20,738][52060] Updated weights for policy 0, policy_version 10900 (0.0010) [2023-10-08 00:18:21,103][52060] Updated weights for policy 0, policy_version 10910 (0.0008) [2023-10-08 00:18:21,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 22478848. Throughput: 0: 1711.3, 1: 1744.6. Samples: 5619728. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 00:18:21,211][50642] Avg episode reward: [(0, '11.690'), (1, '13.390')] [2023-10-08 00:18:23,378][52059] Updated weights for policy 1, policy_version 11042 (0.0007) [2023-10-08 00:18:23,732][52059] Updated weights for policy 1, policy_version 11052 (0.0009) [2023-10-08 00:18:24,105][52059] Updated weights for policy 1, policy_version 11062 (0.0008) [2023-10-08 00:18:24,474][52059] Updated weights for policy 1, policy_version 11072 (0.0009) [2023-10-08 00:18:25,235][52060] Updated weights for policy 0, policy_version 10920 (0.0009) [2023-10-08 00:18:25,593][52060] Updated weights for policy 0, policy_version 10930 (0.0011) [2023-10-08 00:18:25,963][52060] Updated weights for policy 0, policy_version 10940 (0.0008) [2023-10-08 00:18:26,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 22544384. Throughput: 0: 1711.9, 1: 1717.3. Samples: 5639938. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:18:26,211][50642] Avg episode reward: [(0, '13.430'), (1, '14.630')] [2023-10-08 00:18:26,213][51605] Saving new best policy, reward=13.430! [2023-10-08 00:18:28,508][52059] Updated weights for policy 1, policy_version 11082 (0.0011) [2023-10-08 00:18:28,873][52059] Updated weights for policy 1, policy_version 11092 (0.0010) [2023-10-08 00:18:29,234][52059] Updated weights for policy 1, policy_version 11102 (0.0011) [2023-10-08 00:18:29,832][52060] Updated weights for policy 0, policy_version 10950 (0.0008) [2023-10-08 00:18:30,199][52060] Updated weights for policy 0, policy_version 10960 (0.0011) [2023-10-08 00:18:30,565][52060] Updated weights for policy 0, policy_version 10970 (0.0010) [2023-10-08 00:18:31,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 22609920. Throughput: 0: 1691.9, 1: 1725.1. Samples: 5660090. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:18:31,211][50642] Avg episode reward: [(0, '12.100'), (1, '15.760')] [2023-10-08 00:18:33,239][52059] Updated weights for policy 1, policy_version 11112 (0.0009) [2023-10-08 00:18:33,605][52059] Updated weights for policy 1, policy_version 11122 (0.0009) [2023-10-08 00:18:33,964][52059] Updated weights for policy 1, policy_version 11132 (0.0009) [2023-10-08 00:18:34,424][52060] Updated weights for policy 0, policy_version 10980 (0.0008) [2023-10-08 00:18:34,785][52060] Updated weights for policy 0, policy_version 10990 (0.0007) [2023-10-08 00:18:35,147][52060] Updated weights for policy 0, policy_version 11000 (0.0007) [2023-10-08 00:18:36,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 22675456. Throughput: 0: 1722.7, 1: 1724.8. Samples: 5671126. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:18:36,211][50642] Avg episode reward: [(0, '13.010'), (1, '13.670')] [2023-10-08 00:18:37,709][52059] Updated weights for policy 1, policy_version 11142 (0.0009) [2023-10-08 00:18:38,075][52059] Updated weights for policy 1, policy_version 11152 (0.0007) [2023-10-08 00:18:38,449][52059] Updated weights for policy 1, policy_version 11162 (0.0008) [2023-10-08 00:18:39,122][52060] Updated weights for policy 0, policy_version 11010 (0.0010) [2023-10-08 00:18:39,484][52060] Updated weights for policy 0, policy_version 11020 (0.0009) [2023-10-08 00:18:39,859][52060] Updated weights for policy 0, policy_version 11030 (0.0009) [2023-10-08 00:18:40,235][52060] Updated weights for policy 0, policy_version 11040 (0.0009) [2023-10-08 00:18:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 22740992. Throughput: 0: 1707.6, 1: 1717.4. Samples: 5691274. Policy #0 lag: (min: 26.0, avg: 41.4, max: 58.0) [2023-10-08 00:18:41,211][50642] Avg episode reward: [(0, '12.310'), (1, '15.220')] [2023-10-08 00:18:42,229][52059] Updated weights for policy 1, policy_version 11172 (0.0008) [2023-10-08 00:18:42,594][52059] Updated weights for policy 1, policy_version 11182 (0.0007) [2023-10-08 00:18:42,963][52059] Updated weights for policy 1, policy_version 11192 (0.0008) [2023-10-08 00:18:44,139][52060] Updated weights for policy 0, policy_version 11050 (0.0008) [2023-10-08 00:18:44,505][52060] Updated weights for policy 0, policy_version 11060 (0.0007) [2023-10-08 00:18:44,876][52060] Updated weights for policy 0, policy_version 11070 (0.0008) [2023-10-08 00:18:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 22806528. Throughput: 0: 1699.0, 1: 1741.6. Samples: 5712140. Policy #0 lag: (min: 26.0, avg: 41.4, max: 58.0) [2023-10-08 00:18:46,211][50642] Avg episode reward: [(0, '12.620'), (1, '14.570')] [2023-10-08 00:18:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000011072_11337728.pth... [2023-10-08 00:18:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000011200_11468800.pth... [2023-10-08 00:18:46,250][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000009472_9699328.pth [2023-10-08 00:18:46,258][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000009600_9830400.pth [2023-10-08 00:18:47,057][52059] Updated weights for policy 1, policy_version 11202 (0.0009) [2023-10-08 00:18:47,426][52059] Updated weights for policy 1, policy_version 11212 (0.0009) [2023-10-08 00:18:47,783][52059] Updated weights for policy 1, policy_version 11222 (0.0008) [2023-10-08 00:18:48,149][52059] Updated weights for policy 1, policy_version 11232 (0.0009) [2023-10-08 00:18:48,845][52060] Updated weights for policy 0, policy_version 11080 (0.0007) [2023-10-08 00:18:49,209][52060] Updated weights for policy 0, policy_version 11090 (0.0007) [2023-10-08 00:18:49,585][52060] Updated weights for policy 0, policy_version 11100 (0.0007) [2023-10-08 00:18:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 22872064. Throughput: 0: 1727.2, 1: 1707.4. Samples: 5722692. Policy #0 lag: (min: 0.0, avg: 25.3, max: 32.0) [2023-10-08 00:18:51,211][50642] Avg episode reward: [(0, '12.340'), (1, '14.290')] [2023-10-08 00:18:52,179][52059] Updated weights for policy 1, policy_version 11242 (0.0009) [2023-10-08 00:18:52,545][52059] Updated weights for policy 1, policy_version 11252 (0.0008) [2023-10-08 00:18:52,921][52059] Updated weights for policy 1, policy_version 11262 (0.0008) [2023-10-08 00:18:53,536][52060] Updated weights for policy 0, policy_version 11110 (0.0007) [2023-10-08 00:18:53,901][52060] Updated weights for policy 0, policy_version 11120 (0.0008) [2023-10-08 00:18:54,270][52060] Updated weights for policy 0, policy_version 11130 (0.0008) [2023-10-08 00:18:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 22937600. Throughput: 0: 1704.7, 1: 1737.2. Samples: 5743236. Policy #0 lag: (min: 0.0, avg: 25.3, max: 32.0) [2023-10-08 00:18:56,211][50642] Avg episode reward: [(0, '12.320'), (1, '15.800')] [2023-10-08 00:18:56,769][52059] Updated weights for policy 1, policy_version 11272 (0.0008) [2023-10-08 00:18:57,138][52059] Updated weights for policy 1, policy_version 11282 (0.0007) [2023-10-08 00:18:57,502][52059] Updated weights for policy 1, policy_version 11292 (0.0007) [2023-10-08 00:18:58,062][52060] Updated weights for policy 0, policy_version 11140 (0.0007) [2023-10-08 00:18:58,435][52060] Updated weights for policy 0, policy_version 11150 (0.0007) [2023-10-08 00:18:58,801][52060] Updated weights for policy 0, policy_version 11160 (0.0009) [2023-10-08 00:19:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 23003136. Throughput: 0: 1719.5, 1: 1744.9. Samples: 5764500. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 00:19:01,211][50642] Avg episode reward: [(0, '12.330'), (1, '15.320')] [2023-10-08 00:19:01,369][52059] Updated weights for policy 1, policy_version 11302 (0.0008) [2023-10-08 00:19:01,737][52059] Updated weights for policy 1, policy_version 11312 (0.0009) [2023-10-08 00:19:02,106][52059] Updated weights for policy 1, policy_version 11322 (0.0008) [2023-10-08 00:19:02,886][52060] Updated weights for policy 0, policy_version 11170 (0.0010) [2023-10-08 00:19:03,260][52060] Updated weights for policy 0, policy_version 11180 (0.0008) [2023-10-08 00:19:03,632][52060] Updated weights for policy 0, policy_version 11190 (0.0008) [2023-10-08 00:19:04,004][52060] Updated weights for policy 0, policy_version 11200 (0.0009) [2023-10-08 00:19:06,123][52059] Updated weights for policy 1, policy_version 11332 (0.0009) [2023-10-08 00:19:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 23068672. Throughput: 0: 1711.8, 1: 1720.3. Samples: 5774174. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 00:19:06,211][50642] Avg episode reward: [(0, '12.360'), (1, '14.720')] [2023-10-08 00:19:06,484][52059] Updated weights for policy 1, policy_version 11342 (0.0007) [2023-10-08 00:19:06,849][52059] Updated weights for policy 1, policy_version 11352 (0.0007) [2023-10-08 00:19:08,139][52060] Updated weights for policy 0, policy_version 11210 (0.0009) [2023-10-08 00:19:08,515][52060] Updated weights for policy 0, policy_version 11220 (0.0009) [2023-10-08 00:19:08,882][52060] Updated weights for policy 0, policy_version 11230 (0.0010) [2023-10-08 00:19:10,755][52059] Updated weights for policy 1, policy_version 11362 (0.0008) [2023-10-08 00:19:11,120][52059] Updated weights for policy 1, policy_version 11372 (0.0008) [2023-10-08 00:19:11,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 23134208. Throughput: 0: 1703.0, 1: 1744.4. Samples: 5795072. Policy #0 lag: (min: 17.0, avg: 30.2, max: 49.0) [2023-10-08 00:19:11,211][50642] Avg episode reward: [(0, '13.110'), (1, '15.250')] [2023-10-08 00:19:11,487][52059] Updated weights for policy 1, policy_version 11382 (0.0007) [2023-10-08 00:19:11,853][52059] Updated weights for policy 1, policy_version 11392 (0.0008) [2023-10-08 00:19:12,904][52060] Updated weights for policy 0, policy_version 11240 (0.0008) [2023-10-08 00:19:13,273][52060] Updated weights for policy 0, policy_version 11250 (0.0009) [2023-10-08 00:19:13,639][52060] Updated weights for policy 0, policy_version 11260 (0.0009) [2023-10-08 00:19:15,796][52059] Updated weights for policy 1, policy_version 11402 (0.0009) [2023-10-08 00:19:16,161][52059] Updated weights for policy 1, policy_version 11412 (0.0009) [2023-10-08 00:19:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 23199744. Throughput: 0: 1723.9, 1: 1737.6. Samples: 5815856. Policy #0 lag: (min: 17.0, avg: 30.2, max: 49.0) [2023-10-08 00:19:16,211][50642] Avg episode reward: [(0, '12.390'), (1, '14.500')] [2023-10-08 00:19:16,527][52059] Updated weights for policy 1, policy_version 11422 (0.0008) [2023-10-08 00:19:17,657][52060] Updated weights for policy 0, policy_version 11270 (0.0008) [2023-10-08 00:19:18,024][52060] Updated weights for policy 0, policy_version 11280 (0.0007) [2023-10-08 00:19:18,386][52060] Updated weights for policy 0, policy_version 11290 (0.0009) [2023-10-08 00:19:20,477][52059] Updated weights for policy 1, policy_version 11432 (0.0007) [2023-10-08 00:19:20,836][52059] Updated weights for policy 1, policy_version 11442 (0.0008) [2023-10-08 00:19:21,203][52059] Updated weights for policy 1, policy_version 11452 (0.0009) [2023-10-08 00:19:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 23265280. Throughput: 0: 1693.0, 1: 1742.1. Samples: 5825704. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:19:21,211][50642] Avg episode reward: [(0, '13.080'), (1, '16.520')] [2023-10-08 00:19:21,353][51710] Saving new best policy, reward=16.520! [2023-10-08 00:19:22,361][52060] Updated weights for policy 0, policy_version 11300 (0.0007) [2023-10-08 00:19:22,717][52060] Updated weights for policy 0, policy_version 11310 (0.0009) [2023-10-08 00:19:23,092][52060] Updated weights for policy 0, policy_version 11320 (0.0010) [2023-10-08 00:19:25,174][52059] Updated weights for policy 1, policy_version 11462 (0.0008) [2023-10-08 00:19:25,539][52059] Updated weights for policy 1, policy_version 11472 (0.0009) [2023-10-08 00:19:25,901][52059] Updated weights for policy 1, policy_version 11482 (0.0008) [2023-10-08 00:19:26,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 23363584. Throughput: 0: 1708.4, 1: 1748.6. Samples: 5846840. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:19:26,211][50642] Avg episode reward: [(0, '12.210'), (1, '14.980')] [2023-10-08 00:19:26,975][52060] Updated weights for policy 0, policy_version 11330 (0.0009) [2023-10-08 00:19:27,352][52060] Updated weights for policy 0, policy_version 11340 (0.0010) [2023-10-08 00:19:27,720][52060] Updated weights for policy 0, policy_version 11350 (0.0009) [2023-10-08 00:19:28,097][52060] Updated weights for policy 0, policy_version 11360 (0.0007) [2023-10-08 00:19:29,670][52059] Updated weights for policy 1, policy_version 11492 (0.0009) [2023-10-08 00:19:30,027][52059] Updated weights for policy 1, policy_version 11502 (0.0011) [2023-10-08 00:19:30,403][52059] Updated weights for policy 1, policy_version 11512 (0.0008) [2023-10-08 00:19:31,211][50642] Fps is (10 sec: 16383.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 23429120. Throughput: 0: 1721.4, 1: 1722.5. Samples: 5867116. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:19:31,212][50642] Avg episode reward: [(0, '12.590'), (1, '15.360')] [2023-10-08 00:19:31,798][52060] Updated weights for policy 0, policy_version 11370 (0.0007) [2023-10-08 00:19:32,166][52060] Updated weights for policy 0, policy_version 11380 (0.0007) [2023-10-08 00:19:32,541][52060] Updated weights for policy 0, policy_version 11390 (0.0008) [2023-10-08 00:19:34,411][52059] Updated weights for policy 1, policy_version 11522 (0.0008) [2023-10-08 00:19:34,770][52059] Updated weights for policy 1, policy_version 11532 (0.0008) [2023-10-08 00:19:35,138][52059] Updated weights for policy 1, policy_version 11542 (0.0008) [2023-10-08 00:19:35,503][52059] Updated weights for policy 1, policy_version 11552 (0.0009) [2023-10-08 00:19:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 23494656. Throughput: 0: 1693.0, 1: 1757.3. Samples: 5877958. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 00:19:36,211][50642] Avg episode reward: [(0, '12.420'), (1, '14.180')] [2023-10-08 00:19:36,634][52060] Updated weights for policy 0, policy_version 11400 (0.0007) [2023-10-08 00:19:36,995][52060] Updated weights for policy 0, policy_version 11410 (0.0009) [2023-10-08 00:19:37,375][52060] Updated weights for policy 0, policy_version 11420 (0.0008) [2023-10-08 00:19:39,256][52059] Updated weights for policy 1, policy_version 11562 (0.0011) [2023-10-08 00:19:39,620][52059] Updated weights for policy 1, policy_version 11572 (0.0010) [2023-10-08 00:19:39,984][52059] Updated weights for policy 1, policy_version 11582 (0.0011) [2023-10-08 00:19:41,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 23560192. Throughput: 0: 1715.2, 1: 1730.3. Samples: 5898284. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 00:19:41,211][50642] Avg episode reward: [(0, '12.050'), (1, '12.690')] [2023-10-08 00:19:41,516][52060] Updated weights for policy 0, policy_version 11430 (0.0008) [2023-10-08 00:19:41,886][52060] Updated weights for policy 0, policy_version 11440 (0.0008) [2023-10-08 00:19:42,245][52060] Updated weights for policy 0, policy_version 11450 (0.0009) [2023-10-08 00:19:43,956][52059] Updated weights for policy 1, policy_version 11592 (0.0009) [2023-10-08 00:19:44,332][52059] Updated weights for policy 1, policy_version 11602 (0.0009) [2023-10-08 00:19:44,698][52059] Updated weights for policy 1, policy_version 11612 (0.0008) [2023-10-08 00:19:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 23625728. Throughput: 0: 1713.6, 1: 1726.0. Samples: 5919282. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:19:46,211][50642] Avg episode reward: [(0, '13.250'), (1, '14.700')] [2023-10-08 00:19:46,368][52060] Updated weights for policy 0, policy_version 11460 (0.0009) [2023-10-08 00:19:46,735][52060] Updated weights for policy 0, policy_version 11470 (0.0009) [2023-10-08 00:19:47,104][52060] Updated weights for policy 0, policy_version 11480 (0.0010) [2023-10-08 00:19:48,677][52059] Updated weights for policy 1, policy_version 11622 (0.0007) [2023-10-08 00:19:49,040][52059] Updated weights for policy 1, policy_version 11632 (0.0008) [2023-10-08 00:19:49,411][52059] Updated weights for policy 1, policy_version 11642 (0.0008) [2023-10-08 00:19:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 23691264. Throughput: 0: 1704.3, 1: 1742.9. Samples: 5929296. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:19:51,211][50642] Avg episode reward: [(0, '11.640'), (1, '13.290')] [2023-10-08 00:19:51,252][52060] Updated weights for policy 0, policy_version 11490 (0.0008) [2023-10-08 00:19:51,626][52060] Updated weights for policy 0, policy_version 11500 (0.0008) [2023-10-08 00:19:51,992][52060] Updated weights for policy 0, policy_version 11510 (0.0008) [2023-10-08 00:19:52,365][52060] Updated weights for policy 0, policy_version 11520 (0.0008) [2023-10-08 00:19:53,213][52059] Updated weights for policy 1, policy_version 11652 (0.0008) [2023-10-08 00:19:53,572][52059] Updated weights for policy 1, policy_version 11662 (0.0008) [2023-10-08 00:19:53,938][52059] Updated weights for policy 1, policy_version 11672 (0.0010) [2023-10-08 00:19:56,186][52060] Updated weights for policy 0, policy_version 11530 (0.0008) [2023-10-08 00:19:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 23756800. Throughput: 0: 1715.9, 1: 1725.1. Samples: 5949918. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-08 00:19:56,211][50642] Avg episode reward: [(0, '12.920'), (1, '15.280')] [2023-10-08 00:19:56,546][52060] Updated weights for policy 0, policy_version 11540 (0.0007) [2023-10-08 00:19:56,928][52060] Updated weights for policy 0, policy_version 11550 (0.0008) [2023-10-08 00:19:57,828][52059] Updated weights for policy 1, policy_version 11682 (0.0008) [2023-10-08 00:19:58,196][52059] Updated weights for policy 1, policy_version 11692 (0.0010) [2023-10-08 00:19:58,553][52059] Updated weights for policy 1, policy_version 11702 (0.0011) [2023-10-08 00:19:58,919][52059] Updated weights for policy 1, policy_version 11712 (0.0010) [2023-10-08 00:20:00,960][52060] Updated weights for policy 0, policy_version 11560 (0.0009) [2023-10-08 00:20:01,211][50642] Fps is (10 sec: 13106.7, 60 sec: 13653.2, 300 sec: 13662.6). Total num frames: 23822336. Throughput: 0: 1713.5, 1: 1734.7. Samples: 5971024. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-08 00:20:01,212][50642] Avg episode reward: [(0, '11.580'), (1, '15.500')] [2023-10-08 00:20:01,334][52060] Updated weights for policy 0, policy_version 11570 (0.0010) [2023-10-08 00:20:01,703][52060] Updated weights for policy 0, policy_version 11580 (0.0008) [2023-10-08 00:20:02,896][52059] Updated weights for policy 1, policy_version 11722 (0.0010) [2023-10-08 00:20:03,261][52059] Updated weights for policy 1, policy_version 11732 (0.0007) [2023-10-08 00:20:03,628][52059] Updated weights for policy 1, policy_version 11742 (0.0008) [2023-10-08 00:20:05,668][52060] Updated weights for policy 0, policy_version 11590 (0.0009) [2023-10-08 00:20:06,036][52060] Updated weights for policy 0, policy_version 11600 (0.0007) [2023-10-08 00:20:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 23887872. Throughput: 0: 1723.2, 1: 1721.2. Samples: 5980700. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:20:06,211][50642] Avg episode reward: [(0, '13.400'), (1, '15.150')] [2023-10-08 00:20:06,401][52060] Updated weights for policy 0, policy_version 11610 (0.0008) [2023-10-08 00:20:07,642][52059] Updated weights for policy 1, policy_version 11752 (0.0009) [2023-10-08 00:20:08,003][52059] Updated weights for policy 1, policy_version 11762 (0.0009) [2023-10-08 00:20:08,363][52059] Updated weights for policy 1, policy_version 11772 (0.0009) [2023-10-08 00:20:10,369][52060] Updated weights for policy 0, policy_version 11620 (0.0009) [2023-10-08 00:20:10,735][52060] Updated weights for policy 0, policy_version 11630 (0.0010) [2023-10-08 00:20:11,112][52060] Updated weights for policy 0, policy_version 11640 (0.0009) [2023-10-08 00:20:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 23953408. Throughput: 0: 1726.3, 1: 1725.2. Samples: 6002156. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:20:11,211][50642] Avg episode reward: [(0, '12.330'), (1, '14.870')] [2023-10-08 00:20:12,104][52059] Updated weights for policy 1, policy_version 11782 (0.0009) [2023-10-08 00:20:12,467][52059] Updated weights for policy 1, policy_version 11792 (0.0010) [2023-10-08 00:20:12,825][52059] Updated weights for policy 1, policy_version 11802 (0.0010) [2023-10-08 00:20:14,905][52060] Updated weights for policy 0, policy_version 11650 (0.0010) [2023-10-08 00:20:15,271][52060] Updated weights for policy 0, policy_version 11660 (0.0009) [2023-10-08 00:20:15,651][52060] Updated weights for policy 0, policy_version 11670 (0.0010) [2023-10-08 00:20:16,016][52060] Updated weights for policy 0, policy_version 11680 (0.0008) [2023-10-08 00:20:16,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 24051712. Throughput: 0: 1698.7, 1: 1752.8. Samples: 6022432. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) [2023-10-08 00:20:16,211][50642] Avg episode reward: [(0, '12.430'), (1, '13.860')] [2023-10-08 00:20:16,868][52059] Updated weights for policy 1, policy_version 11812 (0.0008) [2023-10-08 00:20:17,227][52059] Updated weights for policy 1, policy_version 11822 (0.0007) [2023-10-08 00:20:17,595][52059] Updated weights for policy 1, policy_version 11832 (0.0007) [2023-10-08 00:20:20,130][52060] Updated weights for policy 0, policy_version 11690 (0.0010) [2023-10-08 00:20:20,496][52060] Updated weights for policy 0, policy_version 11700 (0.0010) [2023-10-08 00:20:20,870][52060] Updated weights for policy 0, policy_version 11710 (0.0007) [2023-10-08 00:20:21,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 24117248. Throughput: 0: 1718.5, 1: 1720.8. Samples: 6032726. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) [2023-10-08 00:20:21,211][50642] Avg episode reward: [(0, '12.710'), (1, '13.620')] [2023-10-08 00:20:21,502][52059] Updated weights for policy 1, policy_version 11842 (0.0008) [2023-10-08 00:20:21,872][52059] Updated weights for policy 1, policy_version 11852 (0.0008) [2023-10-08 00:20:22,237][52059] Updated weights for policy 1, policy_version 11862 (0.0008) [2023-10-08 00:20:22,596][52059] Updated weights for policy 1, policy_version 11872 (0.0007) [2023-10-08 00:20:24,713][52060] Updated weights for policy 0, policy_version 11720 (0.0007) [2023-10-08 00:20:25,084][52060] Updated weights for policy 0, policy_version 11730 (0.0007) [2023-10-08 00:20:25,450][52060] Updated weights for policy 0, policy_version 11740 (0.0007) [2023-10-08 00:20:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 24182784. Throughput: 0: 1709.6, 1: 1740.1. Samples: 6053520. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) [2023-10-08 00:20:26,211][50642] Avg episode reward: [(0, '12.140'), (1, '14.410')] [2023-10-08 00:20:26,480][52059] Updated weights for policy 1, policy_version 11882 (0.0010) [2023-10-08 00:20:26,846][52059] Updated weights for policy 1, policy_version 11892 (0.0007) [2023-10-08 00:20:27,202][52059] Updated weights for policy 1, policy_version 11902 (0.0007) [2023-10-08 00:20:29,442][52060] Updated weights for policy 0, policy_version 11750 (0.0008) [2023-10-08 00:20:29,813][52060] Updated weights for policy 0, policy_version 11760 (0.0009) [2023-10-08 00:20:30,175][52060] Updated weights for policy 0, policy_version 11770 (0.0008) [2023-10-08 00:20:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 24248320. Throughput: 0: 1688.4, 1: 1748.1. Samples: 6073928. Policy #0 lag: (min: 9.0, avg: 21.9, max: 41.0) [2023-10-08 00:20:31,211][50642] Avg episode reward: [(0, '13.530'), (1, '14.190')] [2023-10-08 00:20:31,217][51605] Saving new best policy, reward=13.530! [2023-10-08 00:20:31,239][52059] Updated weights for policy 1, policy_version 11912 (0.0007) [2023-10-08 00:20:31,597][52059] Updated weights for policy 1, policy_version 11922 (0.0007) [2023-10-08 00:20:31,956][52059] Updated weights for policy 1, policy_version 11932 (0.0008) [2023-10-08 00:20:34,255][52060] Updated weights for policy 0, policy_version 11780 (0.0008) [2023-10-08 00:20:34,618][52060] Updated weights for policy 0, policy_version 11790 (0.0009) [2023-10-08 00:20:34,991][52060] Updated weights for policy 0, policy_version 11800 (0.0008) [2023-10-08 00:20:35,729][52059] Updated weights for policy 1, policy_version 11942 (0.0010) [2023-10-08 00:20:36,101][52059] Updated weights for policy 1, policy_version 11952 (0.0008) [2023-10-08 00:20:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 24313856. Throughput: 0: 1725.2, 1: 1729.2. Samples: 6084740. Policy #0 lag: (min: 9.0, avg: 21.9, max: 41.0) [2023-10-08 00:20:36,211][50642] Avg episode reward: [(0, '12.090'), (1, '15.090')] [2023-10-08 00:20:36,458][52059] Updated weights for policy 1, policy_version 11962 (0.0007) [2023-10-08 00:20:39,087][52060] Updated weights for policy 0, policy_version 11810 (0.0007) [2023-10-08 00:20:39,502][52060] Updated weights for policy 0, policy_version 11820 (0.0011) [2023-10-08 00:20:39,867][52060] Updated weights for policy 0, policy_version 11830 (0.0010) [2023-10-08 00:20:40,199][52059] Updated weights for policy 1, policy_version 11972 (0.0008) [2023-10-08 00:20:40,234][52060] Updated weights for policy 0, policy_version 11840 (0.0009) [2023-10-08 00:20:40,570][52059] Updated weights for policy 1, policy_version 11982 (0.0009) [2023-10-08 00:20:40,924][52059] Updated weights for policy 1, policy_version 11992 (0.0008) [2023-10-08 00:20:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 24379392. Throughput: 0: 1698.7, 1: 1756.0. Samples: 6105380. Policy #0 lag: (min: 31.0, avg: 31.4, max: 45.0) [2023-10-08 00:20:41,211][50642] Avg episode reward: [(0, '13.750'), (1, '15.540')] [2023-10-08 00:20:41,211][51605] Saving new best policy, reward=13.750! [2023-10-08 00:20:44,105][52060] Updated weights for policy 0, policy_version 11850 (0.0007) [2023-10-08 00:20:44,467][52060] Updated weights for policy 0, policy_version 11860 (0.0009) [2023-10-08 00:20:44,841][52060] Updated weights for policy 0, policy_version 11870 (0.0008) [2023-10-08 00:20:45,029][52059] Updated weights for policy 1, policy_version 12002 (0.0009) [2023-10-08 00:20:45,387][52059] Updated weights for policy 1, policy_version 12012 (0.0009) [2023-10-08 00:20:45,752][52059] Updated weights for policy 1, policy_version 12022 (0.0008) [2023-10-08 00:20:46,118][52059] Updated weights for policy 1, policy_version 12032 (0.0008) [2023-10-08 00:20:46,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 24477696. Throughput: 0: 1693.7, 1: 1731.1. Samples: 6125138. Policy #0 lag: (min: 31.0, avg: 31.4, max: 45.0) [2023-10-08 00:20:46,211][50642] Avg episode reward: [(0, '11.770'), (1, '15.040')] [2023-10-08 00:20:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000011872_12156928.pth... [2023-10-08 00:20:46,218][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000012032_12320768.pth... [2023-10-08 00:20:46,248][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000010400_10649600.pth [2023-10-08 00:20:46,249][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000010272_10518528.pth [2023-10-08 00:20:48,963][52060] Updated weights for policy 0, policy_version 11880 (0.0009) [2023-10-08 00:20:49,327][52060] Updated weights for policy 0, policy_version 11890 (0.0008) [2023-10-08 00:20:49,696][52060] Updated weights for policy 0, policy_version 11900 (0.0008) [2023-10-08 00:20:50,082][52059] Updated weights for policy 1, policy_version 12042 (0.0009) [2023-10-08 00:20:50,445][52059] Updated weights for policy 1, policy_version 12052 (0.0007) [2023-10-08 00:20:50,820][52059] Updated weights for policy 1, policy_version 12062 (0.0011) [2023-10-08 00:20:51,211][50642] Fps is (10 sec: 16383.4, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 24543232. Throughput: 0: 1708.3, 1: 1752.3. Samples: 6136424. Policy #0 lag: (min: 31.0, avg: 31.4, max: 45.0) [2023-10-08 00:20:51,211][50642] Avg episode reward: [(0, '13.330'), (1, '14.560')] [2023-10-08 00:20:53,720][52060] Updated weights for policy 0, policy_version 11910 (0.0007) [2023-10-08 00:20:54,082][52060] Updated weights for policy 0, policy_version 11920 (0.0009) [2023-10-08 00:20:54,459][52060] Updated weights for policy 0, policy_version 11930 (0.0009) [2023-10-08 00:20:54,747][52059] Updated weights for policy 1, policy_version 12072 (0.0010) [2023-10-08 00:20:55,111][52059] Updated weights for policy 1, policy_version 12082 (0.0007) [2023-10-08 00:20:55,480][52059] Updated weights for policy 1, policy_version 12092 (0.0009) [2023-10-08 00:20:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 24608768. Throughput: 0: 1684.9, 1: 1740.2. Samples: 6156284. Policy #0 lag: (min: 1.0, avg: 15.1, max: 33.0) [2023-10-08 00:20:56,211][50642] Avg episode reward: [(0, '12.190'), (1, '14.750')] [2023-10-08 00:20:58,338][52060] Updated weights for policy 0, policy_version 11940 (0.0009) [2023-10-08 00:20:58,714][52060] Updated weights for policy 0, policy_version 11950 (0.0011) [2023-10-08 00:20:59,078][52060] Updated weights for policy 0, policy_version 11960 (0.0010) [2023-10-08 00:20:59,537][52059] Updated weights for policy 1, policy_version 12102 (0.0008) [2023-10-08 00:20:59,909][52059] Updated weights for policy 1, policy_version 12112 (0.0007) [2023-10-08 00:21:00,278][52059] Updated weights for policy 1, policy_version 12122 (0.0008) [2023-10-08 00:21:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 24674304. Throughput: 0: 1712.2, 1: 1716.7. Samples: 6176736. Policy #0 lag: (min: 1.0, avg: 15.1, max: 33.0) [2023-10-08 00:21:01,212][50642] Avg episode reward: [(0, '13.000'), (1, '15.310')] [2023-10-08 00:21:03,040][52060] Updated weights for policy 0, policy_version 11970 (0.0008) [2023-10-08 00:21:03,415][52060] Updated weights for policy 0, policy_version 11980 (0.0009) [2023-10-08 00:21:03,776][52060] Updated weights for policy 0, policy_version 11990 (0.0009) [2023-10-08 00:21:04,146][52060] Updated weights for policy 0, policy_version 12000 (0.0010) [2023-10-08 00:21:04,219][52059] Updated weights for policy 1, policy_version 12132 (0.0008) [2023-10-08 00:21:04,582][52059] Updated weights for policy 1, policy_version 12142 (0.0008) [2023-10-08 00:21:04,950][52059] Updated weights for policy 1, policy_version 12152 (0.0010) [2023-10-08 00:21:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 24739840. Throughput: 0: 1699.6, 1: 1748.2. Samples: 6187880. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:21:06,211][50642] Avg episode reward: [(0, '12.550'), (1, '14.190')] [2023-10-08 00:21:08,015][52060] Updated weights for policy 0, policy_version 12010 (0.0010) [2023-10-08 00:21:08,381][52060] Updated weights for policy 0, policy_version 12020 (0.0008) [2023-10-08 00:21:08,746][52060] Updated weights for policy 0, policy_version 12030 (0.0011) [2023-10-08 00:21:08,889][52059] Updated weights for policy 1, policy_version 12162 (0.0010) [2023-10-08 00:21:09,254][52059] Updated weights for policy 1, policy_version 12172 (0.0008) [2023-10-08 00:21:09,625][52059] Updated weights for policy 1, policy_version 12182 (0.0009) [2023-10-08 00:21:09,982][52059] Updated weights for policy 1, policy_version 12192 (0.0008) [2023-10-08 00:21:11,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 24805376. Throughput: 0: 1702.2, 1: 1723.9. Samples: 6207696. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:21:11,211][50642] Avg episode reward: [(0, '12.590'), (1, '14.700')] [2023-10-08 00:21:12,690][52060] Updated weights for policy 0, policy_version 12040 (0.0009) [2023-10-08 00:21:13,053][52060] Updated weights for policy 0, policy_version 12050 (0.0008) [2023-10-08 00:21:13,423][52060] Updated weights for policy 0, policy_version 12060 (0.0010) [2023-10-08 00:21:13,880][52059] Updated weights for policy 1, policy_version 12202 (0.0010) [2023-10-08 00:21:14,249][52059] Updated weights for policy 1, policy_version 12212 (0.0010) [2023-10-08 00:21:14,617][52059] Updated weights for policy 1, policy_version 12222 (0.0009) [2023-10-08 00:21:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 24870912. Throughput: 0: 1720.4, 1: 1717.6. Samples: 6228636. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:21:16,211][50642] Avg episode reward: [(0, '13.250'), (1, '14.970')] [2023-10-08 00:21:17,267][52060] Updated weights for policy 0, policy_version 12070 (0.0010) [2023-10-08 00:21:17,638][52060] Updated weights for policy 0, policy_version 12080 (0.0010) [2023-10-08 00:21:18,003][52060] Updated weights for policy 0, policy_version 12090 (0.0009) [2023-10-08 00:21:18,592][52059] Updated weights for policy 1, policy_version 12232 (0.0009) [2023-10-08 00:21:18,968][52059] Updated weights for policy 1, policy_version 12242 (0.0008) [2023-10-08 00:21:19,330][52059] Updated weights for policy 1, policy_version 12252 (0.0008) [2023-10-08 00:21:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 24936448. Throughput: 0: 1690.7, 1: 1735.9. Samples: 6238936. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) [2023-10-08 00:21:21,211][50642] Avg episode reward: [(0, '12.450'), (1, '15.110')] [2023-10-08 00:21:22,015][52060] Updated weights for policy 0, policy_version 12100 (0.0009) [2023-10-08 00:21:22,378][52060] Updated weights for policy 0, policy_version 12110 (0.0008) [2023-10-08 00:21:22,742][52060] Updated weights for policy 0, policy_version 12120 (0.0009) [2023-10-08 00:21:23,325][52059] Updated weights for policy 1, policy_version 12262 (0.0008) [2023-10-08 00:21:23,694][52059] Updated weights for policy 1, policy_version 12272 (0.0009) [2023-10-08 00:21:24,051][52059] Updated weights for policy 1, policy_version 12282 (0.0007) [2023-10-08 00:21:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 25001984. Throughput: 0: 1715.9, 1: 1708.4. Samples: 6259470. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) [2023-10-08 00:21:26,211][50642] Avg episode reward: [(0, '13.200'), (1, '15.540')] [2023-10-08 00:21:26,978][52060] Updated weights for policy 0, policy_version 12130 (0.0007) [2023-10-08 00:21:27,383][52060] Updated weights for policy 0, policy_version 12140 (0.0007) [2023-10-08 00:21:27,747][52060] Updated weights for policy 0, policy_version 12150 (0.0008) [2023-10-08 00:21:28,057][52059] Updated weights for policy 1, policy_version 12292 (0.0011) [2023-10-08 00:21:28,117][52060] Updated weights for policy 0, policy_version 12160 (0.0007) [2023-10-08 00:21:28,419][52059] Updated weights for policy 1, policy_version 12302 (0.0007) [2023-10-08 00:21:28,782][52059] Updated weights for policy 1, policy_version 12312 (0.0008) [2023-10-08 00:21:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 25067520. Throughput: 0: 1719.7, 1: 1730.6. Samples: 6280402. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) [2023-10-08 00:21:31,211][50642] Avg episode reward: [(0, '12.130'), (1, '14.720')] [2023-10-08 00:21:32,193][52060] Updated weights for policy 0, policy_version 12170 (0.0007) [2023-10-08 00:21:32,556][52060] Updated weights for policy 0, policy_version 12180 (0.0007) [2023-10-08 00:21:32,722][52059] Updated weights for policy 1, policy_version 12322 (0.0011) [2023-10-08 00:21:32,925][52060] Updated weights for policy 0, policy_version 12190 (0.0008) [2023-10-08 00:21:33,097][52059] Updated weights for policy 1, policy_version 12332 (0.0008) [2023-10-08 00:21:33,466][52059] Updated weights for policy 1, policy_version 12342 (0.0009) [2023-10-08 00:21:33,818][52059] Updated weights for policy 1, policy_version 12352 (0.0010) [2023-10-08 00:21:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 25133056. Throughput: 0: 1696.1, 1: 1711.3. Samples: 6289754. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:21:36,212][50642] Avg episode reward: [(0, '12.880'), (1, '14.590')] [2023-10-08 00:21:36,849][52060] Updated weights for policy 0, policy_version 12200 (0.0009) [2023-10-08 00:21:37,224][52060] Updated weights for policy 0, policy_version 12210 (0.0010) [2023-10-08 00:21:37,591][52060] Updated weights for policy 0, policy_version 12220 (0.0007) [2023-10-08 00:21:37,601][52059] Updated weights for policy 1, policy_version 12362 (0.0007) [2023-10-08 00:21:37,977][52059] Updated weights for policy 1, policy_version 12372 (0.0008) [2023-10-08 00:21:38,346][52059] Updated weights for policy 1, policy_version 12382 (0.0009) [2023-10-08 00:21:41,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 25198592. Throughput: 0: 1718.5, 1: 1723.4. Samples: 6311168. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:21:41,211][50642] Avg episode reward: [(0, '12.830'), (1, '13.910')] [2023-10-08 00:21:41,560][52060] Updated weights for policy 0, policy_version 12230 (0.0007) [2023-10-08 00:21:41,929][52060] Updated weights for policy 0, policy_version 12240 (0.0009) [2023-10-08 00:21:42,263][52059] Updated weights for policy 1, policy_version 12392 (0.0008) [2023-10-08 00:21:42,314][52060] Updated weights for policy 0, policy_version 12250 (0.0007) [2023-10-08 00:21:42,632][52059] Updated weights for policy 1, policy_version 12402 (0.0007) [2023-10-08 00:21:43,010][52059] Updated weights for policy 1, policy_version 12412 (0.0010) [2023-10-08 00:21:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 25264128. Throughput: 0: 1707.6, 1: 1744.1. Samples: 6332060. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:21:46,211][50642] Avg episode reward: [(0, '12.670'), (1, '13.200')] [2023-10-08 00:21:46,430][52060] Updated weights for policy 0, policy_version 12260 (0.0009) [2023-10-08 00:21:46,799][52060] Updated weights for policy 0, policy_version 12270 (0.0011) [2023-10-08 00:21:47,006][52059] Updated weights for policy 1, policy_version 12422 (0.0008) [2023-10-08 00:21:47,157][52060] Updated weights for policy 0, policy_version 12280 (0.0007) [2023-10-08 00:21:47,367][52059] Updated weights for policy 1, policy_version 12432 (0.0008) [2023-10-08 00:21:47,726][52059] Updated weights for policy 1, policy_version 12442 (0.0008) [2023-10-08 00:21:51,172][52060] Updated weights for policy 0, policy_version 12290 (0.0008) [2023-10-08 00:21:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 25329664. Throughput: 0: 1698.0, 1: 1710.4. Samples: 6341258. Policy #0 lag: (min: 31.0, avg: 32.1, max: 53.0) [2023-10-08 00:21:51,211][50642] Avg episode reward: [(0, '13.570'), (1, '17.350')] [2023-10-08 00:21:51,211][51710] Saving new best policy, reward=17.350! [2023-10-08 00:21:51,553][52060] Updated weights for policy 0, policy_version 12300 (0.0008) [2023-10-08 00:21:51,697][52059] Updated weights for policy 1, policy_version 12452 (0.0010) [2023-10-08 00:21:51,914][52060] Updated weights for policy 0, policy_version 12310 (0.0009) [2023-10-08 00:21:52,068][52059] Updated weights for policy 1, policy_version 12462 (0.0008) [2023-10-08 00:21:52,277][52060] Updated weights for policy 0, policy_version 12320 (0.0007) [2023-10-08 00:21:52,425][52059] Updated weights for policy 1, policy_version 12472 (0.0010) [2023-10-08 00:21:56,209][52060] Updated weights for policy 0, policy_version 12330 (0.0008) [2023-10-08 00:21:56,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 25395200. Throughput: 0: 1709.9, 1: 1737.5. Samples: 6362828. Policy #0 lag: (min: 31.0, avg: 32.1, max: 53.0) [2023-10-08 00:21:56,211][50642] Avg episode reward: [(0, '12.160'), (1, '14.060')] [2023-10-08 00:21:56,391][52059] Updated weights for policy 1, policy_version 12482 (0.0009) [2023-10-08 00:21:56,572][52060] Updated weights for policy 0, policy_version 12340 (0.0007) [2023-10-08 00:21:56,757][52059] Updated weights for policy 1, policy_version 12492 (0.0007) [2023-10-08 00:21:56,937][52060] Updated weights for policy 0, policy_version 12350 (0.0008) [2023-10-08 00:21:57,132][52059] Updated weights for policy 1, policy_version 12502 (0.0007) [2023-10-08 00:21:57,506][52059] Updated weights for policy 1, policy_version 12512 (0.0008) [2023-10-08 00:22:00,963][52060] Updated weights for policy 0, policy_version 12360 (0.0008) [2023-10-08 00:22:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 25460736. Throughput: 0: 1709.0, 1: 1741.8. Samples: 6383924. Policy #0 lag: (min: 31.0, avg: 32.1, max: 53.0) [2023-10-08 00:22:01,211][50642] Avg episode reward: [(0, '13.660'), (1, '15.160')] [2023-10-08 00:22:01,282][52059] Updated weights for policy 1, policy_version 12522 (0.0008) [2023-10-08 00:22:01,339][52060] Updated weights for policy 0, policy_version 12370 (0.0008) [2023-10-08 00:22:01,646][52059] Updated weights for policy 1, policy_version 12532 (0.0007) [2023-10-08 00:22:01,704][52060] Updated weights for policy 0, policy_version 12380 (0.0007) [2023-10-08 00:22:02,007][52059] Updated weights for policy 1, policy_version 12542 (0.0007) [2023-10-08 00:22:05,597][52060] Updated weights for policy 0, policy_version 12390 (0.0007) [2023-10-08 00:22:05,916][52059] Updated weights for policy 1, policy_version 12552 (0.0008) [2023-10-08 00:22:05,960][52060] Updated weights for policy 0, policy_version 12400 (0.0009) [2023-10-08 00:22:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 25526272. Throughput: 0: 1711.7, 1: 1724.2. Samples: 6393552. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:22:06,211][50642] Avg episode reward: [(0, '11.190'), (1, '17.360')] [2023-10-08 00:22:06,302][52059] Updated weights for policy 1, policy_version 12562 (0.0008) [2023-10-08 00:22:06,338][52060] Updated weights for policy 0, policy_version 12410 (0.0008) [2023-10-08 00:22:06,671][52059] Updated weights for policy 1, policy_version 12572 (0.0008) [2023-10-08 00:22:06,809][51710] Saving new best policy, reward=17.360! [2023-10-08 00:22:10,210][52060] Updated weights for policy 0, policy_version 12420 (0.0009) [2023-10-08 00:22:10,529][52059] Updated weights for policy 1, policy_version 12582 (0.0008) [2023-10-08 00:22:10,573][52060] Updated weights for policy 0, policy_version 12430 (0.0009) [2023-10-08 00:22:10,893][52059] Updated weights for policy 1, policy_version 12592 (0.0009) [2023-10-08 00:22:10,938][52060] Updated weights for policy 0, policy_version 12440 (0.0007) [2023-10-08 00:22:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 25591808. Throughput: 0: 1714.4, 1: 1742.3. Samples: 6415018. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:22:11,211][50642] Avg episode reward: [(0, '13.530'), (1, '12.470')] [2023-10-08 00:22:11,259][52059] Updated weights for policy 1, policy_version 12602 (0.0008) [2023-10-08 00:22:14,838][52060] Updated weights for policy 0, policy_version 12450 (0.0007) [2023-10-08 00:22:14,978][52059] Updated weights for policy 1, policy_version 12612 (0.0010) [2023-10-08 00:22:15,245][52060] Updated weights for policy 0, policy_version 12460 (0.0008) [2023-10-08 00:22:15,341][52059] Updated weights for policy 1, policy_version 12622 (0.0009) [2023-10-08 00:22:15,612][52060] Updated weights for policy 0, policy_version 12470 (0.0008) [2023-10-08 00:22:15,706][52059] Updated weights for policy 1, policy_version 12632 (0.0009) [2023-10-08 00:22:15,982][52060] Updated weights for policy 0, policy_version 12480 (0.0007) [2023-10-08 00:22:16,210][50642] Fps is (10 sec: 19660.6, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 25722880. Throughput: 0: 1694.8, 1: 1724.1. Samples: 6434254. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:22:16,211][50642] Avg episode reward: [(0, '12.670'), (1, '13.550')] [2023-10-08 00:22:19,608][52059] Updated weights for policy 1, policy_version 12642 (0.0008) [2023-10-08 00:22:19,933][52060] Updated weights for policy 0, policy_version 12490 (0.0009) [2023-10-08 00:22:19,967][52059] Updated weights for policy 1, policy_version 12652 (0.0007) [2023-10-08 00:22:20,308][52060] Updated weights for policy 0, policy_version 12500 (0.0008) [2023-10-08 00:22:20,336][52059] Updated weights for policy 1, policy_version 12662 (0.0009) [2023-10-08 00:22:20,674][52060] Updated weights for policy 0, policy_version 12510 (0.0009) [2023-10-08 00:22:20,702][52059] Updated weights for policy 1, policy_version 12672 (0.0009) [2023-10-08 00:22:21,210][50642] Fps is (10 sec: 19660.8, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 25788416. Throughput: 0: 1723.7, 1: 1746.8. Samples: 6445924. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:22:21,211][50642] Avg episode reward: [(0, '12.540'), (1, '17.050')] [2023-10-08 00:22:24,762][52060] Updated weights for policy 0, policy_version 12520 (0.0008) [2023-10-08 00:22:24,911][52059] Updated weights for policy 1, policy_version 12682 (0.0008) [2023-10-08 00:22:25,131][52060] Updated weights for policy 0, policy_version 12530 (0.0009) [2023-10-08 00:22:25,282][52059] Updated weights for policy 1, policy_version 12692 (0.0007) [2023-10-08 00:22:25,500][52060] Updated weights for policy 0, policy_version 12540 (0.0009) [2023-10-08 00:22:25,637][52059] Updated weights for policy 1, policy_version 12702 (0.0008) [2023-10-08 00:22:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 25853952. Throughput: 0: 1713.3, 1: 1736.4. Samples: 6466406. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:22:26,211][50642] Avg episode reward: [(0, '13.690'), (1, '13.920')] [2023-10-08 00:22:29,435][52060] Updated weights for policy 0, policy_version 12550 (0.0008) [2023-10-08 00:22:29,530][52059] Updated weights for policy 1, policy_version 12712 (0.0009) [2023-10-08 00:22:29,801][52060] Updated weights for policy 0, policy_version 12560 (0.0009) [2023-10-08 00:22:29,897][52059] Updated weights for policy 1, policy_version 12722 (0.0008) [2023-10-08 00:22:30,177][52060] Updated weights for policy 0, policy_version 12570 (0.0007) [2023-10-08 00:22:30,262][52059] Updated weights for policy 1, policy_version 12732 (0.0009) [2023-10-08 00:22:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 25919488. Throughput: 0: 1701.5, 1: 1717.3. Samples: 6485904. Policy #0 lag: (min: 12.0, avg: 25.4, max: 44.0) [2023-10-08 00:22:31,211][50642] Avg episode reward: [(0, '12.520'), (1, '16.910')] [2023-10-08 00:22:34,169][52059] Updated weights for policy 1, policy_version 12742 (0.0009) [2023-10-08 00:22:34,171][52060] Updated weights for policy 0, policy_version 12580 (0.0007) [2023-10-08 00:22:34,536][52059] Updated weights for policy 1, policy_version 12752 (0.0007) [2023-10-08 00:22:34,545][52060] Updated weights for policy 0, policy_version 12590 (0.0007) [2023-10-08 00:22:34,899][52059] Updated weights for policy 1, policy_version 12762 (0.0007) [2023-10-08 00:22:34,917][52060] Updated weights for policy 0, policy_version 12600 (0.0007) [2023-10-08 00:22:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 25985024. Throughput: 0: 1732.9, 1: 1751.3. Samples: 6498050. Policy #0 lag: (min: 12.0, avg: 25.4, max: 44.0) [2023-10-08 00:22:36,211][50642] Avg episode reward: [(0, '13.770'), (1, '16.280')] [2023-10-08 00:22:36,211][51605] Saving new best policy, reward=13.770! [2023-10-08 00:22:38,694][52059] Updated weights for policy 1, policy_version 12772 (0.0008) [2023-10-08 00:22:38,827][52060] Updated weights for policy 0, policy_version 12610 (0.0008) [2023-10-08 00:22:39,052][52059] Updated weights for policy 1, policy_version 12782 (0.0008) [2023-10-08 00:22:39,202][52060] Updated weights for policy 0, policy_version 12620 (0.0011) [2023-10-08 00:22:39,421][52059] Updated weights for policy 1, policy_version 12792 (0.0009) [2023-10-08 00:22:39,560][52060] Updated weights for policy 0, policy_version 12630 (0.0008) [2023-10-08 00:22:39,930][52060] Updated weights for policy 0, policy_version 12640 (0.0007) [2023-10-08 00:22:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 26050560. Throughput: 0: 1702.9, 1: 1721.6. Samples: 6516934. Policy #0 lag: (min: 12.0, avg: 25.4, max: 44.0) [2023-10-08 00:22:41,211][50642] Avg episode reward: [(0, '12.020'), (1, '15.540')] [2023-10-08 00:22:43,250][52059] Updated weights for policy 1, policy_version 12802 (0.0008) [2023-10-08 00:22:43,619][52059] Updated weights for policy 1, policy_version 12812 (0.0009) [2023-10-08 00:22:43,676][52060] Updated weights for policy 0, policy_version 12650 (0.0008) [2023-10-08 00:22:43,981][52059] Updated weights for policy 1, policy_version 12822 (0.0008) [2023-10-08 00:22:44,048][52060] Updated weights for policy 0, policy_version 12660 (0.0008) [2023-10-08 00:22:44,343][52059] Updated weights for policy 1, policy_version 12832 (0.0010) [2023-10-08 00:22:44,414][52060] Updated weights for policy 0, policy_version 12670 (0.0007) [2023-10-08 00:22:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 26116096. Throughput: 0: 1704.7, 1: 1723.1. Samples: 6538172. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:22:46,211][50642] Avg episode reward: [(0, '13.420'), (1, '17.070')] [2023-10-08 00:22:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000012672_12976128.pth... [2023-10-08 00:22:46,220][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000012832_13139968.pth... [2023-10-08 00:22:46,249][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000011072_11337728.pth [2023-10-08 00:22:46,255][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000011200_11468800.pth [2023-10-08 00:22:48,292][52059] Updated weights for policy 1, policy_version 12842 (0.0007) [2023-10-08 00:22:48,466][52060] Updated weights for policy 0, policy_version 12680 (0.0009) [2023-10-08 00:22:48,660][52059] Updated weights for policy 1, policy_version 12852 (0.0007) [2023-10-08 00:22:48,834][52060] Updated weights for policy 0, policy_version 12690 (0.0009) [2023-10-08 00:22:49,028][52059] Updated weights for policy 1, policy_version 12862 (0.0007) [2023-10-08 00:22:49,194][52060] Updated weights for policy 0, policy_version 12700 (0.0009) [2023-10-08 00:22:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 26181632. Throughput: 0: 1709.6, 1: 1732.0. Samples: 6548428. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:22:51,211][50642] Avg episode reward: [(0, '12.450'), (1, '16.270')] [2023-10-08 00:22:52,863][52059] Updated weights for policy 1, policy_version 12872 (0.0009) [2023-10-08 00:22:53,145][52060] Updated weights for policy 0, policy_version 12710 (0.0008) [2023-10-08 00:22:53,228][52059] Updated weights for policy 1, policy_version 12882 (0.0007) [2023-10-08 00:22:53,507][52060] Updated weights for policy 0, policy_version 12720 (0.0007) [2023-10-08 00:22:53,594][52059] Updated weights for policy 1, policy_version 12892 (0.0007) [2023-10-08 00:22:53,880][52060] Updated weights for policy 0, policy_version 12730 (0.0007) [2023-10-08 00:22:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 26247168. Throughput: 0: 1691.3, 1: 1727.4. Samples: 6568862. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:22:56,211][50642] Avg episode reward: [(0, '13.010'), (1, '15.340')] [2023-10-08 00:22:57,793][52060] Updated weights for policy 0, policy_version 12740 (0.0007) [2023-10-08 00:22:57,816][52059] Updated weights for policy 1, policy_version 12902 (0.0007) [2023-10-08 00:22:58,158][52060] Updated weights for policy 0, policy_version 12750 (0.0008) [2023-10-08 00:22:58,197][52059] Updated weights for policy 1, policy_version 12912 (0.0007) [2023-10-08 00:22:58,536][52060] Updated weights for policy 0, policy_version 12760 (0.0008) [2023-10-08 00:22:58,559][52059] Updated weights for policy 1, policy_version 12922 (0.0007) [2023-10-08 00:23:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 26312704. Throughput: 0: 1724.5, 1: 1737.8. Samples: 6590058. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:23:01,211][50642] Avg episode reward: [(0, '13.030'), (1, '15.250')] [2023-10-08 00:23:02,395][52059] Updated weights for policy 1, policy_version 12932 (0.0010) [2023-10-08 00:23:02,699][52060] Updated weights for policy 0, policy_version 12770 (0.0008) [2023-10-08 00:23:02,755][52059] Updated weights for policy 1, policy_version 12942 (0.0008) [2023-10-08 00:23:03,111][52060] Updated weights for policy 0, policy_version 12780 (0.0008) [2023-10-08 00:23:03,120][52059] Updated weights for policy 1, policy_version 12952 (0.0008) [2023-10-08 00:23:03,480][52060] Updated weights for policy 0, policy_version 12790 (0.0008) [2023-10-08 00:23:03,840][52060] Updated weights for policy 0, policy_version 12800 (0.0008) [2023-10-08 00:23:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 26378240. Throughput: 0: 1693.2, 1: 1716.7. Samples: 6599372. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:23:06,211][50642] Avg episode reward: [(0, '12.360'), (1, '16.190')] [2023-10-08 00:23:07,115][52059] Updated weights for policy 1, policy_version 12962 (0.0008) [2023-10-08 00:23:07,484][52059] Updated weights for policy 1, policy_version 12972 (0.0007) [2023-10-08 00:23:07,846][52059] Updated weights for policy 1, policy_version 12982 (0.0007) [2023-10-08 00:23:07,923][52060] Updated weights for policy 0, policy_version 12810 (0.0008) [2023-10-08 00:23:08,206][52059] Updated weights for policy 1, policy_version 12992 (0.0009) [2023-10-08 00:23:08,290][52060] Updated weights for policy 0, policy_version 12820 (0.0008) [2023-10-08 00:23:08,661][52060] Updated weights for policy 0, policy_version 12830 (0.0009) [2023-10-08 00:23:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 26443776. Throughput: 0: 1703.3, 1: 1723.0. Samples: 6620588. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:23:11,211][50642] Avg episode reward: [(0, '14.060'), (1, '14.800')] [2023-10-08 00:23:11,212][51605] Saving new best policy, reward=14.060! [2023-10-08 00:23:12,013][52059] Updated weights for policy 1, policy_version 13002 (0.0009) [2023-10-08 00:23:12,384][52059] Updated weights for policy 1, policy_version 13012 (0.0009) [2023-10-08 00:23:12,749][52060] Updated weights for policy 0, policy_version 12840 (0.0008) [2023-10-08 00:23:12,761][52059] Updated weights for policy 1, policy_version 13022 (0.0007) [2023-10-08 00:23:13,115][52060] Updated weights for policy 0, policy_version 12850 (0.0007) [2023-10-08 00:23:13,488][52060] Updated weights for policy 0, policy_version 12860 (0.0007) [2023-10-08 00:23:16,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 26509312. Throughput: 0: 1722.9, 1: 1747.1. Samples: 6642052. Policy #0 lag: (min: 33.0, avg: 53.4, max: 56.0) [2023-10-08 00:23:16,211][50642] Avg episode reward: [(0, '12.020'), (1, '15.140')] [2023-10-08 00:23:16,781][52059] Updated weights for policy 1, policy_version 13032 (0.0010) [2023-10-08 00:23:17,155][52059] Updated weights for policy 1, policy_version 13042 (0.0008) [2023-10-08 00:23:17,380][52060] Updated weights for policy 0, policy_version 12870 (0.0008) [2023-10-08 00:23:17,517][52059] Updated weights for policy 1, policy_version 13052 (0.0007) [2023-10-08 00:23:17,745][52060] Updated weights for policy 0, policy_version 12880 (0.0007) [2023-10-08 00:23:18,107][52060] Updated weights for policy 0, policy_version 12890 (0.0008) [2023-10-08 00:23:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 26574848. Throughput: 0: 1691.9, 1: 1714.4. Samples: 6651336. Policy #0 lag: (min: 33.0, avg: 53.4, max: 56.0) [2023-10-08 00:23:21,211][50642] Avg episode reward: [(0, '13.990'), (1, '15.360')] [2023-10-08 00:23:21,419][52059] Updated weights for policy 1, policy_version 13062 (0.0007) [2023-10-08 00:23:21,783][52059] Updated weights for policy 1, policy_version 13072 (0.0009) [2023-10-08 00:23:22,129][52060] Updated weights for policy 0, policy_version 12900 (0.0007) [2023-10-08 00:23:22,154][52059] Updated weights for policy 1, policy_version 13082 (0.0008) [2023-10-08 00:23:22,494][52060] Updated weights for policy 0, policy_version 12910 (0.0009) [2023-10-08 00:23:22,865][52060] Updated weights for policy 0, policy_version 12920 (0.0010) [2023-10-08 00:23:25,909][52059] Updated weights for policy 1, policy_version 13092 (0.0010) [2023-10-08 00:23:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 26640384. Throughput: 0: 1716.7, 1: 1743.2. Samples: 6672630. Policy #0 lag: (min: 33.0, avg: 53.4, max: 56.0) [2023-10-08 00:23:26,211][50642] Avg episode reward: [(0, '12.280'), (1, '13.800')] [2023-10-08 00:23:26,272][52059] Updated weights for policy 1, policy_version 13102 (0.0008) [2023-10-08 00:23:26,633][52059] Updated weights for policy 1, policy_version 13112 (0.0009) [2023-10-08 00:23:26,873][52060] Updated weights for policy 0, policy_version 12930 (0.0010) [2023-10-08 00:23:27,241][52060] Updated weights for policy 0, policy_version 12940 (0.0007) [2023-10-08 00:23:27,612][52060] Updated weights for policy 0, policy_version 12950 (0.0008) [2023-10-08 00:23:27,975][52060] Updated weights for policy 0, policy_version 12960 (0.0007) [2023-10-08 00:23:30,661][52059] Updated weights for policy 1, policy_version 13122 (0.0007) [2023-10-08 00:23:31,024][52059] Updated weights for policy 1, policy_version 13132 (0.0007) [2023-10-08 00:23:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 26705920. Throughput: 0: 1719.6, 1: 1734.1. Samples: 6693586. Policy #0 lag: (min: 1.0, avg: 18.9, max: 33.0) [2023-10-08 00:23:31,211][50642] Avg episode reward: [(0, '13.380'), (1, '14.710')] [2023-10-08 00:23:31,381][52059] Updated weights for policy 1, policy_version 13142 (0.0009) [2023-10-08 00:23:31,745][52059] Updated weights for policy 1, policy_version 13152 (0.0008) [2023-10-08 00:23:32,016][52060] Updated weights for policy 0, policy_version 12970 (0.0010) [2023-10-08 00:23:32,391][52060] Updated weights for policy 0, policy_version 12980 (0.0008) [2023-10-08 00:23:32,754][52060] Updated weights for policy 0, policy_version 12990 (0.0009) [2023-10-08 00:23:35,660][52059] Updated weights for policy 1, policy_version 13162 (0.0009) [2023-10-08 00:23:36,018][52059] Updated weights for policy 1, policy_version 13172 (0.0008) [2023-10-08 00:23:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 26771456. Throughput: 0: 1706.9, 1: 1735.2. Samples: 6703322. Policy #0 lag: (min: 1.0, avg: 18.9, max: 33.0) [2023-10-08 00:23:36,211][50642] Avg episode reward: [(0, '11.950'), (1, '16.190')] [2023-10-08 00:23:36,381][52059] Updated weights for policy 1, policy_version 13182 (0.0008) [2023-10-08 00:23:36,719][52060] Updated weights for policy 0, policy_version 13000 (0.0008) [2023-10-08 00:23:37,086][52060] Updated weights for policy 0, policy_version 13010 (0.0009) [2023-10-08 00:23:37,464][52060] Updated weights for policy 0, policy_version 13020 (0.0008) [2023-10-08 00:23:40,425][52059] Updated weights for policy 1, policy_version 13192 (0.0007) [2023-10-08 00:23:40,794][52059] Updated weights for policy 1, policy_version 13202 (0.0008) [2023-10-08 00:23:41,167][52059] Updated weights for policy 1, policy_version 13212 (0.0009) [2023-10-08 00:23:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 26836992. Throughput: 0: 1721.3, 1: 1740.9. Samples: 6724662. Policy #0 lag: (min: 31.0, avg: 32.9, max: 62.0) [2023-10-08 00:23:41,211][50642] Avg episode reward: [(0, '13.080'), (1, '14.680')] [2023-10-08 00:23:41,332][52060] Updated weights for policy 0, policy_version 13030 (0.0008) [2023-10-08 00:23:41,699][52060] Updated weights for policy 0, policy_version 13040 (0.0009) [2023-10-08 00:23:42,072][52060] Updated weights for policy 0, policy_version 13050 (0.0009) [2023-10-08 00:23:45,039][52059] Updated weights for policy 1, policy_version 13222 (0.0008) [2023-10-08 00:23:45,412][52059] Updated weights for policy 1, policy_version 13232 (0.0009) [2023-10-08 00:23:45,776][52059] Updated weights for policy 1, policy_version 13242 (0.0008) [2023-10-08 00:23:45,865][52060] Updated weights for policy 0, policy_version 13060 (0.0010) [2023-10-08 00:23:46,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 26935296. Throughput: 0: 1712.8, 1: 1726.9. Samples: 6744844. Policy #0 lag: (min: 31.0, avg: 32.9, max: 62.0) [2023-10-08 00:23:46,211][50642] Avg episode reward: [(0, '13.240'), (1, '16.570')] [2023-10-08 00:23:46,231][52060] Updated weights for policy 0, policy_version 13070 (0.0008) [2023-10-08 00:23:46,597][52060] Updated weights for policy 0, policy_version 13080 (0.0007) [2023-10-08 00:23:49,786][52059] Updated weights for policy 1, policy_version 13252 (0.0008) [2023-10-08 00:23:50,156][52059] Updated weights for policy 1, policy_version 13262 (0.0008) [2023-10-08 00:23:50,513][52059] Updated weights for policy 1, policy_version 13272 (0.0008) [2023-10-08 00:23:50,825][52060] Updated weights for policy 0, policy_version 13090 (0.0007) [2023-10-08 00:23:51,193][52060] Updated weights for policy 0, policy_version 13100 (0.0008) [2023-10-08 00:23:51,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 27000832. Throughput: 0: 1718.2, 1: 1743.5. Samples: 6755146. Policy #0 lag: (min: 31.0, avg: 32.9, max: 62.0) [2023-10-08 00:23:51,211][50642] Avg episode reward: [(0, '12.450'), (1, '16.700')] [2023-10-08 00:23:51,556][52060] Updated weights for policy 0, policy_version 13110 (0.0008) [2023-10-08 00:23:51,930][52060] Updated weights for policy 0, policy_version 13120 (0.0007) [2023-10-08 00:23:54,577][52059] Updated weights for policy 1, policy_version 13282 (0.0007) [2023-10-08 00:23:54,941][52059] Updated weights for policy 1, policy_version 13292 (0.0008) [2023-10-08 00:23:55,313][52059] Updated weights for policy 1, policy_version 13302 (0.0007) [2023-10-08 00:23:55,671][52059] Updated weights for policy 1, policy_version 13312 (0.0008) [2023-10-08 00:23:55,743][52060] Updated weights for policy 0, policy_version 13130 (0.0009) [2023-10-08 00:23:56,118][52060] Updated weights for policy 0, policy_version 13140 (0.0008) [2023-10-08 00:23:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 27066368. Throughput: 0: 1722.3, 1: 1734.1. Samples: 6776122. Policy #0 lag: (min: 31.0, avg: 37.4, max: 63.0) [2023-10-08 00:23:56,210][50642] Avg episode reward: [(0, '14.050'), (1, '14.860')] [2023-10-08 00:23:56,484][52060] Updated weights for policy 0, policy_version 13150 (0.0008) [2023-10-08 00:23:59,650][52059] Updated weights for policy 1, policy_version 13322 (0.0007) [2023-10-08 00:24:00,012][52059] Updated weights for policy 1, policy_version 13332 (0.0007) [2023-10-08 00:24:00,287][52060] Updated weights for policy 0, policy_version 13160 (0.0008) [2023-10-08 00:24:00,375][52059] Updated weights for policy 1, policy_version 13342 (0.0007) [2023-10-08 00:24:00,649][52060] Updated weights for policy 0, policy_version 13170 (0.0010) [2023-10-08 00:24:01,016][52060] Updated weights for policy 0, policy_version 13180 (0.0008) [2023-10-08 00:24:01,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 27164672. Throughput: 0: 1707.4, 1: 1708.2. Samples: 6795756. Policy #0 lag: (min: 31.0, avg: 37.4, max: 63.0) [2023-10-08 00:24:01,211][50642] Avg episode reward: [(0, '13.260'), (1, '16.040')] [2023-10-08 00:24:04,333][52059] Updated weights for policy 1, policy_version 13352 (0.0009) [2023-10-08 00:24:04,705][52059] Updated weights for policy 1, policy_version 13362 (0.0009) [2023-10-08 00:24:04,964][52060] Updated weights for policy 0, policy_version 13190 (0.0009) [2023-10-08 00:24:05,071][52059] Updated weights for policy 1, policy_version 13372 (0.0007) [2023-10-08 00:24:05,325][52060] Updated weights for policy 0, policy_version 13200 (0.0010) [2023-10-08 00:24:05,698][52060] Updated weights for policy 0, policy_version 13210 (0.0008) [2023-10-08 00:24:06,210][50642] Fps is (10 sec: 16383.5, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 27230208. Throughput: 0: 1728.6, 1: 1738.8. Samples: 6807368. Policy #0 lag: (min: 11.0, avg: 19.0, max: 43.0) [2023-10-08 00:24:06,211][50642] Avg episode reward: [(0, '13.690'), (1, '16.620')] [2023-10-08 00:24:08,935][52059] Updated weights for policy 1, policy_version 13382 (0.0009) [2023-10-08 00:24:09,296][52059] Updated weights for policy 1, policy_version 13392 (0.0010) [2023-10-08 00:24:09,595][52060] Updated weights for policy 0, policy_version 13220 (0.0007) [2023-10-08 00:24:09,659][52059] Updated weights for policy 1, policy_version 13402 (0.0007) [2023-10-08 00:24:09,957][52060] Updated weights for policy 0, policy_version 13230 (0.0008) [2023-10-08 00:24:10,331][52060] Updated weights for policy 0, policy_version 13240 (0.0008) [2023-10-08 00:24:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 27295744. Throughput: 0: 1725.2, 1: 1708.0. Samples: 6827124. Policy #0 lag: (min: 11.0, avg: 19.0, max: 43.0) [2023-10-08 00:24:11,211][50642] Avg episode reward: [(0, '13.030'), (1, '15.520')] [2023-10-08 00:24:13,608][52059] Updated weights for policy 1, policy_version 13412 (0.0007) [2023-10-08 00:24:13,979][52059] Updated weights for policy 1, policy_version 13422 (0.0009) [2023-10-08 00:24:14,342][52060] Updated weights for policy 0, policy_version 13250 (0.0010) [2023-10-08 00:24:14,345][52059] Updated weights for policy 1, policy_version 13432 (0.0008) [2023-10-08 00:24:14,712][52060] Updated weights for policy 0, policy_version 13260 (0.0009) [2023-10-08 00:24:15,093][52060] Updated weights for policy 0, policy_version 13270 (0.0007) [2023-10-08 00:24:15,459][52060] Updated weights for policy 0, policy_version 13280 (0.0008) [2023-10-08 00:24:16,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 27361280. Throughput: 0: 1704.0, 1: 1714.1. Samples: 6847398. Policy #0 lag: (min: 11.0, avg: 19.0, max: 43.0) [2023-10-08 00:24:16,211][50642] Avg episode reward: [(0, '12.940'), (1, '15.820')] [2023-10-08 00:24:18,323][52059] Updated weights for policy 1, policy_version 13442 (0.0008) [2023-10-08 00:24:18,688][52059] Updated weights for policy 1, policy_version 13452 (0.0007) [2023-10-08 00:24:19,059][52059] Updated weights for policy 1, policy_version 13462 (0.0007) [2023-10-08 00:24:19,415][52059] Updated weights for policy 1, policy_version 13472 (0.0008) [2023-10-08 00:24:19,533][52060] Updated weights for policy 0, policy_version 13290 (0.0009) [2023-10-08 00:24:19,894][52060] Updated weights for policy 0, policy_version 13300 (0.0009) [2023-10-08 00:24:20,260][52060] Updated weights for policy 0, policy_version 13310 (0.0008) [2023-10-08 00:24:21,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 27426816. Throughput: 0: 1734.1, 1: 1714.6. Samples: 6858514. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:24:21,211][50642] Avg episode reward: [(0, '13.090'), (1, '16.310')] [2023-10-08 00:24:23,341][52059] Updated weights for policy 1, policy_version 13482 (0.0009) [2023-10-08 00:24:23,707][52059] Updated weights for policy 1, policy_version 13492 (0.0009) [2023-10-08 00:24:24,068][52059] Updated weights for policy 1, policy_version 13502 (0.0007) [2023-10-08 00:24:24,120][52060] Updated weights for policy 0, policy_version 13320 (0.0007) [2023-10-08 00:24:24,498][52060] Updated weights for policy 0, policy_version 13330 (0.0007) [2023-10-08 00:24:24,857][52060] Updated weights for policy 0, policy_version 13340 (0.0008) [2023-10-08 00:24:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 27492352. Throughput: 0: 1711.4, 1: 1700.5. Samples: 6878198. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:24:26,211][50642] Avg episode reward: [(0, '13.610'), (1, '14.830')] [2023-10-08 00:24:27,939][52059] Updated weights for policy 1, policy_version 13512 (0.0011) [2023-10-08 00:24:28,306][52059] Updated weights for policy 1, policy_version 13522 (0.0010) [2023-10-08 00:24:28,673][52059] Updated weights for policy 1, policy_version 13532 (0.0008) [2023-10-08 00:24:28,942][52060] Updated weights for policy 0, policy_version 13350 (0.0008) [2023-10-08 00:24:29,304][52060] Updated weights for policy 0, policy_version 13360 (0.0008) [2023-10-08 00:24:29,678][52060] Updated weights for policy 0, policy_version 13370 (0.0009) [2023-10-08 00:24:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 27557888. Throughput: 0: 1700.2, 1: 1733.5. Samples: 6899358. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:24:31,211][50642] Avg episode reward: [(0, '13.910'), (1, '16.170')] [2023-10-08 00:24:32,644][52059] Updated weights for policy 1, policy_version 13542 (0.0010) [2023-10-08 00:24:33,037][52059] Updated weights for policy 1, policy_version 13552 (0.0010) [2023-10-08 00:24:33,409][52059] Updated weights for policy 1, policy_version 13562 (0.0010) [2023-10-08 00:24:33,756][52060] Updated weights for policy 0, policy_version 13380 (0.0009) [2023-10-08 00:24:34,113][52060] Updated weights for policy 0, policy_version 13390 (0.0010) [2023-10-08 00:24:34,489][52060] Updated weights for policy 0, policy_version 13400 (0.0009) [2023-10-08 00:24:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 27623424. Throughput: 0: 1720.9, 1: 1708.7. Samples: 6909478. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 00:24:36,211][50642] Avg episode reward: [(0, '12.910'), (1, '18.990')] [2023-10-08 00:24:36,211][51710] Saving new best policy, reward=18.990! [2023-10-08 00:24:37,266][52059] Updated weights for policy 1, policy_version 13572 (0.0009) [2023-10-08 00:24:37,636][52059] Updated weights for policy 1, policy_version 13582 (0.0007) [2023-10-08 00:24:37,995][52059] Updated weights for policy 1, policy_version 13592 (0.0007) [2023-10-08 00:24:38,547][52060] Updated weights for policy 0, policy_version 13410 (0.0008) [2023-10-08 00:24:38,937][52060] Updated weights for policy 0, policy_version 13420 (0.0009) [2023-10-08 00:24:39,293][52060] Updated weights for policy 0, policy_version 13430 (0.0007) [2023-10-08 00:24:39,665][52060] Updated weights for policy 0, policy_version 13440 (0.0008) [2023-10-08 00:24:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 27688960. Throughput: 0: 1692.1, 1: 1720.7. Samples: 6929700. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 00:24:41,211][50642] Avg episode reward: [(0, '14.140'), (1, '14.410')] [2023-10-08 00:24:41,211][51605] Saving new best policy, reward=14.140! [2023-10-08 00:24:41,897][52059] Updated weights for policy 1, policy_version 13602 (0.0008) [2023-10-08 00:24:42,259][52059] Updated weights for policy 1, policy_version 13612 (0.0009) [2023-10-08 00:24:42,632][52059] Updated weights for policy 1, policy_version 13622 (0.0007) [2023-10-08 00:24:42,996][52059] Updated weights for policy 1, policy_version 13632 (0.0008) [2023-10-08 00:24:43,598][52060] Updated weights for policy 0, policy_version 13450 (0.0007) [2023-10-08 00:24:43,965][52060] Updated weights for policy 0, policy_version 13460 (0.0007) [2023-10-08 00:24:44,338][52060] Updated weights for policy 0, policy_version 13470 (0.0008) [2023-10-08 00:24:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 27754496. Throughput: 0: 1711.5, 1: 1745.2. Samples: 6951306. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 00:24:46,211][50642] Avg episode reward: [(0, '13.210'), (1, '15.710')] [2023-10-08 00:24:46,217][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000013472_13795328.pth... [2023-10-08 00:24:46,217][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000013632_13959168.pth... [2023-10-08 00:24:46,248][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000012032_12320768.pth [2023-10-08 00:24:46,248][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000011872_12156928.pth [2023-10-08 00:24:46,914][52059] Updated weights for policy 1, policy_version 13642 (0.0008) [2023-10-08 00:24:47,282][52059] Updated weights for policy 1, policy_version 13652 (0.0007) [2023-10-08 00:24:47,645][52059] Updated weights for policy 1, policy_version 13662 (0.0009) [2023-10-08 00:24:48,348][52060] Updated weights for policy 0, policy_version 13480 (0.0007) [2023-10-08 00:24:48,724][52060] Updated weights for policy 0, policy_version 13490 (0.0010) [2023-10-08 00:24:49,099][52060] Updated weights for policy 0, policy_version 13500 (0.0008) [2023-10-08 00:24:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 27820032. Throughput: 0: 1701.8, 1: 1714.3. Samples: 6961092. Policy #0 lag: (min: 31.0, avg: 33.5, max: 63.0) [2023-10-08 00:24:51,211][50642] Avg episode reward: [(0, '12.770'), (1, '16.610')] [2023-10-08 00:24:51,580][52059] Updated weights for policy 1, policy_version 13672 (0.0008) [2023-10-08 00:24:51,949][52059] Updated weights for policy 1, policy_version 13682 (0.0007) [2023-10-08 00:24:52,322][52059] Updated weights for policy 1, policy_version 13692 (0.0007) [2023-10-08 00:24:53,102][52060] Updated weights for policy 0, policy_version 13510 (0.0007) [2023-10-08 00:24:53,464][52060] Updated weights for policy 0, policy_version 13520 (0.0010) [2023-10-08 00:24:53,835][52060] Updated weights for policy 0, policy_version 13530 (0.0012) [2023-10-08 00:24:56,193][52059] Updated weights for policy 1, policy_version 13702 (0.0009) [2023-10-08 00:24:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 27885568. Throughput: 0: 1694.4, 1: 1743.6. Samples: 6981834. Policy #0 lag: (min: 31.0, avg: 33.5, max: 63.0) [2023-10-08 00:24:56,211][50642] Avg episode reward: [(0, '13.290'), (1, '13.560')] [2023-10-08 00:24:56,559][52059] Updated weights for policy 1, policy_version 13712 (0.0010) [2023-10-08 00:24:56,921][52059] Updated weights for policy 1, policy_version 13722 (0.0009) [2023-10-08 00:24:57,791][52060] Updated weights for policy 0, policy_version 13540 (0.0010) [2023-10-08 00:24:58,162][52060] Updated weights for policy 0, policy_version 13550 (0.0008) [2023-10-08 00:24:58,529][52060] Updated weights for policy 0, policy_version 13560 (0.0007) [2023-10-08 00:25:00,951][52059] Updated weights for policy 1, policy_version 13732 (0.0011) [2023-10-08 00:25:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 27951104. Throughput: 0: 1715.0, 1: 1740.3. Samples: 7002884. Policy #0 lag: (min: 31.0, avg: 33.5, max: 63.0) [2023-10-08 00:25:01,211][50642] Avg episode reward: [(0, '13.300'), (1, '15.740')] [2023-10-08 00:25:01,315][52059] Updated weights for policy 1, policy_version 13742 (0.0011) [2023-10-08 00:25:01,691][52059] Updated weights for policy 1, policy_version 13752 (0.0010) [2023-10-08 00:25:02,562][52060] Updated weights for policy 0, policy_version 13570 (0.0008) [2023-10-08 00:25:02,937][52060] Updated weights for policy 0, policy_version 13580 (0.0010) [2023-10-08 00:25:03,311][52060] Updated weights for policy 0, policy_version 13590 (0.0009) [2023-10-08 00:25:03,690][52060] Updated weights for policy 0, policy_version 13600 (0.0009) [2023-10-08 00:25:05,723][52059] Updated weights for policy 1, policy_version 13762 (0.0009) [2023-10-08 00:25:06,090][52059] Updated weights for policy 1, policy_version 13772 (0.0007) [2023-10-08 00:25:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13773.7). Total num frames: 28016640. Throughput: 0: 1687.2, 1: 1732.2. Samples: 7012386. Policy #0 lag: (min: 31.0, avg: 31.9, max: 50.0) [2023-10-08 00:25:06,211][50642] Avg episode reward: [(0, '14.120'), (1, '15.790')] [2023-10-08 00:25:06,457][52059] Updated weights for policy 1, policy_version 13782 (0.0007) [2023-10-08 00:25:06,823][52059] Updated weights for policy 1, policy_version 13792 (0.0007) [2023-10-08 00:25:07,648][52060] Updated weights for policy 0, policy_version 13610 (0.0009) [2023-10-08 00:25:08,029][52060] Updated weights for policy 0, policy_version 13620 (0.0009) [2023-10-08 00:25:08,409][52060] Updated weights for policy 0, policy_version 13630 (0.0009) [2023-10-08 00:25:10,871][52059] Updated weights for policy 1, policy_version 13802 (0.0012) [2023-10-08 00:25:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 28082176. Throughput: 0: 1705.1, 1: 1745.5. Samples: 7033472. Policy #0 lag: (min: 31.0, avg: 31.9, max: 50.0) [2023-10-08 00:25:11,211][50642] Avg episode reward: [(0, '13.100'), (1, '15.150')] [2023-10-08 00:25:11,247][52059] Updated weights for policy 1, policy_version 13812 (0.0007) [2023-10-08 00:25:11,610][52059] Updated weights for policy 1, policy_version 13822 (0.0009) [2023-10-08 00:25:12,379][52060] Updated weights for policy 0, policy_version 13640 (0.0008) [2023-10-08 00:25:12,742][52060] Updated weights for policy 0, policy_version 13650 (0.0007) [2023-10-08 00:25:13,120][52060] Updated weights for policy 0, policy_version 13660 (0.0009) [2023-10-08 00:25:15,530][52059] Updated weights for policy 1, policy_version 13832 (0.0009) [2023-10-08 00:25:15,898][52059] Updated weights for policy 1, policy_version 13842 (0.0009) [2023-10-08 00:25:16,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 28147712. Throughput: 0: 1720.7, 1: 1720.3. Samples: 7054202. Policy #0 lag: (min: 31.0, avg: 31.9, max: 50.0) [2023-10-08 00:25:16,212][50642] Avg episode reward: [(0, '13.240'), (1, '15.710')] [2023-10-08 00:25:16,265][52059] Updated weights for policy 1, policy_version 13852 (0.0007) [2023-10-08 00:25:17,081][52060] Updated weights for policy 0, policy_version 13670 (0.0009) [2023-10-08 00:25:17,453][52060] Updated weights for policy 0, policy_version 13680 (0.0011) [2023-10-08 00:25:17,824][52060] Updated weights for policy 0, policy_version 13690 (0.0010) [2023-10-08 00:25:20,453][52059] Updated weights for policy 1, policy_version 13862 (0.0007) [2023-10-08 00:25:20,833][52059] Updated weights for policy 1, policy_version 13872 (0.0007) [2023-10-08 00:25:21,198][52059] Updated weights for policy 1, policy_version 13882 (0.0007) [2023-10-08 00:25:21,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 28213248. Throughput: 0: 1695.8, 1: 1741.8. Samples: 7064168. Policy #0 lag: (min: 21.0, avg: 29.0, max: 53.0) [2023-10-08 00:25:21,211][50642] Avg episode reward: [(0, '14.770'), (1, '16.000')] [2023-10-08 00:25:21,213][51605] Saving new best policy, reward=14.770! [2023-10-08 00:25:21,844][52060] Updated weights for policy 0, policy_version 13700 (0.0010) [2023-10-08 00:25:22,225][52060] Updated weights for policy 0, policy_version 13710 (0.0010) [2023-10-08 00:25:22,597][52060] Updated weights for policy 0, policy_version 13720 (0.0009) [2023-10-08 00:25:25,022][52059] Updated weights for policy 1, policy_version 13892 (0.0009) [2023-10-08 00:25:25,385][52059] Updated weights for policy 1, policy_version 13902 (0.0010) [2023-10-08 00:25:25,758][52059] Updated weights for policy 1, policy_version 13912 (0.0008) [2023-10-08 00:25:26,210][50642] Fps is (10 sec: 16384.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 28311552. Throughput: 0: 1719.9, 1: 1735.5. Samples: 7085192. Policy #0 lag: (min: 21.0, avg: 29.0, max: 53.0) [2023-10-08 00:25:26,211][50642] Avg episode reward: [(0, '13.310'), (1, '16.760')] [2023-10-08 00:25:26,574][52060] Updated weights for policy 0, policy_version 13730 (0.0007) [2023-10-08 00:25:26,967][52060] Updated weights for policy 0, policy_version 13740 (0.0009) [2023-10-08 00:25:27,336][52060] Updated weights for policy 0, policy_version 13750 (0.0010) [2023-10-08 00:25:27,705][52060] Updated weights for policy 0, policy_version 13760 (0.0008) [2023-10-08 00:25:29,639][52059] Updated weights for policy 1, policy_version 13922 (0.0008) [2023-10-08 00:25:29,997][52059] Updated weights for policy 1, policy_version 13932 (0.0007) [2023-10-08 00:25:30,353][52059] Updated weights for policy 1, policy_version 13942 (0.0009) [2023-10-08 00:25:30,718][52059] Updated weights for policy 1, policy_version 13952 (0.0010) [2023-10-08 00:25:31,210][50642] Fps is (10 sec: 16384.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 28377088. Throughput: 0: 1713.3, 1: 1703.2. Samples: 7105052. Policy #0 lag: (min: 21.0, avg: 29.0, max: 53.0) [2023-10-08 00:25:31,211][50642] Avg episode reward: [(0, '13.370'), (1, '15.940')] [2023-10-08 00:25:31,621][52060] Updated weights for policy 0, policy_version 13770 (0.0009) [2023-10-08 00:25:31,984][52060] Updated weights for policy 0, policy_version 13780 (0.0009) [2023-10-08 00:25:32,357][52060] Updated weights for policy 0, policy_version 13790 (0.0009) [2023-10-08 00:25:34,736][52059] Updated weights for policy 1, policy_version 13962 (0.0008) [2023-10-08 00:25:35,107][52059] Updated weights for policy 1, policy_version 13972 (0.0008) [2023-10-08 00:25:35,472][52059] Updated weights for policy 1, policy_version 13982 (0.0009) [2023-10-08 00:25:36,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 28442624. Throughput: 0: 1701.5, 1: 1735.1. Samples: 7115738. Policy #0 lag: (min: 31.0, avg: 37.2, max: 63.0) [2023-10-08 00:25:36,211][50642] Avg episode reward: [(0, '12.710'), (1, '16.200')] [2023-10-08 00:25:36,221][52060] Updated weights for policy 0, policy_version 13800 (0.0009) [2023-10-08 00:25:36,597][52060] Updated weights for policy 0, policy_version 13810 (0.0009) [2023-10-08 00:25:36,965][52060] Updated weights for policy 0, policy_version 13820 (0.0008) [2023-10-08 00:25:39,397][52059] Updated weights for policy 1, policy_version 13992 (0.0010) [2023-10-08 00:25:39,757][52059] Updated weights for policy 1, policy_version 14002 (0.0011) [2023-10-08 00:25:40,129][52059] Updated weights for policy 1, policy_version 14012 (0.0009) [2023-10-08 00:25:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 28508160. Throughput: 0: 1710.3, 1: 1713.6. Samples: 7135910. Policy #0 lag: (min: 31.0, avg: 37.2, max: 63.0) [2023-10-08 00:25:41,211][50642] Avg episode reward: [(0, '13.570'), (1, '15.890')] [2023-10-08 00:25:41,268][52060] Updated weights for policy 0, policy_version 13830 (0.0010) [2023-10-08 00:25:41,623][52060] Updated weights for policy 0, policy_version 13840 (0.0009) [2023-10-08 00:25:41,988][52060] Updated weights for policy 0, policy_version 13850 (0.0010) [2023-10-08 00:25:44,083][52059] Updated weights for policy 1, policy_version 14022 (0.0011) [2023-10-08 00:25:44,446][52059] Updated weights for policy 1, policy_version 14032 (0.0009) [2023-10-08 00:25:44,815][52059] Updated weights for policy 1, policy_version 14042 (0.0008) [2023-10-08 00:25:46,016][52060] Updated weights for policy 0, policy_version 13860 (0.0010) [2023-10-08 00:25:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 28573696. Throughput: 0: 1708.1, 1: 1708.4. Samples: 7156626. Policy #0 lag: (min: 31.0, avg: 37.2, max: 63.0) [2023-10-08 00:25:46,211][50642] Avg episode reward: [(0, '14.490'), (1, '16.130')] [2023-10-08 00:25:46,379][52060] Updated weights for policy 0, policy_version 13870 (0.0007) [2023-10-08 00:25:46,754][52060] Updated weights for policy 0, policy_version 13880 (0.0009) [2023-10-08 00:25:48,735][52059] Updated weights for policy 1, policy_version 14052 (0.0008) [2023-10-08 00:25:49,100][52059] Updated weights for policy 1, policy_version 14062 (0.0007) [2023-10-08 00:25:49,463][52059] Updated weights for policy 1, policy_version 14072 (0.0009) [2023-10-08 00:25:50,662][52060] Updated weights for policy 0, policy_version 13890 (0.0007) [2023-10-08 00:25:51,027][52060] Updated weights for policy 0, policy_version 13900 (0.0007) [2023-10-08 00:25:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 28639232. Throughput: 0: 1703.4, 1: 1730.7. Samples: 7166918. Policy #0 lag: (min: 18.0, avg: 32.5, max: 50.0) [2023-10-08 00:25:51,211][50642] Avg episode reward: [(0, '14.110'), (1, '16.100')] [2023-10-08 00:25:51,397][52060] Updated weights for policy 0, policy_version 13910 (0.0008) [2023-10-08 00:25:51,768][52060] Updated weights for policy 0, policy_version 13920 (0.0009) [2023-10-08 00:25:53,380][52059] Updated weights for policy 1, policy_version 14082 (0.0008) [2023-10-08 00:25:53,740][52059] Updated weights for policy 1, policy_version 14092 (0.0008) [2023-10-08 00:25:54,101][52059] Updated weights for policy 1, policy_version 14102 (0.0008) [2023-10-08 00:25:54,469][52059] Updated weights for policy 1, policy_version 14112 (0.0009) [2023-10-08 00:25:55,700][52060] Updated weights for policy 0, policy_version 13930 (0.0009) [2023-10-08 00:25:56,074][52060] Updated weights for policy 0, policy_version 13940 (0.0008) [2023-10-08 00:25:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 28704768. Throughput: 0: 1710.3, 1: 1709.1. Samples: 7187342. Policy #0 lag: (min: 18.0, avg: 32.5, max: 50.0) [2023-10-08 00:25:56,211][50642] Avg episode reward: [(0, '14.240'), (1, '17.120')] [2023-10-08 00:25:56,450][52060] Updated weights for policy 0, policy_version 13950 (0.0009) [2023-10-08 00:25:58,328][52059] Updated weights for policy 1, policy_version 14122 (0.0008) [2023-10-08 00:25:58,692][52059] Updated weights for policy 1, policy_version 14132 (0.0007) [2023-10-08 00:25:59,056][52059] Updated weights for policy 1, policy_version 14142 (0.0011) [2023-10-08 00:26:00,416][52060] Updated weights for policy 0, policy_version 13960 (0.0010) [2023-10-08 00:26:00,789][52060] Updated weights for policy 0, policy_version 13970 (0.0009) [2023-10-08 00:26:01,167][52060] Updated weights for policy 0, policy_version 13980 (0.0009) [2023-10-08 00:26:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 28770304. Throughput: 0: 1689.5, 1: 1725.5. Samples: 7207876. Policy #0 lag: (min: 18.0, avg: 32.5, max: 50.0) [2023-10-08 00:26:01,211][50642] Avg episode reward: [(0, '14.370'), (1, '15.970')] [2023-10-08 00:26:02,991][52059] Updated weights for policy 1, policy_version 14152 (0.0010) [2023-10-08 00:26:03,359][52059] Updated weights for policy 1, policy_version 14162 (0.0010) [2023-10-08 00:26:03,737][52059] Updated weights for policy 1, policy_version 14172 (0.0010) [2023-10-08 00:26:04,990][52060] Updated weights for policy 0, policy_version 13990 (0.0008) [2023-10-08 00:26:05,357][52060] Updated weights for policy 0, policy_version 14000 (0.0008) [2023-10-08 00:26:05,726][52060] Updated weights for policy 0, policy_version 14010 (0.0009) [2023-10-08 00:26:06,210][50642] Fps is (10 sec: 16383.6, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 28868608. Throughput: 0: 1710.8, 1: 1711.9. Samples: 7218192. Policy #0 lag: (min: 24.0, avg: 52.3, max: 56.0) [2023-10-08 00:26:06,211][50642] Avg episode reward: [(0, '14.290'), (1, '14.790')] [2023-10-08 00:26:07,792][52059] Updated weights for policy 1, policy_version 14182 (0.0010) [2023-10-08 00:26:08,174][52059] Updated weights for policy 1, policy_version 14192 (0.0008) [2023-10-08 00:26:08,541][52059] Updated weights for policy 1, policy_version 14202 (0.0008) [2023-10-08 00:26:09,720][52060] Updated weights for policy 0, policy_version 14020 (0.0008) [2023-10-08 00:26:10,100][52060] Updated weights for policy 0, policy_version 14030 (0.0010) [2023-10-08 00:26:10,462][52060] Updated weights for policy 0, policy_version 14040 (0.0010) [2023-10-08 00:26:11,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 28934144. Throughput: 0: 1708.1, 1: 1714.0. Samples: 7239188. Policy #0 lag: (min: 24.0, avg: 52.3, max: 56.0) [2023-10-08 00:26:11,211][50642] Avg episode reward: [(0, '14.620'), (1, '13.690')] [2023-10-08 00:26:12,269][52059] Updated weights for policy 1, policy_version 14212 (0.0008) [2023-10-08 00:26:12,638][52059] Updated weights for policy 1, policy_version 14222 (0.0009) [2023-10-08 00:26:13,001][52059] Updated weights for policy 1, policy_version 14232 (0.0007) [2023-10-08 00:26:14,529][52060] Updated weights for policy 0, policy_version 14050 (0.0008) [2023-10-08 00:26:14,941][52060] Updated weights for policy 0, policy_version 14060 (0.0009) [2023-10-08 00:26:15,305][52060] Updated weights for policy 0, policy_version 14070 (0.0008) [2023-10-08 00:26:15,668][52060] Updated weights for policy 0, policy_version 14080 (0.0009) [2023-10-08 00:26:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 28999680. Throughput: 0: 1684.0, 1: 1747.3. Samples: 7259462. Policy #0 lag: (min: 3.0, avg: 6.9, max: 35.0) [2023-10-08 00:26:16,211][50642] Avg episode reward: [(0, '14.160'), (1, '16.570')] [2023-10-08 00:26:16,895][52059] Updated weights for policy 1, policy_version 14242 (0.0007) [2023-10-08 00:26:17,260][52059] Updated weights for policy 1, policy_version 14252 (0.0008) [2023-10-08 00:26:17,621][52059] Updated weights for policy 1, policy_version 14262 (0.0009) [2023-10-08 00:26:17,996][52059] Updated weights for policy 1, policy_version 14272 (0.0009) [2023-10-08 00:26:19,641][52060] Updated weights for policy 0, policy_version 14090 (0.0008) [2023-10-08 00:26:20,006][52060] Updated weights for policy 0, policy_version 14100 (0.0010) [2023-10-08 00:26:20,374][52060] Updated weights for policy 0, policy_version 14110 (0.0009) [2023-10-08 00:26:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 29065216. Throughput: 0: 1714.7, 1: 1715.6. Samples: 7270100. Policy #0 lag: (min: 3.0, avg: 6.9, max: 35.0) [2023-10-08 00:26:21,211][50642] Avg episode reward: [(0, '14.350'), (1, '15.140')] [2023-10-08 00:26:22,136][52059] Updated weights for policy 1, policy_version 14282 (0.0008) [2023-10-08 00:26:22,510][52059] Updated weights for policy 1, policy_version 14292 (0.0009) [2023-10-08 00:26:22,867][52059] Updated weights for policy 1, policy_version 14302 (0.0008) [2023-10-08 00:26:24,383][52060] Updated weights for policy 0, policy_version 14120 (0.0010) [2023-10-08 00:26:24,749][52060] Updated weights for policy 0, policy_version 14130 (0.0010) [2023-10-08 00:26:25,128][52060] Updated weights for policy 0, policy_version 14140 (0.0009) [2023-10-08 00:26:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 29130752. Throughput: 0: 1696.4, 1: 1736.0. Samples: 7290366. Policy #0 lag: (min: 3.0, avg: 6.9, max: 35.0) [2023-10-08 00:26:26,211][50642] Avg episode reward: [(0, '14.900'), (1, '14.140')] [2023-10-08 00:26:26,212][51605] Saving new best policy, reward=14.900! [2023-10-08 00:26:26,756][52059] Updated weights for policy 1, policy_version 14312 (0.0008) [2023-10-08 00:26:27,121][52059] Updated weights for policy 1, policy_version 14322 (0.0008) [2023-10-08 00:26:27,487][52059] Updated weights for policy 1, policy_version 14332 (0.0007) [2023-10-08 00:26:29,265][52060] Updated weights for policy 0, policy_version 14150 (0.0011) [2023-10-08 00:26:29,628][52060] Updated weights for policy 0, policy_version 14160 (0.0008) [2023-10-08 00:26:30,000][52060] Updated weights for policy 0, policy_version 14170 (0.0009) [2023-10-08 00:26:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 29196288. Throughput: 0: 1680.0, 1: 1747.5. Samples: 7310860. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) [2023-10-08 00:26:31,211][50642] Avg episode reward: [(0, '14.450'), (1, '15.400')] [2023-10-08 00:26:31,254][52059] Updated weights for policy 1, policy_version 14342 (0.0007) [2023-10-08 00:26:31,611][52059] Updated weights for policy 1, policy_version 14352 (0.0008) [2023-10-08 00:26:31,974][52059] Updated weights for policy 1, policy_version 14362 (0.0010) [2023-10-08 00:26:34,104][52060] Updated weights for policy 0, policy_version 14180 (0.0009) [2023-10-08 00:26:34,485][52060] Updated weights for policy 0, policy_version 14190 (0.0010) [2023-10-08 00:26:34,858][52060] Updated weights for policy 0, policy_version 14200 (0.0010) [2023-10-08 00:26:36,008][52059] Updated weights for policy 1, policy_version 14372 (0.0009) [2023-10-08 00:26:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 29261824. Throughput: 0: 1714.5, 1: 1721.2. Samples: 7321526. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) [2023-10-08 00:26:36,211][50642] Avg episode reward: [(0, '14.090'), (1, '15.740')] [2023-10-08 00:26:36,380][52059] Updated weights for policy 1, policy_version 14382 (0.0011) [2023-10-08 00:26:36,754][52059] Updated weights for policy 1, policy_version 14392 (0.0009) [2023-10-08 00:26:38,874][52060] Updated weights for policy 0, policy_version 14210 (0.0011) [2023-10-08 00:26:39,246][52060] Updated weights for policy 0, policy_version 14220 (0.0008) [2023-10-08 00:26:39,617][52060] Updated weights for policy 0, policy_version 14230 (0.0008) [2023-10-08 00:26:39,994][52060] Updated weights for policy 0, policy_version 14240 (0.0008) [2023-10-08 00:26:40,734][52059] Updated weights for policy 1, policy_version 14402 (0.0008) [2023-10-08 00:26:41,097][52059] Updated weights for policy 1, policy_version 14412 (0.0008) [2023-10-08 00:26:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 29327360. Throughput: 0: 1684.5, 1: 1743.0. Samples: 7341580. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) [2023-10-08 00:26:41,211][50642] Avg episode reward: [(0, '14.240'), (1, '15.000')] [2023-10-08 00:26:41,466][52059] Updated weights for policy 1, policy_version 14422 (0.0009) [2023-10-08 00:26:41,839][52059] Updated weights for policy 1, policy_version 14432 (0.0007) [2023-10-08 00:26:43,776][52060] Updated weights for policy 0, policy_version 14250 (0.0008) [2023-10-08 00:26:44,139][52060] Updated weights for policy 0, policy_version 14260 (0.0008) [2023-10-08 00:26:44,522][52060] Updated weights for policy 0, policy_version 14270 (0.0011) [2023-10-08 00:26:45,788][52059] Updated weights for policy 1, policy_version 14442 (0.0010) [2023-10-08 00:26:46,161][52059] Updated weights for policy 1, policy_version 14452 (0.0009) [2023-10-08 00:26:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 29392896. Throughput: 0: 1700.8, 1: 1732.7. Samples: 7362382. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:26:46,211][50642] Avg episode reward: [(0, '14.650'), (1, '17.270')] [2023-10-08 00:26:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000014272_14614528.pth... [2023-10-08 00:26:46,251][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000012672_12976128.pth [2023-10-08 00:26:46,526][52059] Updated weights for policy 1, policy_version 14462 (0.0008) [2023-10-08 00:26:46,595][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000014464_14811136.pth... [2023-10-08 00:26:46,635][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000012832_13139968.pth [2023-10-08 00:26:48,562][52060] Updated weights for policy 0, policy_version 14280 (0.0008) [2023-10-08 00:26:48,934][52060] Updated weights for policy 0, policy_version 14290 (0.0007) [2023-10-08 00:26:49,297][52060] Updated weights for policy 0, policy_version 14300 (0.0007) [2023-10-08 00:26:50,495][52059] Updated weights for policy 1, policy_version 14472 (0.0009) [2023-10-08 00:26:50,865][52059] Updated weights for policy 1, policy_version 14482 (0.0007) [2023-10-08 00:26:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 29458432. Throughput: 0: 1696.4, 1: 1741.0. Samples: 7372876. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:26:51,211][50642] Avg episode reward: [(0, '13.920'), (1, '16.070')] [2023-10-08 00:26:51,227][52059] Updated weights for policy 1, policy_version 14492 (0.0009) [2023-10-08 00:26:53,347][52060] Updated weights for policy 0, policy_version 14310 (0.0007) [2023-10-08 00:26:53,718][52060] Updated weights for policy 0, policy_version 14320 (0.0007) [2023-10-08 00:26:54,091][52060] Updated weights for policy 0, policy_version 14330 (0.0008) [2023-10-08 00:26:55,312][52059] Updated weights for policy 1, policy_version 14502 (0.0010) [2023-10-08 00:26:55,691][52059] Updated weights for policy 1, policy_version 14512 (0.0011) [2023-10-08 00:26:56,054][52059] Updated weights for policy 1, policy_version 14522 (0.0010) [2023-10-08 00:26:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 29523968. Throughput: 0: 1680.1, 1: 1744.4. Samples: 7393292. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:26:56,211][50642] Avg episode reward: [(0, '14.270'), (1, '13.590')] [2023-10-08 00:26:57,999][52060] Updated weights for policy 0, policy_version 14340 (0.0009) [2023-10-08 00:26:58,374][52060] Updated weights for policy 0, policy_version 14350 (0.0008) [2023-10-08 00:26:58,747][52060] Updated weights for policy 0, policy_version 14360 (0.0009) [2023-10-08 00:26:59,980][52059] Updated weights for policy 1, policy_version 14532 (0.0007) [2023-10-08 00:27:00,340][52059] Updated weights for policy 1, policy_version 14542 (0.0008) [2023-10-08 00:27:00,706][52059] Updated weights for policy 1, policy_version 14552 (0.0009) [2023-10-08 00:27:01,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 29622272. Throughput: 0: 1708.5, 1: 1710.5. Samples: 7413316. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 00:27:01,211][50642] Avg episode reward: [(0, '14.290'), (1, '17.120')] [2023-10-08 00:27:02,730][52060] Updated weights for policy 0, policy_version 14370 (0.0009) [2023-10-08 00:27:03,113][52060] Updated weights for policy 0, policy_version 14380 (0.0007) [2023-10-08 00:27:03,486][52060] Updated weights for policy 0, policy_version 14390 (0.0008) [2023-10-08 00:27:03,858][52060] Updated weights for policy 0, policy_version 14400 (0.0007) [2023-10-08 00:27:04,482][52059] Updated weights for policy 1, policy_version 14562 (0.0009) [2023-10-08 00:27:04,848][52059] Updated weights for policy 1, policy_version 14572 (0.0007) [2023-10-08 00:27:05,210][52059] Updated weights for policy 1, policy_version 14582 (0.0007) [2023-10-08 00:27:05,574][52059] Updated weights for policy 1, policy_version 14592 (0.0009) [2023-10-08 00:27:06,210][50642] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 29687808. Throughput: 0: 1680.9, 1: 1739.7. Samples: 7424028. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 00:27:06,211][50642] Avg episode reward: [(0, '14.330'), (1, '16.140')] [2023-10-08 00:27:07,907][52060] Updated weights for policy 0, policy_version 14410 (0.0008) [2023-10-08 00:27:08,273][52060] Updated weights for policy 0, policy_version 14420 (0.0008) [2023-10-08 00:27:08,645][52060] Updated weights for policy 0, policy_version 14430 (0.0009) [2023-10-08 00:27:09,386][52059] Updated weights for policy 1, policy_version 14602 (0.0008) [2023-10-08 00:27:09,751][52059] Updated weights for policy 1, policy_version 14612 (0.0008) [2023-10-08 00:27:10,114][52059] Updated weights for policy 1, policy_version 14622 (0.0011) [2023-10-08 00:27:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 29753344. Throughput: 0: 1697.9, 1: 1722.2. Samples: 7444270. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 00:27:11,211][50642] Avg episode reward: [(0, '15.070'), (1, '16.550')] [2023-10-08 00:27:11,211][51605] Saving new best policy, reward=15.070! [2023-10-08 00:27:12,666][52060] Updated weights for policy 0, policy_version 14440 (0.0008) [2023-10-08 00:27:13,032][52060] Updated weights for policy 0, policy_version 14450 (0.0010) [2023-10-08 00:27:13,411][52060] Updated weights for policy 0, policy_version 14460 (0.0008) [2023-10-08 00:27:14,068][52059] Updated weights for policy 1, policy_version 14632 (0.0010) [2023-10-08 00:27:14,436][52059] Updated weights for policy 1, policy_version 14642 (0.0010) [2023-10-08 00:27:14,797][52059] Updated weights for policy 1, policy_version 14652 (0.0007) [2023-10-08 00:27:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 29818880. Throughput: 0: 1720.8, 1: 1712.7. Samples: 7465370. Policy #0 lag: (min: 31.0, avg: 33.1, max: 62.0) [2023-10-08 00:27:16,211][50642] Avg episode reward: [(0, '14.360'), (1, '16.280')] [2023-10-08 00:27:17,213][52060] Updated weights for policy 0, policy_version 14470 (0.0007) [2023-10-08 00:27:17,590][52060] Updated weights for policy 0, policy_version 14480 (0.0009) [2023-10-08 00:27:17,972][52060] Updated weights for policy 0, policy_version 14490 (0.0009) [2023-10-08 00:27:18,619][52059] Updated weights for policy 1, policy_version 14662 (0.0007) [2023-10-08 00:27:18,980][52059] Updated weights for policy 1, policy_version 14672 (0.0007) [2023-10-08 00:27:19,349][52059] Updated weights for policy 1, policy_version 14682 (0.0008) [2023-10-08 00:27:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 29884416. Throughput: 0: 1691.1, 1: 1738.9. Samples: 7475874. Policy #0 lag: (min: 31.0, avg: 33.1, max: 62.0) [2023-10-08 00:27:21,211][50642] Avg episode reward: [(0, '14.700'), (1, '16.320')] [2023-10-08 00:27:21,917][52060] Updated weights for policy 0, policy_version 14500 (0.0008) [2023-10-08 00:27:22,285][52060] Updated weights for policy 0, policy_version 14510 (0.0008) [2023-10-08 00:27:22,661][52060] Updated weights for policy 0, policy_version 14520 (0.0007) [2023-10-08 00:27:23,255][52059] Updated weights for policy 1, policy_version 14692 (0.0008) [2023-10-08 00:27:23,624][52059] Updated weights for policy 1, policy_version 14702 (0.0009) [2023-10-08 00:27:23,982][52059] Updated weights for policy 1, policy_version 14712 (0.0008) [2023-10-08 00:27:26,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 29949952. Throughput: 0: 1718.9, 1: 1722.1. Samples: 7496424. Policy #0 lag: (min: 31.0, avg: 33.1, max: 62.0) [2023-10-08 00:27:26,211][50642] Avg episode reward: [(0, '14.640'), (1, '16.820')] [2023-10-08 00:27:26,672][52060] Updated weights for policy 0, policy_version 14530 (0.0007) [2023-10-08 00:27:27,048][52060] Updated weights for policy 0, policy_version 14540 (0.0008) [2023-10-08 00:27:27,420][52060] Updated weights for policy 0, policy_version 14550 (0.0008) [2023-10-08 00:27:27,784][52060] Updated weights for policy 0, policy_version 14560 (0.0008) [2023-10-08 00:27:28,005][52059] Updated weights for policy 1, policy_version 14722 (0.0008) [2023-10-08 00:27:28,364][52059] Updated weights for policy 1, policy_version 14732 (0.0009) [2023-10-08 00:27:28,726][52059] Updated weights for policy 1, policy_version 14742 (0.0007) [2023-10-08 00:27:29,086][52059] Updated weights for policy 1, policy_version 14752 (0.0008) [2023-10-08 00:27:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 30015488. Throughput: 0: 1718.4, 1: 1725.8. Samples: 7517370. Policy #0 lag: (min: 31.0, avg: 31.1, max: 37.0) [2023-10-08 00:27:31,211][50642] Avg episode reward: [(0, '14.210'), (1, '16.930')] [2023-10-08 00:27:31,898][52060] Updated weights for policy 0, policy_version 14570 (0.0009) [2023-10-08 00:27:32,266][52060] Updated weights for policy 0, policy_version 14580 (0.0008) [2023-10-08 00:27:32,636][52060] Updated weights for policy 0, policy_version 14590 (0.0010) [2023-10-08 00:27:32,984][52059] Updated weights for policy 1, policy_version 14762 (0.0010) [2023-10-08 00:27:33,354][52059] Updated weights for policy 1, policy_version 14772 (0.0009) [2023-10-08 00:27:33,730][52059] Updated weights for policy 1, policy_version 14782 (0.0009) [2023-10-08 00:27:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 30081024. Throughput: 0: 1704.0, 1: 1716.4. Samples: 7526794. Policy #0 lag: (min: 31.0, avg: 31.1, max: 37.0) [2023-10-08 00:27:36,211][50642] Avg episode reward: [(0, '15.440'), (1, '15.670')] [2023-10-08 00:27:36,211][51605] Saving new best policy, reward=15.440! [2023-10-08 00:27:36,653][52060] Updated weights for policy 0, policy_version 14600 (0.0008) [2023-10-08 00:27:37,032][52060] Updated weights for policy 0, policy_version 14610 (0.0007) [2023-10-08 00:27:37,396][52060] Updated weights for policy 0, policy_version 14620 (0.0007) [2023-10-08 00:27:37,689][52059] Updated weights for policy 1, policy_version 14792 (0.0010) [2023-10-08 00:27:38,065][52059] Updated weights for policy 1, policy_version 14802 (0.0009) [2023-10-08 00:27:38,423][52059] Updated weights for policy 1, policy_version 14812 (0.0009) [2023-10-08 00:27:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 30146560. Throughput: 0: 1720.5, 1: 1712.6. Samples: 7547784. Policy #0 lag: (min: 31.0, avg: 31.1, max: 37.0) [2023-10-08 00:27:41,211][50642] Avg episode reward: [(0, '13.880'), (1, '16.830')] [2023-10-08 00:27:41,378][52060] Updated weights for policy 0, policy_version 14630 (0.0010) [2023-10-08 00:27:41,749][52060] Updated weights for policy 0, policy_version 14640 (0.0008) [2023-10-08 00:27:42,127][52060] Updated weights for policy 0, policy_version 14650 (0.0008) [2023-10-08 00:27:42,375][52059] Updated weights for policy 1, policy_version 14822 (0.0009) [2023-10-08 00:27:42,761][52059] Updated weights for policy 1, policy_version 14832 (0.0009) [2023-10-08 00:27:43,124][52059] Updated weights for policy 1, policy_version 14842 (0.0009) [2023-10-08 00:27:45,855][52060] Updated weights for policy 0, policy_version 14660 (0.0007) [2023-10-08 00:27:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 30212096. Throughput: 0: 1722.3, 1: 1740.3. Samples: 7569130. Policy #0 lag: (min: 31.0, avg: 36.1, max: 63.0) [2023-10-08 00:27:46,211][50642] Avg episode reward: [(0, '15.290'), (1, '17.000')] [2023-10-08 00:27:46,230][52060] Updated weights for policy 0, policy_version 14670 (0.0008) [2023-10-08 00:27:46,591][52060] Updated weights for policy 0, policy_version 14680 (0.0009) [2023-10-08 00:27:47,119][52059] Updated weights for policy 1, policy_version 14852 (0.0009) [2023-10-08 00:27:47,493][52059] Updated weights for policy 1, policy_version 14862 (0.0007) [2023-10-08 00:27:47,854][52059] Updated weights for policy 1, policy_version 14872 (0.0009) [2023-10-08 00:27:50,647][52060] Updated weights for policy 0, policy_version 14690 (0.0011) [2023-10-08 00:27:51,054][52060] Updated weights for policy 0, policy_version 14700 (0.0009) [2023-10-08 00:27:51,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 30277632. Throughput: 0: 1722.8, 1: 1713.7. Samples: 7578672. Policy #0 lag: (min: 31.0, avg: 36.1, max: 63.0) [2023-10-08 00:27:51,211][50642] Avg episode reward: [(0, '15.460'), (1, '16.270')] [2023-10-08 00:27:51,433][52060] Updated weights for policy 0, policy_version 14710 (0.0009) [2023-10-08 00:27:51,790][51605] Saving new best policy, reward=15.460! [2023-10-08 00:27:51,792][52060] Updated weights for policy 0, policy_version 14720 (0.0009) [2023-10-08 00:27:51,842][52059] Updated weights for policy 1, policy_version 14882 (0.0010) [2023-10-08 00:27:52,215][52059] Updated weights for policy 1, policy_version 14892 (0.0008) [2023-10-08 00:27:52,573][52059] Updated weights for policy 1, policy_version 14902 (0.0008) [2023-10-08 00:27:52,931][52059] Updated weights for policy 1, policy_version 14912 (0.0010) [2023-10-08 00:27:55,842][52060] Updated weights for policy 0, policy_version 14730 (0.0009) [2023-10-08 00:27:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 30343168. Throughput: 0: 1722.0, 1: 1730.8. Samples: 7599648. Policy #0 lag: (min: 31.0, avg: 36.1, max: 63.0) [2023-10-08 00:27:56,211][50642] Avg episode reward: [(0, '13.620'), (1, '15.000')] [2023-10-08 00:27:56,225][52060] Updated weights for policy 0, policy_version 14740 (0.0007) [2023-10-08 00:27:56,595][52060] Updated weights for policy 0, policy_version 14750 (0.0008) [2023-10-08 00:27:56,926][52059] Updated weights for policy 1, policy_version 14922 (0.0011) [2023-10-08 00:27:57,296][52059] Updated weights for policy 1, policy_version 14932 (0.0011) [2023-10-08 00:27:57,664][52059] Updated weights for policy 1, policy_version 14942 (0.0011) [2023-10-08 00:28:00,547][52060] Updated weights for policy 0, policy_version 14760 (0.0008) [2023-10-08 00:28:00,929][52060] Updated weights for policy 0, policy_version 14770 (0.0008) [2023-10-08 00:28:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 30408704. Throughput: 0: 1704.7, 1: 1740.5. Samples: 7620402. Policy #0 lag: (min: 5.0, avg: 7.1, max: 36.0) [2023-10-08 00:28:01,212][50642] Avg episode reward: [(0, '15.100'), (1, '16.070')] [2023-10-08 00:28:01,288][52060] Updated weights for policy 0, policy_version 14780 (0.0009) [2023-10-08 00:28:01,512][52059] Updated weights for policy 1, policy_version 14952 (0.0009) [2023-10-08 00:28:01,886][52059] Updated weights for policy 1, policy_version 14962 (0.0008) [2023-10-08 00:28:02,245][52059] Updated weights for policy 1, policy_version 14972 (0.0008) [2023-10-08 00:28:05,354][52060] Updated weights for policy 0, policy_version 14790 (0.0007) [2023-10-08 00:28:05,721][52060] Updated weights for policy 0, policy_version 14800 (0.0007) [2023-10-08 00:28:06,007][52059] Updated weights for policy 1, policy_version 14982 (0.0008) [2023-10-08 00:28:06,093][52060] Updated weights for policy 0, policy_version 14810 (0.0007) [2023-10-08 00:28:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 30474240. Throughput: 0: 1715.5, 1: 1716.6. Samples: 7630318. Policy #0 lag: (min: 5.0, avg: 7.1, max: 36.0) [2023-10-08 00:28:06,211][50642] Avg episode reward: [(0, '14.110'), (1, '15.010')] [2023-10-08 00:28:06,382][52059] Updated weights for policy 1, policy_version 14992 (0.0008) [2023-10-08 00:28:06,743][52059] Updated weights for policy 1, policy_version 15002 (0.0007) [2023-10-08 00:28:10,139][52060] Updated weights for policy 0, policy_version 14820 (0.0008) [2023-10-08 00:28:10,494][52060] Updated weights for policy 0, policy_version 14830 (0.0009) [2023-10-08 00:28:10,617][52059] Updated weights for policy 1, policy_version 15012 (0.0009) [2023-10-08 00:28:10,868][52060] Updated weights for policy 0, policy_version 14840 (0.0009) [2023-10-08 00:28:10,974][52059] Updated weights for policy 1, policy_version 15022 (0.0008) [2023-10-08 00:28:11,210][50642] Fps is (10 sec: 16384.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 30572544. Throughput: 0: 1714.6, 1: 1740.0. Samples: 7651880. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:28:11,211][50642] Avg episode reward: [(0, '13.440'), (1, '15.720')] [2023-10-08 00:28:11,337][52059] Updated weights for policy 1, policy_version 15032 (0.0008) [2023-10-08 00:28:14,735][52060] Updated weights for policy 0, policy_version 14850 (0.0009) [2023-10-08 00:28:15,100][52060] Updated weights for policy 0, policy_version 14860 (0.0007) [2023-10-08 00:28:15,413][52059] Updated weights for policy 1, policy_version 15042 (0.0008) [2023-10-08 00:28:15,481][52060] Updated weights for policy 0, policy_version 14870 (0.0007) [2023-10-08 00:28:15,788][52059] Updated weights for policy 1, policy_version 15052 (0.0010) [2023-10-08 00:28:15,853][52060] Updated weights for policy 0, policy_version 14880 (0.0007) [2023-10-08 00:28:16,150][52059] Updated weights for policy 1, policy_version 15062 (0.0008) [2023-10-08 00:28:16,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 30638080. Throughput: 0: 1689.4, 1: 1733.5. Samples: 7671400. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:28:16,211][50642] Avg episode reward: [(0, '14.570'), (1, '16.750')] [2023-10-08 00:28:16,510][52059] Updated weights for policy 1, policy_version 15072 (0.0009) [2023-10-08 00:28:19,712][52060] Updated weights for policy 0, policy_version 14890 (0.0007) [2023-10-08 00:28:20,083][52060] Updated weights for policy 0, policy_version 14900 (0.0008) [2023-10-08 00:28:20,232][52059] Updated weights for policy 1, policy_version 15082 (0.0007) [2023-10-08 00:28:20,448][52060] Updated weights for policy 0, policy_version 14910 (0.0008) [2023-10-08 00:28:20,602][52059] Updated weights for policy 1, policy_version 15092 (0.0007) [2023-10-08 00:28:20,973][52059] Updated weights for policy 1, policy_version 15102 (0.0010) [2023-10-08 00:28:21,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 30736384. Throughput: 0: 1716.4, 1: 1744.7. Samples: 7682546. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:28:21,211][50642] Avg episode reward: [(0, '14.500'), (1, '16.020')] [2023-10-08 00:28:24,411][52060] Updated weights for policy 0, policy_version 14920 (0.0008) [2023-10-08 00:28:24,793][52060] Updated weights for policy 0, policy_version 14930 (0.0008) [2023-10-08 00:28:24,962][52059] Updated weights for policy 1, policy_version 15112 (0.0007) [2023-10-08 00:28:25,161][52060] Updated weights for policy 0, policy_version 14940 (0.0007) [2023-10-08 00:28:25,321][52059] Updated weights for policy 1, policy_version 15122 (0.0009) [2023-10-08 00:28:25,685][52059] Updated weights for policy 1, policy_version 15132 (0.0007) [2023-10-08 00:28:26,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 30801920. Throughput: 0: 1705.0, 1: 1743.3. Samples: 7702956. Policy #0 lag: (min: 5.0, avg: 8.8, max: 37.0) [2023-10-08 00:28:26,211][50642] Avg episode reward: [(0, '14.760'), (1, '15.730')] [2023-10-08 00:28:29,208][52060] Updated weights for policy 0, policy_version 14950 (0.0008) [2023-10-08 00:28:29,580][52060] Updated weights for policy 0, policy_version 14960 (0.0008) [2023-10-08 00:28:29,733][52059] Updated weights for policy 1, policy_version 15142 (0.0009) [2023-10-08 00:28:29,953][52060] Updated weights for policy 0, policy_version 14970 (0.0009) [2023-10-08 00:28:30,101][52059] Updated weights for policy 1, policy_version 15152 (0.0007) [2023-10-08 00:28:30,465][52059] Updated weights for policy 1, policy_version 15162 (0.0010) [2023-10-08 00:28:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 30867456. Throughput: 0: 1687.6, 1: 1719.5. Samples: 7722452. Policy #0 lag: (min: 5.0, avg: 8.8, max: 37.0) [2023-10-08 00:28:31,211][50642] Avg episode reward: [(0, '15.320'), (1, '15.840')] [2023-10-08 00:28:33,859][52060] Updated weights for policy 0, policy_version 14980 (0.0009) [2023-10-08 00:28:34,226][52060] Updated weights for policy 0, policy_version 14990 (0.0008) [2023-10-08 00:28:34,379][52059] Updated weights for policy 1, policy_version 15172 (0.0008) [2023-10-08 00:28:34,596][52060] Updated weights for policy 0, policy_version 15000 (0.0009) [2023-10-08 00:28:34,741][52059] Updated weights for policy 1, policy_version 15182 (0.0007) [2023-10-08 00:28:35,109][52059] Updated weights for policy 1, policy_version 15192 (0.0010) [2023-10-08 00:28:36,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 30932992. Throughput: 0: 1713.6, 1: 1745.7. Samples: 7734338. Policy #0 lag: (min: 5.0, avg: 8.8, max: 37.0) [2023-10-08 00:28:36,211][50642] Avg episode reward: [(0, '14.800'), (1, '15.540')] [2023-10-08 00:28:38,660][52060] Updated weights for policy 0, policy_version 15010 (0.0010) [2023-10-08 00:28:39,064][52060] Updated weights for policy 0, policy_version 15020 (0.0008) [2023-10-08 00:28:39,119][52059] Updated weights for policy 1, policy_version 15202 (0.0008) [2023-10-08 00:28:39,436][52060] Updated weights for policy 0, policy_version 15030 (0.0007) [2023-10-08 00:28:39,472][52059] Updated weights for policy 1, policy_version 15212 (0.0007) [2023-10-08 00:28:39,807][52060] Updated weights for policy 0, policy_version 15040 (0.0007) [2023-10-08 00:28:39,839][52059] Updated weights for policy 1, policy_version 15222 (0.0007) [2023-10-08 00:28:40,196][52059] Updated weights for policy 1, policy_version 15232 (0.0007) [2023-10-08 00:28:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 30998528. Throughput: 0: 1691.4, 1: 1726.0. Samples: 7753430. Policy #0 lag: (min: 31.0, avg: 35.5, max: 63.0) [2023-10-08 00:28:41,211][50642] Avg episode reward: [(0, '14.310'), (1, '15.930')] [2023-10-08 00:28:43,757][52060] Updated weights for policy 0, policy_version 15050 (0.0010) [2023-10-08 00:28:44,096][52059] Updated weights for policy 1, policy_version 15242 (0.0008) [2023-10-08 00:28:44,125][52060] Updated weights for policy 0, policy_version 15060 (0.0010) [2023-10-08 00:28:44,456][52059] Updated weights for policy 1, policy_version 15252 (0.0007) [2023-10-08 00:28:44,495][52060] Updated weights for policy 0, policy_version 15070 (0.0009) [2023-10-08 00:28:44,821][52059] Updated weights for policy 1, policy_version 15262 (0.0007) [2023-10-08 00:28:46,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 31064064. Throughput: 0: 1704.2, 1: 1712.3. Samples: 7774144. Policy #0 lag: (min: 31.0, avg: 35.5, max: 63.0) [2023-10-08 00:28:46,211][50642] Avg episode reward: [(0, '14.580'), (1, '16.800')] [2023-10-08 00:28:46,225][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000015072_15433728.pth... [2023-10-08 00:28:46,225][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000015264_15630336.pth... [2023-10-08 00:28:46,258][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000013632_13959168.pth [2023-10-08 00:28:46,264][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000013472_13795328.pth [2023-10-08 00:28:48,539][52060] Updated weights for policy 0, policy_version 15080 (0.0009) [2023-10-08 00:28:48,873][52059] Updated weights for policy 1, policy_version 15272 (0.0007) [2023-10-08 00:28:48,916][52060] Updated weights for policy 0, policy_version 15090 (0.0008) [2023-10-08 00:28:49,235][52059] Updated weights for policy 1, policy_version 15282 (0.0009) [2023-10-08 00:28:49,284][52060] Updated weights for policy 0, policy_version 15100 (0.0009) [2023-10-08 00:28:49,602][52059] Updated weights for policy 1, policy_version 15292 (0.0007) [2023-10-08 00:28:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 31129600. Throughput: 0: 1707.0, 1: 1729.6. Samples: 7784968. Policy #0 lag: (min: 31.0, avg: 35.5, max: 63.0) [2023-10-08 00:28:51,211][50642] Avg episode reward: [(0, '15.020'), (1, '15.370')] [2023-10-08 00:28:53,417][52060] Updated weights for policy 0, policy_version 15110 (0.0007) [2023-10-08 00:28:53,523][52059] Updated weights for policy 1, policy_version 15302 (0.0008) [2023-10-08 00:28:53,781][52060] Updated weights for policy 0, policy_version 15120 (0.0007) [2023-10-08 00:28:53,888][52059] Updated weights for policy 1, policy_version 15312 (0.0009) [2023-10-08 00:28:54,154][52060] Updated weights for policy 0, policy_version 15130 (0.0008) [2023-10-08 00:28:54,259][52059] Updated weights for policy 1, policy_version 15322 (0.0010) [2023-10-08 00:28:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13662.6). Total num frames: 31195136. Throughput: 0: 1683.9, 1: 1703.1. Samples: 7804294. Policy #0 lag: (min: 33.0, avg: 54.5, max: 56.0) [2023-10-08 00:28:56,211][50642] Avg episode reward: [(0, '14.410'), (1, '15.700')] [2023-10-08 00:28:58,164][52060] Updated weights for policy 0, policy_version 15140 (0.0009) [2023-10-08 00:28:58,249][52059] Updated weights for policy 1, policy_version 15332 (0.0008) [2023-10-08 00:28:58,547][52060] Updated weights for policy 0, policy_version 15150 (0.0009) [2023-10-08 00:28:58,615][52059] Updated weights for policy 1, policy_version 15342 (0.0009) [2023-10-08 00:28:58,916][52060] Updated weights for policy 0, policy_version 15160 (0.0008) [2023-10-08 00:28:58,981][52059] Updated weights for policy 1, policy_version 15352 (0.0007) [2023-10-08 00:29:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13662.6). Total num frames: 31260672. Throughput: 0: 1709.3, 1: 1722.3. Samples: 7825820. Policy #0 lag: (min: 33.0, avg: 54.5, max: 56.0) [2023-10-08 00:29:01,211][50642] Avg episode reward: [(0, '15.360'), (1, '17.630')] [2023-10-08 00:29:02,790][52059] Updated weights for policy 1, policy_version 15362 (0.0008) [2023-10-08 00:29:02,897][52060] Updated weights for policy 0, policy_version 15170 (0.0008) [2023-10-08 00:29:03,168][52059] Updated weights for policy 1, policy_version 15372 (0.0009) [2023-10-08 00:29:03,269][52060] Updated weights for policy 0, policy_version 15180 (0.0008) [2023-10-08 00:29:03,522][52059] Updated weights for policy 1, policy_version 15382 (0.0008) [2023-10-08 00:29:03,637][52060] Updated weights for policy 0, policy_version 15190 (0.0009) [2023-10-08 00:29:03,889][52059] Updated weights for policy 1, policy_version 15392 (0.0009) [2023-10-08 00:29:03,999][52060] Updated weights for policy 0, policy_version 15200 (0.0007) [2023-10-08 00:29:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13662.6). Total num frames: 31326208. Throughput: 0: 1688.5, 1: 1713.0. Samples: 7835614. Policy #0 lag: (min: 33.0, avg: 54.5, max: 56.0) [2023-10-08 00:29:06,211][50642] Avg episode reward: [(0, '14.730'), (1, '16.110')] [2023-10-08 00:29:07,896][52059] Updated weights for policy 1, policy_version 15402 (0.0008) [2023-10-08 00:29:07,987][52060] Updated weights for policy 0, policy_version 15210 (0.0009) [2023-10-08 00:29:08,251][52059] Updated weights for policy 1, policy_version 15412 (0.0008) [2023-10-08 00:29:08,356][52060] Updated weights for policy 0, policy_version 15220 (0.0009) [2023-10-08 00:29:08,618][52059] Updated weights for policy 1, policy_version 15422 (0.0008) [2023-10-08 00:29:08,736][52060] Updated weights for policy 0, policy_version 15230 (0.0011) [2023-10-08 00:29:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 31391744. Throughput: 0: 1698.0, 1: 1715.8. Samples: 7856574. Policy #0 lag: (min: 31.0, avg: 31.0, max: 35.0) [2023-10-08 00:29:11,211][50642] Avg episode reward: [(0, '13.760'), (1, '15.490')] [2023-10-08 00:29:12,557][52059] Updated weights for policy 1, policy_version 15432 (0.0008) [2023-10-08 00:29:12,697][52060] Updated weights for policy 0, policy_version 15240 (0.0009) [2023-10-08 00:29:12,922][52059] Updated weights for policy 1, policy_version 15442 (0.0008) [2023-10-08 00:29:13,063][52060] Updated weights for policy 0, policy_version 15250 (0.0008) [2023-10-08 00:29:13,294][52059] Updated weights for policy 1, policy_version 15452 (0.0007) [2023-10-08 00:29:13,429][52060] Updated weights for policy 0, policy_version 15260 (0.0008) [2023-10-08 00:29:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 31457280. Throughput: 0: 1710.4, 1: 1743.6. Samples: 7877882. Policy #0 lag: (min: 31.0, avg: 31.0, max: 35.0) [2023-10-08 00:29:16,211][50642] Avg episode reward: [(0, '15.820'), (1, '17.030')] [2023-10-08 00:29:16,217][51605] Saving new best policy, reward=15.820! [2023-10-08 00:29:17,215][52059] Updated weights for policy 1, policy_version 15462 (0.0008) [2023-10-08 00:29:17,421][52060] Updated weights for policy 0, policy_version 15270 (0.0008) [2023-10-08 00:29:17,591][52059] Updated weights for policy 1, policy_version 15472 (0.0007) [2023-10-08 00:29:17,796][52060] Updated weights for policy 0, policy_version 15280 (0.0007) [2023-10-08 00:29:17,965][52059] Updated weights for policy 1, policy_version 15482 (0.0008) [2023-10-08 00:29:18,160][52060] Updated weights for policy 0, policy_version 15290 (0.0008) [2023-10-08 00:29:21,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 31522816. Throughput: 0: 1682.0, 1: 1711.9. Samples: 7887060. Policy #0 lag: (min: 31.0, avg: 31.0, max: 35.0) [2023-10-08 00:29:21,211][50642] Avg episode reward: [(0, '14.380'), (1, '15.800')] [2023-10-08 00:29:21,810][52059] Updated weights for policy 1, policy_version 15492 (0.0010) [2023-10-08 00:29:22,166][52059] Updated weights for policy 1, policy_version 15502 (0.0008) [2023-10-08 00:29:22,177][52060] Updated weights for policy 0, policy_version 15300 (0.0008) [2023-10-08 00:29:22,539][52059] Updated weights for policy 1, policy_version 15512 (0.0009) [2023-10-08 00:29:22,542][52060] Updated weights for policy 0, policy_version 15310 (0.0008) [2023-10-08 00:29:22,918][52060] Updated weights for policy 0, policy_version 15320 (0.0009) [2023-10-08 00:29:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 31588352. Throughput: 0: 1711.3, 1: 1733.9. Samples: 7908464. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:29:26,211][50642] Avg episode reward: [(0, '15.110'), (1, '16.420')] [2023-10-08 00:29:26,497][52059] Updated weights for policy 1, policy_version 15522 (0.0007) [2023-10-08 00:29:26,860][52059] Updated weights for policy 1, policy_version 15532 (0.0009) [2023-10-08 00:29:26,942][52060] Updated weights for policy 0, policy_version 15330 (0.0009) [2023-10-08 00:29:27,220][52059] Updated weights for policy 1, policy_version 15542 (0.0009) [2023-10-08 00:29:27,334][52060] Updated weights for policy 0, policy_version 15340 (0.0009) [2023-10-08 00:29:27,584][52059] Updated weights for policy 1, policy_version 15552 (0.0009) [2023-10-08 00:29:27,706][52060] Updated weights for policy 0, policy_version 15350 (0.0007) [2023-10-08 00:29:28,074][52060] Updated weights for policy 0, policy_version 15360 (0.0008) [2023-10-08 00:29:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 31653888. Throughput: 0: 1706.7, 1: 1744.1. Samples: 7929430. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:29:31,211][50642] Avg episode reward: [(0, '16.350'), (1, '16.760')] [2023-10-08 00:29:31,221][51605] Saving new best policy, reward=16.350! [2023-10-08 00:29:31,737][52059] Updated weights for policy 1, policy_version 15562 (0.0009) [2023-10-08 00:29:32,065][52060] Updated weights for policy 0, policy_version 15370 (0.0008) [2023-10-08 00:29:32,102][52059] Updated weights for policy 1, policy_version 15572 (0.0007) [2023-10-08 00:29:32,435][52060] Updated weights for policy 0, policy_version 15380 (0.0010) [2023-10-08 00:29:32,466][52059] Updated weights for policy 1, policy_version 15582 (0.0007) [2023-10-08 00:29:32,811][52060] Updated weights for policy 0, policy_version 15390 (0.0008) [2023-10-08 00:29:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 31719424. Throughput: 0: 1691.9, 1: 1725.9. Samples: 7938768. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:29:36,211][50642] Avg episode reward: [(0, '13.990'), (1, '16.430')] [2023-10-08 00:29:36,458][52059] Updated weights for policy 1, policy_version 15592 (0.0009) [2023-10-08 00:29:36,653][52060] Updated weights for policy 0, policy_version 15400 (0.0008) [2023-10-08 00:29:36,828][52059] Updated weights for policy 1, policy_version 15602 (0.0008) [2023-10-08 00:29:37,020][52060] Updated weights for policy 0, policy_version 15410 (0.0008) [2023-10-08 00:29:37,196][52059] Updated weights for policy 1, policy_version 15612 (0.0008) [2023-10-08 00:29:37,397][52060] Updated weights for policy 0, policy_version 15420 (0.0008) [2023-10-08 00:29:41,111][52059] Updated weights for policy 1, policy_version 15622 (0.0009) [2023-10-08 00:29:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 31784960. Throughput: 0: 1716.2, 1: 1747.8. Samples: 7960174. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:29:41,211][50642] Avg episode reward: [(0, '15.420'), (1, '15.350')] [2023-10-08 00:29:41,475][52059] Updated weights for policy 1, policy_version 15632 (0.0009) [2023-10-08 00:29:41,490][52060] Updated weights for policy 0, policy_version 15430 (0.0009) [2023-10-08 00:29:41,836][52059] Updated weights for policy 1, policy_version 15642 (0.0008) [2023-10-08 00:29:41,855][52060] Updated weights for policy 0, policy_version 15440 (0.0008) [2023-10-08 00:29:42,231][52060] Updated weights for policy 0, policy_version 15450 (0.0008) [2023-10-08 00:29:45,711][52059] Updated weights for policy 1, policy_version 15652 (0.0009) [2023-10-08 00:29:46,070][52059] Updated weights for policy 1, policy_version 15662 (0.0009) [2023-10-08 00:29:46,200][52060] Updated weights for policy 0, policy_version 15460 (0.0008) [2023-10-08 00:29:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 31850496. Throughput: 0: 1717.4, 1: 1732.8. Samples: 7981080. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:29:46,211][50642] Avg episode reward: [(0, '15.050'), (1, '17.660')] [2023-10-08 00:29:46,440][52059] Updated weights for policy 1, policy_version 15672 (0.0008) [2023-10-08 00:29:46,577][52060] Updated weights for policy 0, policy_version 15470 (0.0009) [2023-10-08 00:29:46,955][52060] Updated weights for policy 0, policy_version 15480 (0.0009) [2023-10-08 00:29:50,350][52059] Updated weights for policy 1, policy_version 15682 (0.0009) [2023-10-08 00:29:50,716][52059] Updated weights for policy 1, policy_version 15692 (0.0008) [2023-10-08 00:29:50,988][52060] Updated weights for policy 0, policy_version 15490 (0.0010) [2023-10-08 00:29:51,080][52059] Updated weights for policy 1, policy_version 15702 (0.0007) [2023-10-08 00:29:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 31916032. Throughput: 0: 1709.4, 1: 1736.7. Samples: 7990686. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:29:51,211][50642] Avg episode reward: [(0, '13.570'), (1, '16.230')] [2023-10-08 00:29:51,360][52060] Updated weights for policy 0, policy_version 15500 (0.0007) [2023-10-08 00:29:51,441][52059] Updated weights for policy 1, policy_version 15712 (0.0007) [2023-10-08 00:29:51,724][52060] Updated weights for policy 0, policy_version 15510 (0.0007) [2023-10-08 00:29:52,092][52060] Updated weights for policy 0, policy_version 15520 (0.0007) [2023-10-08 00:29:55,352][52059] Updated weights for policy 1, policy_version 15722 (0.0007) [2023-10-08 00:29:55,725][52059] Updated weights for policy 1, policy_version 15732 (0.0010) [2023-10-08 00:29:56,026][52060] Updated weights for policy 0, policy_version 15530 (0.0008) [2023-10-08 00:29:56,094][52059] Updated weights for policy 1, policy_version 15742 (0.0007) [2023-10-08 00:29:56,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 32014336. Throughput: 0: 1719.0, 1: 1741.6. Samples: 8012300. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:29:56,211][50642] Avg episode reward: [(0, '15.960'), (1, '15.560')] [2023-10-08 00:29:56,386][52060] Updated weights for policy 0, policy_version 15540 (0.0009) [2023-10-08 00:29:56,753][52060] Updated weights for policy 0, policy_version 15550 (0.0009) [2023-10-08 00:29:59,991][52059] Updated weights for policy 1, policy_version 15752 (0.0008) [2023-10-08 00:30:00,358][52059] Updated weights for policy 1, policy_version 15762 (0.0008) [2023-10-08 00:30:00,607][52060] Updated weights for policy 0, policy_version 15560 (0.0008) [2023-10-08 00:30:00,718][52059] Updated weights for policy 1, policy_version 15772 (0.0009) [2023-10-08 00:30:00,971][52060] Updated weights for policy 0, policy_version 15570 (0.0009) [2023-10-08 00:30:01,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 32079872. Throughput: 0: 1707.6, 1: 1711.5. Samples: 8031742. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:30:01,211][50642] Avg episode reward: [(0, '14.780'), (1, '18.000')] [2023-10-08 00:30:01,337][52060] Updated weights for policy 0, policy_version 15580 (0.0009) [2023-10-08 00:30:04,727][52059] Updated weights for policy 1, policy_version 15782 (0.0010) [2023-10-08 00:30:05,115][52059] Updated weights for policy 1, policy_version 15792 (0.0008) [2023-10-08 00:30:05,250][52060] Updated weights for policy 0, policy_version 15590 (0.0007) [2023-10-08 00:30:05,477][52059] Updated weights for policy 1, policy_version 15802 (0.0008) [2023-10-08 00:30:05,621][52060] Updated weights for policy 0, policy_version 15600 (0.0009) [2023-10-08 00:30:05,995][52060] Updated weights for policy 0, policy_version 15610 (0.0008) [2023-10-08 00:30:06,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 32145408. Throughput: 0: 1724.7, 1: 1741.6. Samples: 8043042. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:30:06,211][50642] Avg episode reward: [(0, '14.360'), (1, '16.890')] [2023-10-08 00:30:09,301][52059] Updated weights for policy 1, policy_version 15812 (0.0008) [2023-10-08 00:30:09,660][52059] Updated weights for policy 1, policy_version 15822 (0.0007) [2023-10-08 00:30:10,017][52059] Updated weights for policy 1, policy_version 15832 (0.0007) [2023-10-08 00:30:10,178][52060] Updated weights for policy 0, policy_version 15620 (0.0010) [2023-10-08 00:30:10,550][52060] Updated weights for policy 0, policy_version 15630 (0.0008) [2023-10-08 00:30:10,919][52060] Updated weights for policy 0, policy_version 15640 (0.0009) [2023-10-08 00:30:11,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 32243712. Throughput: 0: 1722.0, 1: 1720.8. Samples: 8063386. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 00:30:11,211][50642] Avg episode reward: [(0, '15.930'), (1, '14.330')] [2023-10-08 00:30:13,818][52059] Updated weights for policy 1, policy_version 15842 (0.0008) [2023-10-08 00:30:14,197][52059] Updated weights for policy 1, policy_version 15852 (0.0010) [2023-10-08 00:30:14,562][52059] Updated weights for policy 1, policy_version 15862 (0.0007) [2023-10-08 00:30:14,821][52060] Updated weights for policy 0, policy_version 15650 (0.0009) [2023-10-08 00:30:14,921][52059] Updated weights for policy 1, policy_version 15872 (0.0008) [2023-10-08 00:30:15,229][52060] Updated weights for policy 0, policy_version 15660 (0.0009) [2023-10-08 00:30:15,595][52060] Updated weights for policy 0, policy_version 15670 (0.0009) [2023-10-08 00:30:15,956][52060] Updated weights for policy 0, policy_version 15680 (0.0010) [2023-10-08 00:30:16,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 32309248. Throughput: 0: 1701.1, 1: 1716.2. Samples: 8083212. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 00:30:16,211][50642] Avg episode reward: [(0, '14.350'), (1, '18.470')] [2023-10-08 00:30:18,912][52059] Updated weights for policy 1, policy_version 15882 (0.0010) [2023-10-08 00:30:19,278][52059] Updated weights for policy 1, policy_version 15892 (0.0008) [2023-10-08 00:30:19,645][52059] Updated weights for policy 1, policy_version 15902 (0.0007) [2023-10-08 00:30:19,961][52060] Updated weights for policy 0, policy_version 15690 (0.0008) [2023-10-08 00:30:20,335][52060] Updated weights for policy 0, policy_version 15700 (0.0008) [2023-10-08 00:30:20,700][52060] Updated weights for policy 0, policy_version 15710 (0.0007) [2023-10-08 00:30:21,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 32374784. Throughput: 0: 1724.9, 1: 1737.5. Samples: 8094578. Policy #0 lag: (min: 1.0, avg: 14.9, max: 33.0) [2023-10-08 00:30:21,211][50642] Avg episode reward: [(0, '14.640'), (1, '15.560')] [2023-10-08 00:30:23,698][52059] Updated weights for policy 1, policy_version 15912 (0.0009) [2023-10-08 00:30:24,058][52059] Updated weights for policy 1, policy_version 15922 (0.0007) [2023-10-08 00:30:24,420][52059] Updated weights for policy 1, policy_version 15932 (0.0009) [2023-10-08 00:30:24,717][52060] Updated weights for policy 0, policy_version 15720 (0.0009) [2023-10-08 00:30:25,094][52060] Updated weights for policy 0, policy_version 15730 (0.0008) [2023-10-08 00:30:25,462][52060] Updated weights for policy 0, policy_version 15740 (0.0008) [2023-10-08 00:30:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 32440320. Throughput: 0: 1712.9, 1: 1715.4. Samples: 8114446. Policy #0 lag: (min: 1.0, avg: 14.9, max: 33.0) [2023-10-08 00:30:26,211][50642] Avg episode reward: [(0, '15.340'), (1, '16.440')] [2023-10-08 00:30:28,123][52059] Updated weights for policy 1, policy_version 15942 (0.0007) [2023-10-08 00:30:28,482][52059] Updated weights for policy 1, policy_version 15952 (0.0008) [2023-10-08 00:30:28,848][52059] Updated weights for policy 1, policy_version 15962 (0.0007) [2023-10-08 00:30:29,407][52060] Updated weights for policy 0, policy_version 15750 (0.0008) [2023-10-08 00:30:29,768][52060] Updated weights for policy 0, policy_version 15760 (0.0009) [2023-10-08 00:30:30,149][52060] Updated weights for policy 0, policy_version 15770 (0.0010) [2023-10-08 00:30:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 32505856. Throughput: 0: 1692.6, 1: 1728.7. Samples: 8135038. Policy #0 lag: (min: 1.0, avg: 14.9, max: 33.0) [2023-10-08 00:30:31,211][50642] Avg episode reward: [(0, '14.190'), (1, '18.140')] [2023-10-08 00:30:32,925][52059] Updated weights for policy 1, policy_version 15972 (0.0008) [2023-10-08 00:30:33,285][52059] Updated weights for policy 1, policy_version 15982 (0.0008) [2023-10-08 00:30:33,653][52059] Updated weights for policy 1, policy_version 15992 (0.0008) [2023-10-08 00:30:34,125][52060] Updated weights for policy 0, policy_version 15780 (0.0009) [2023-10-08 00:30:34,496][52060] Updated weights for policy 0, policy_version 15790 (0.0008) [2023-10-08 00:30:34,869][52060] Updated weights for policy 0, policy_version 15800 (0.0007) [2023-10-08 00:30:36,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 32571392. Throughput: 0: 1727.3, 1: 1722.9. Samples: 8145944. Policy #0 lag: (min: 2.0, avg: 16.5, max: 34.0) [2023-10-08 00:30:36,211][50642] Avg episode reward: [(0, '14.520'), (1, '16.100')] [2023-10-08 00:30:37,543][52059] Updated weights for policy 1, policy_version 16002 (0.0008) [2023-10-08 00:30:37,905][52059] Updated weights for policy 1, policy_version 16012 (0.0010) [2023-10-08 00:30:38,270][52059] Updated weights for policy 1, policy_version 16022 (0.0011) [2023-10-08 00:30:38,641][52059] Updated weights for policy 1, policy_version 16032 (0.0009) [2023-10-08 00:30:38,826][52060] Updated weights for policy 0, policy_version 15810 (0.0007) [2023-10-08 00:30:39,185][52060] Updated weights for policy 0, policy_version 15820 (0.0010) [2023-10-08 00:30:39,553][52060] Updated weights for policy 0, policy_version 15830 (0.0009) [2023-10-08 00:30:39,925][52060] Updated weights for policy 0, policy_version 15840 (0.0010) [2023-10-08 00:30:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 32636928. Throughput: 0: 1697.2, 1: 1722.5. Samples: 8166186. Policy #0 lag: (min: 2.0, avg: 16.5, max: 34.0) [2023-10-08 00:30:41,211][50642] Avg episode reward: [(0, '14.290'), (1, '16.040')] [2023-10-08 00:30:42,623][52059] Updated weights for policy 1, policy_version 16042 (0.0009) [2023-10-08 00:30:42,986][52059] Updated weights for policy 1, policy_version 16052 (0.0010) [2023-10-08 00:30:43,357][52059] Updated weights for policy 1, policy_version 16062 (0.0009) [2023-10-08 00:30:44,085][52060] Updated weights for policy 0, policy_version 15850 (0.0009) [2023-10-08 00:30:44,460][52060] Updated weights for policy 0, policy_version 15860 (0.0009) [2023-10-08 00:30:44,820][52060] Updated weights for policy 0, policy_version 15870 (0.0007) [2023-10-08 00:30:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 32702464. Throughput: 0: 1704.3, 1: 1751.1. Samples: 8187238. Policy #0 lag: (min: 2.0, avg: 16.5, max: 34.0) [2023-10-08 00:30:46,211][50642] Avg episode reward: [(0, '14.220'), (1, '17.080')] [2023-10-08 00:30:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000015872_16252928.pth... [2023-10-08 00:30:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000016064_16449536.pth... [2023-10-08 00:30:46,260][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000014272_14614528.pth [2023-10-08 00:30:46,261][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000014464_14811136.pth [2023-10-08 00:30:46,266][51605] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p0/milestones/checkpoint_000015872_16252928.pth [2023-10-08 00:30:46,266][51710] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p1/milestones/checkpoint_000016064_16449536.pth [2023-10-08 00:30:47,379][52059] Updated weights for policy 1, policy_version 16072 (0.0008) [2023-10-08 00:30:47,740][52059] Updated weights for policy 1, policy_version 16082 (0.0010) [2023-10-08 00:30:48,111][52059] Updated weights for policy 1, policy_version 16092 (0.0009) [2023-10-08 00:30:48,760][52060] Updated weights for policy 0, policy_version 15880 (0.0010) [2023-10-08 00:30:49,127][52060] Updated weights for policy 0, policy_version 15890 (0.0009) [2023-10-08 00:30:49,494][52060] Updated weights for policy 0, policy_version 15900 (0.0007) [2023-10-08 00:30:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 32768000. Throughput: 0: 1706.9, 1: 1721.0. Samples: 8197298. Policy #0 lag: (min: 0.0, avg: 22.8, max: 32.0) [2023-10-08 00:30:51,211][50642] Avg episode reward: [(0, '14.730'), (1, '14.690')] [2023-10-08 00:30:52,024][52059] Updated weights for policy 1, policy_version 16102 (0.0010) [2023-10-08 00:30:52,393][52059] Updated weights for policy 1, policy_version 16112 (0.0010) [2023-10-08 00:30:52,756][52059] Updated weights for policy 1, policy_version 16122 (0.0010) [2023-10-08 00:30:53,414][52060] Updated weights for policy 0, policy_version 15910 (0.0010) [2023-10-08 00:30:53,777][52060] Updated weights for policy 0, policy_version 15920 (0.0008) [2023-10-08 00:30:54,150][52060] Updated weights for policy 0, policy_version 15930 (0.0007) [2023-10-08 00:30:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 32833536. Throughput: 0: 1687.6, 1: 1740.7. Samples: 8217658. Policy #0 lag: (min: 0.0, avg: 22.8, max: 32.0) [2023-10-08 00:30:56,211][50642] Avg episode reward: [(0, '15.300'), (1, '17.580')] [2023-10-08 00:30:56,723][52059] Updated weights for policy 1, policy_version 16132 (0.0011) [2023-10-08 00:30:57,110][52059] Updated weights for policy 1, policy_version 16142 (0.0007) [2023-10-08 00:30:57,473][52059] Updated weights for policy 1, policy_version 16152 (0.0008) [2023-10-08 00:30:58,130][52060] Updated weights for policy 0, policy_version 15940 (0.0008) [2023-10-08 00:30:58,502][52060] Updated weights for policy 0, policy_version 15950 (0.0007) [2023-10-08 00:30:58,872][52060] Updated weights for policy 0, policy_version 15960 (0.0009) [2023-10-08 00:31:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 32899072. Throughput: 0: 1712.5, 1: 1745.7. Samples: 8238830. Policy #0 lag: (min: 0.0, avg: 22.8, max: 32.0) [2023-10-08 00:31:01,210][50642] Avg episode reward: [(0, '14.830'), (1, '17.640')] [2023-10-08 00:31:01,307][52059] Updated weights for policy 1, policy_version 16162 (0.0010) [2023-10-08 00:31:01,666][52059] Updated weights for policy 1, policy_version 16172 (0.0009) [2023-10-08 00:31:02,044][52059] Updated weights for policy 1, policy_version 16182 (0.0009) [2023-10-08 00:31:02,411][52059] Updated weights for policy 1, policy_version 16192 (0.0008) [2023-10-08 00:31:02,763][52060] Updated weights for policy 0, policy_version 15970 (0.0010) [2023-10-08 00:31:03,144][52060] Updated weights for policy 0, policy_version 15980 (0.0008) [2023-10-08 00:31:03,518][52060] Updated weights for policy 0, policy_version 15990 (0.0007) [2023-10-08 00:31:03,885][52060] Updated weights for policy 0, policy_version 16000 (0.0007) [2023-10-08 00:31:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 32964608. Throughput: 0: 1690.5, 1: 1724.0. Samples: 8248228. Policy #0 lag: (min: 31.0, avg: 39.4, max: 63.0) [2023-10-08 00:31:06,211][50642] Avg episode reward: [(0, '14.340'), (1, '16.670')] [2023-10-08 00:31:06,284][52059] Updated weights for policy 1, policy_version 16202 (0.0007) [2023-10-08 00:31:06,655][52059] Updated weights for policy 1, policy_version 16212 (0.0009) [2023-10-08 00:31:07,023][52059] Updated weights for policy 1, policy_version 16222 (0.0009) [2023-10-08 00:31:07,818][52060] Updated weights for policy 0, policy_version 16010 (0.0008) [2023-10-08 00:31:08,192][52060] Updated weights for policy 0, policy_version 16020 (0.0009) [2023-10-08 00:31:08,555][52060] Updated weights for policy 0, policy_version 16030 (0.0010) [2023-10-08 00:31:11,044][52059] Updated weights for policy 1, policy_version 16232 (0.0007) [2023-10-08 00:31:11,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 33030144. Throughput: 0: 1699.5, 1: 1745.2. Samples: 8269458. Policy #0 lag: (min: 31.0, avg: 39.4, max: 63.0) [2023-10-08 00:31:11,211][50642] Avg episode reward: [(0, '14.770'), (1, '17.550')] [2023-10-08 00:31:11,416][52059] Updated weights for policy 1, policy_version 16242 (0.0009) [2023-10-08 00:31:11,779][52059] Updated weights for policy 1, policy_version 16252 (0.0009) [2023-10-08 00:31:12,533][52060] Updated weights for policy 0, policy_version 16040 (0.0010) [2023-10-08 00:31:12,910][52060] Updated weights for policy 0, policy_version 16050 (0.0011) [2023-10-08 00:31:13,273][52060] Updated weights for policy 0, policy_version 16060 (0.0009) [2023-10-08 00:31:15,622][52059] Updated weights for policy 1, policy_version 16262 (0.0009) [2023-10-08 00:31:16,000][52059] Updated weights for policy 1, policy_version 16272 (0.0008) [2023-10-08 00:31:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 33095680. Throughput: 0: 1726.8, 1: 1731.7. Samples: 8290668. Policy #0 lag: (min: 31.0, avg: 39.4, max: 63.0) [2023-10-08 00:31:16,211][50642] Avg episode reward: [(0, '13.940'), (1, '15.860')] [2023-10-08 00:31:16,362][52059] Updated weights for policy 1, policy_version 16282 (0.0008) [2023-10-08 00:31:17,228][52060] Updated weights for policy 0, policy_version 16070 (0.0009) [2023-10-08 00:31:17,599][52060] Updated weights for policy 0, policy_version 16080 (0.0008) [2023-10-08 00:31:17,968][52060] Updated weights for policy 0, policy_version 16090 (0.0008) [2023-10-08 00:31:20,214][52059] Updated weights for policy 1, policy_version 16292 (0.0009) [2023-10-08 00:31:20,584][52059] Updated weights for policy 1, policy_version 16302 (0.0010) [2023-10-08 00:31:20,954][52059] Updated weights for policy 1, policy_version 16312 (0.0008) [2023-10-08 00:31:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 33161216. Throughput: 0: 1693.5, 1: 1744.4. Samples: 8300650. Policy #0 lag: (min: 31.0, avg: 34.6, max: 63.0) [2023-10-08 00:31:21,211][50642] Avg episode reward: [(0, '15.300'), (1, '16.450')] [2023-10-08 00:31:21,854][52060] Updated weights for policy 0, policy_version 16100 (0.0009) [2023-10-08 00:31:22,221][52060] Updated weights for policy 0, policy_version 16110 (0.0009) [2023-10-08 00:31:22,586][52060] Updated weights for policy 0, policy_version 16120 (0.0012) [2023-10-08 00:31:25,081][52059] Updated weights for policy 1, policy_version 16322 (0.0007) [2023-10-08 00:31:25,448][52059] Updated weights for policy 1, policy_version 16332 (0.0008) [2023-10-08 00:31:25,812][52059] Updated weights for policy 1, policy_version 16342 (0.0008) [2023-10-08 00:31:26,182][52059] Updated weights for policy 1, policy_version 16352 (0.0007) [2023-10-08 00:31:26,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 33259520. Throughput: 0: 1724.7, 1: 1735.2. Samples: 8321882. Policy #0 lag: (min: 31.0, avg: 34.6, max: 63.0) [2023-10-08 00:31:26,211][50642] Avg episode reward: [(0, '14.950'), (1, '17.350')] [2023-10-08 00:31:26,514][52060] Updated weights for policy 0, policy_version 16130 (0.0010) [2023-10-08 00:31:26,882][52060] Updated weights for policy 0, policy_version 16140 (0.0009) [2023-10-08 00:31:27,247][52060] Updated weights for policy 0, policy_version 16150 (0.0007) [2023-10-08 00:31:27,613][52060] Updated weights for policy 0, policy_version 16160 (0.0009) [2023-10-08 00:31:30,151][52059] Updated weights for policy 1, policy_version 16362 (0.0009) [2023-10-08 00:31:30,512][52059] Updated weights for policy 1, policy_version 16372 (0.0007) [2023-10-08 00:31:30,882][52059] Updated weights for policy 1, policy_version 16382 (0.0009) [2023-10-08 00:31:31,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 33325056. Throughput: 0: 1726.9, 1: 1708.8. Samples: 8341848. Policy #0 lag: (min: 31.0, avg: 34.6, max: 63.0) [2023-10-08 00:31:31,211][50642] Avg episode reward: [(0, '14.210'), (1, '17.060')] [2023-10-08 00:31:31,590][52060] Updated weights for policy 0, policy_version 16170 (0.0007) [2023-10-08 00:31:31,957][52060] Updated weights for policy 0, policy_version 16180 (0.0009) [2023-10-08 00:31:32,328][52060] Updated weights for policy 0, policy_version 16190 (0.0007) [2023-10-08 00:31:34,599][52059] Updated weights for policy 1, policy_version 16392 (0.0007) [2023-10-08 00:31:34,964][52059] Updated weights for policy 1, policy_version 16402 (0.0007) [2023-10-08 00:31:35,330][52059] Updated weights for policy 1, policy_version 16412 (0.0009) [2023-10-08 00:31:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 33390592. Throughput: 0: 1703.8, 1: 1745.2. Samples: 8352504. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:31:36,211][50642] Avg episode reward: [(0, '15.220'), (1, '17.800')] [2023-10-08 00:31:36,371][52060] Updated weights for policy 0, policy_version 16200 (0.0008) [2023-10-08 00:31:36,736][52060] Updated weights for policy 0, policy_version 16210 (0.0008) [2023-10-08 00:31:37,106][52060] Updated weights for policy 0, policy_version 16220 (0.0008) [2023-10-08 00:31:39,199][52059] Updated weights for policy 1, policy_version 16422 (0.0008) [2023-10-08 00:31:39,559][52059] Updated weights for policy 1, policy_version 16432 (0.0010) [2023-10-08 00:31:39,925][52059] Updated weights for policy 1, policy_version 16442 (0.0007) [2023-10-08 00:31:41,088][52060] Updated weights for policy 0, policy_version 16230 (0.0009) [2023-10-08 00:31:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 33456128. Throughput: 0: 1723.7, 1: 1730.9. Samples: 8373118. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:31:41,211][50642] Avg episode reward: [(0, '14.420'), (1, '17.160')] [2023-10-08 00:31:41,463][52060] Updated weights for policy 0, policy_version 16240 (0.0009) [2023-10-08 00:31:41,838][52060] Updated weights for policy 0, policy_version 16250 (0.0009) [2023-10-08 00:31:43,932][52059] Updated weights for policy 1, policy_version 16452 (0.0009) [2023-10-08 00:31:44,332][52059] Updated weights for policy 1, policy_version 16462 (0.0007) [2023-10-08 00:31:44,702][52059] Updated weights for policy 1, policy_version 16472 (0.0007) [2023-10-08 00:31:45,615][52060] Updated weights for policy 0, policy_version 16260 (0.0007) [2023-10-08 00:31:45,977][52060] Updated weights for policy 0, policy_version 16270 (0.0007) [2023-10-08 00:31:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 33521664. Throughput: 0: 1722.5, 1: 1720.5. Samples: 8393764. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:31:46,211][50642] Avg episode reward: [(0, '14.530'), (1, '16.900')] [2023-10-08 00:31:46,357][52060] Updated weights for policy 0, policy_version 16280 (0.0008) [2023-10-08 00:31:48,549][52059] Updated weights for policy 1, policy_version 16482 (0.0011) [2023-10-08 00:31:48,912][52059] Updated weights for policy 1, policy_version 16492 (0.0010) [2023-10-08 00:31:49,271][52059] Updated weights for policy 1, policy_version 16502 (0.0008) [2023-10-08 00:31:49,633][52059] Updated weights for policy 1, policy_version 16512 (0.0009) [2023-10-08 00:31:50,460][52060] Updated weights for policy 0, policy_version 16290 (0.0008) [2023-10-08 00:31:50,858][52060] Updated weights for policy 0, policy_version 16300 (0.0007) [2023-10-08 00:31:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 33587200. Throughput: 0: 1731.8, 1: 1742.0. Samples: 8404548. Policy #0 lag: (min: 27.0, avg: 32.1, max: 59.0) [2023-10-08 00:31:51,211][50642] Avg episode reward: [(0, '16.150'), (1, '17.640')] [2023-10-08 00:31:51,222][52060] Updated weights for policy 0, policy_version 16310 (0.0008) [2023-10-08 00:31:51,585][52060] Updated weights for policy 0, policy_version 16320 (0.0007) [2023-10-08 00:31:53,467][52059] Updated weights for policy 1, policy_version 16522 (0.0007) [2023-10-08 00:31:53,830][52059] Updated weights for policy 1, policy_version 16532 (0.0007) [2023-10-08 00:31:54,191][52059] Updated weights for policy 1, policy_version 16542 (0.0007) [2023-10-08 00:31:55,525][52060] Updated weights for policy 0, policy_version 16330 (0.0007) [2023-10-08 00:31:55,887][52060] Updated weights for policy 0, policy_version 16340 (0.0011) [2023-10-08 00:31:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 33652736. Throughput: 0: 1731.1, 1: 1725.4. Samples: 8424998. Policy #0 lag: (min: 27.0, avg: 32.1, max: 59.0) [2023-10-08 00:31:56,211][50642] Avg episode reward: [(0, '13.990'), (1, '15.950')] [2023-10-08 00:31:56,263][52060] Updated weights for policy 0, policy_version 16350 (0.0008) [2023-10-08 00:31:58,136][52059] Updated weights for policy 1, policy_version 16552 (0.0008) [2023-10-08 00:31:58,492][52059] Updated weights for policy 1, policy_version 16562 (0.0010) [2023-10-08 00:31:58,859][52059] Updated weights for policy 1, policy_version 16572 (0.0008) [2023-10-08 00:32:00,288][52060] Updated weights for policy 0, policy_version 16360 (0.0008) [2023-10-08 00:32:00,648][52060] Updated weights for policy 0, policy_version 16370 (0.0008) [2023-10-08 00:32:01,029][52060] Updated weights for policy 0, policy_version 16380 (0.0010) [2023-10-08 00:32:01,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 33751040. Throughput: 0: 1704.7, 1: 1735.0. Samples: 8445454. Policy #0 lag: (min: 8.0, avg: 30.8, max: 40.0) [2023-10-08 00:32:01,211][50642] Avg episode reward: [(0, '15.760'), (1, '16.600')] [2023-10-08 00:32:02,832][52059] Updated weights for policy 1, policy_version 16582 (0.0008) [2023-10-08 00:32:03,205][52059] Updated weights for policy 1, policy_version 16592 (0.0007) [2023-10-08 00:32:03,561][52059] Updated weights for policy 1, policy_version 16602 (0.0007) [2023-10-08 00:32:04,926][52060] Updated weights for policy 0, policy_version 16390 (0.0009) [2023-10-08 00:32:05,295][52060] Updated weights for policy 0, policy_version 16400 (0.0009) [2023-10-08 00:32:05,656][52060] Updated weights for policy 0, policy_version 16410 (0.0011) [2023-10-08 00:32:06,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 33816576. Throughput: 0: 1725.3, 1: 1719.7. Samples: 8455672. Policy #0 lag: (min: 8.0, avg: 30.8, max: 40.0) [2023-10-08 00:32:06,211][50642] Avg episode reward: [(0, '15.330'), (1, '16.980')] [2023-10-08 00:32:07,426][52059] Updated weights for policy 1, policy_version 16612 (0.0008) [2023-10-08 00:32:07,797][52059] Updated weights for policy 1, policy_version 16622 (0.0009) [2023-10-08 00:32:08,169][52059] Updated weights for policy 1, policy_version 16632 (0.0009) [2023-10-08 00:32:09,778][52060] Updated weights for policy 0, policy_version 16420 (0.0007) [2023-10-08 00:32:10,150][52060] Updated weights for policy 0, policy_version 16430 (0.0007) [2023-10-08 00:32:10,515][52060] Updated weights for policy 0, policy_version 16440 (0.0008) [2023-10-08 00:32:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 33882112. Throughput: 0: 1715.4, 1: 1728.8. Samples: 8476872. Policy #0 lag: (min: 8.0, avg: 30.8, max: 40.0) [2023-10-08 00:32:11,211][50642] Avg episode reward: [(0, '14.100'), (1, '18.090')] [2023-10-08 00:32:12,093][52059] Updated weights for policy 1, policy_version 16642 (0.0011) [2023-10-08 00:32:12,463][52059] Updated weights for policy 1, policy_version 16652 (0.0009) [2023-10-08 00:32:12,833][52059] Updated weights for policy 1, policy_version 16662 (0.0009) [2023-10-08 00:32:13,203][52059] Updated weights for policy 1, policy_version 16672 (0.0009) [2023-10-08 00:32:14,432][52060] Updated weights for policy 0, policy_version 16450 (0.0009) [2023-10-08 00:32:14,803][52060] Updated weights for policy 0, policy_version 16460 (0.0008) [2023-10-08 00:32:15,177][52060] Updated weights for policy 0, policy_version 16470 (0.0008) [2023-10-08 00:32:15,538][52060] Updated weights for policy 0, policy_version 16480 (0.0011) [2023-10-08 00:32:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 33947648. Throughput: 0: 1696.6, 1: 1759.4. Samples: 8497366. Policy #0 lag: (min: 6.0, avg: 15.4, max: 38.0) [2023-10-08 00:32:16,211][50642] Avg episode reward: [(0, '16.160'), (1, '16.870')] [2023-10-08 00:32:17,021][52059] Updated weights for policy 1, policy_version 16682 (0.0008) [2023-10-08 00:32:17,387][52059] Updated weights for policy 1, policy_version 16692 (0.0008) [2023-10-08 00:32:17,750][52059] Updated weights for policy 1, policy_version 16702 (0.0008) [2023-10-08 00:32:19,305][52060] Updated weights for policy 0, policy_version 16490 (0.0007) [2023-10-08 00:32:19,672][52060] Updated weights for policy 0, policy_version 16500 (0.0009) [2023-10-08 00:32:20,042][52060] Updated weights for policy 0, policy_version 16510 (0.0009) [2023-10-08 00:32:21,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 34013184. Throughput: 0: 1735.1, 1: 1726.3. Samples: 8508264. Policy #0 lag: (min: 6.0, avg: 15.4, max: 38.0) [2023-10-08 00:32:21,211][50642] Avg episode reward: [(0, '13.350'), (1, '15.780')] [2023-10-08 00:32:21,776][52059] Updated weights for policy 1, policy_version 16712 (0.0008) [2023-10-08 00:32:22,145][52059] Updated weights for policy 1, policy_version 16722 (0.0008) [2023-10-08 00:32:22,513][52059] Updated weights for policy 1, policy_version 16732 (0.0008) [2023-10-08 00:32:23,973][52060] Updated weights for policy 0, policy_version 16520 (0.0009) [2023-10-08 00:32:24,343][52060] Updated weights for policy 0, policy_version 16530 (0.0009) [2023-10-08 00:32:24,713][52060] Updated weights for policy 0, policy_version 16540 (0.0009) [2023-10-08 00:32:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 34078720. Throughput: 0: 1709.7, 1: 1742.6. Samples: 8528474. Policy #0 lag: (min: 6.0, avg: 15.4, max: 38.0) [2023-10-08 00:32:26,211][50642] Avg episode reward: [(0, '14.920'), (1, '17.860')] [2023-10-08 00:32:26,359][52059] Updated weights for policy 1, policy_version 16742 (0.0010) [2023-10-08 00:32:26,729][52059] Updated weights for policy 1, policy_version 16752 (0.0011) [2023-10-08 00:32:27,088][52059] Updated weights for policy 1, policy_version 16762 (0.0007) [2023-10-08 00:32:28,711][52060] Updated weights for policy 0, policy_version 16550 (0.0008) [2023-10-08 00:32:29,081][52060] Updated weights for policy 0, policy_version 16560 (0.0009) [2023-10-08 00:32:29,440][52060] Updated weights for policy 0, policy_version 16570 (0.0008) [2023-10-08 00:32:30,928][52059] Updated weights for policy 1, policy_version 16772 (0.0009) [2023-10-08 00:32:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 34144256. Throughput: 0: 1708.3, 1: 1753.5. Samples: 8549546. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) [2023-10-08 00:32:31,211][50642] Avg episode reward: [(0, '15.260'), (1, '16.170')] [2023-10-08 00:32:31,332][52059] Updated weights for policy 1, policy_version 16782 (0.0009) [2023-10-08 00:32:31,696][52059] Updated weights for policy 1, policy_version 16792 (0.0009) [2023-10-08 00:32:33,529][52060] Updated weights for policy 0, policy_version 16580 (0.0008) [2023-10-08 00:32:33,902][52060] Updated weights for policy 0, policy_version 16590 (0.0008) [2023-10-08 00:32:34,261][52060] Updated weights for policy 0, policy_version 16600 (0.0008) [2023-10-08 00:32:35,727][52059] Updated weights for policy 1, policy_version 16802 (0.0009) [2023-10-08 00:32:36,085][52059] Updated weights for policy 1, policy_version 16812 (0.0010) [2023-10-08 00:32:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 34209792. Throughput: 0: 1713.8, 1: 1731.9. Samples: 8559604. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) [2023-10-08 00:32:36,211][50642] Avg episode reward: [(0, '14.620'), (1, '17.000')] [2023-10-08 00:32:36,446][52059] Updated weights for policy 1, policy_version 16822 (0.0011) [2023-10-08 00:32:36,814][52059] Updated weights for policy 1, policy_version 16832 (0.0010) [2023-10-08 00:32:38,196][52060] Updated weights for policy 0, policy_version 16610 (0.0009) [2023-10-08 00:32:38,569][52060] Updated weights for policy 0, policy_version 16620 (0.0010) [2023-10-08 00:32:38,933][52060] Updated weights for policy 0, policy_version 16630 (0.0009) [2023-10-08 00:32:39,303][52060] Updated weights for policy 0, policy_version 16640 (0.0011) [2023-10-08 00:32:40,884][52059] Updated weights for policy 1, policy_version 16842 (0.0011) [2023-10-08 00:32:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 34275328. Throughput: 0: 1699.3, 1: 1749.6. Samples: 8580198. Policy #0 lag: (min: 9.0, avg: 9.0, max: 9.0) [2023-10-08 00:32:41,211][50642] Avg episode reward: [(0, '15.320'), (1, '18.620')] [2023-10-08 00:32:41,253][52059] Updated weights for policy 1, policy_version 16852 (0.0009) [2023-10-08 00:32:41,618][52059] Updated weights for policy 1, policy_version 16862 (0.0007) [2023-10-08 00:32:43,154][52060] Updated weights for policy 0, policy_version 16650 (0.0009) [2023-10-08 00:32:43,520][52060] Updated weights for policy 0, policy_version 16660 (0.0007) [2023-10-08 00:32:43,895][52060] Updated weights for policy 0, policy_version 16670 (0.0009) [2023-10-08 00:32:45,326][52059] Updated weights for policy 1, policy_version 16872 (0.0010) [2023-10-08 00:32:45,692][52059] Updated weights for policy 1, policy_version 16882 (0.0010) [2023-10-08 00:32:46,054][52059] Updated weights for policy 1, policy_version 16892 (0.0011) [2023-10-08 00:32:46,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 34373632. Throughput: 0: 1721.0, 1: 1736.5. Samples: 8601040. Policy #0 lag: (min: 23.0, avg: 23.7, max: 41.0) [2023-10-08 00:32:46,211][50642] Avg episode reward: [(0, '13.880'), (1, '17.690')] [2023-10-08 00:32:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000016896_17301504.pth... [2023-10-08 00:32:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000016672_17072128.pth... [2023-10-08 00:32:46,249][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000015264_15630336.pth [2023-10-08 00:32:46,254][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000015072_15433728.pth [2023-10-08 00:32:47,945][52060] Updated weights for policy 0, policy_version 16680 (0.0010) [2023-10-08 00:32:48,299][52060] Updated weights for policy 0, policy_version 16690 (0.0011) [2023-10-08 00:32:48,668][52060] Updated weights for policy 0, policy_version 16700 (0.0011) [2023-10-08 00:32:49,829][52059] Updated weights for policy 1, policy_version 16902 (0.0009) [2023-10-08 00:32:50,200][52059] Updated weights for policy 1, policy_version 16912 (0.0010) [2023-10-08 00:32:50,554][52059] Updated weights for policy 1, policy_version 16922 (0.0008) [2023-10-08 00:32:51,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 34439168. Throughput: 0: 1701.1, 1: 1760.3. Samples: 8611440. Policy #0 lag: (min: 23.0, avg: 23.7, max: 41.0) [2023-10-08 00:32:51,211][50642] Avg episode reward: [(0, '15.330'), (1, '16.700')] [2023-10-08 00:32:52,579][52060] Updated weights for policy 0, policy_version 16710 (0.0009) [2023-10-08 00:32:52,958][52060] Updated weights for policy 0, policy_version 16720 (0.0008) [2023-10-08 00:32:53,322][52060] Updated weights for policy 0, policy_version 16730 (0.0007) [2023-10-08 00:32:54,317][52059] Updated weights for policy 1, policy_version 16932 (0.0009) [2023-10-08 00:32:54,675][52059] Updated weights for policy 1, policy_version 16942 (0.0007) [2023-10-08 00:32:55,036][52059] Updated weights for policy 1, policy_version 16952 (0.0007) [2023-10-08 00:32:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 34504704. Throughput: 0: 1706.7, 1: 1744.2. Samples: 8632160. Policy #0 lag: (min: 23.0, avg: 23.7, max: 41.0) [2023-10-08 00:32:56,211][50642] Avg episode reward: [(0, '13.620'), (1, '18.040')] [2023-10-08 00:32:57,348][52060] Updated weights for policy 0, policy_version 16740 (0.0008) [2023-10-08 00:32:57,720][52060] Updated weights for policy 0, policy_version 16750 (0.0008) [2023-10-08 00:32:58,101][52060] Updated weights for policy 0, policy_version 16760 (0.0009) [2023-10-08 00:32:59,019][52059] Updated weights for policy 1, policy_version 16962 (0.0008) [2023-10-08 00:32:59,384][52059] Updated weights for policy 1, policy_version 16972 (0.0009) [2023-10-08 00:32:59,753][52059] Updated weights for policy 1, policy_version 16982 (0.0009) [2023-10-08 00:33:00,115][52059] Updated weights for policy 1, policy_version 16992 (0.0010) [2023-10-08 00:33:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 34570240. Throughput: 0: 1725.5, 1: 1726.3. Samples: 8652696. Policy #0 lag: (min: 31.0, avg: 31.1, max: 39.0) [2023-10-08 00:33:01,211][50642] Avg episode reward: [(0, '14.320'), (1, '16.490')] [2023-10-08 00:33:02,233][52060] Updated weights for policy 0, policy_version 16770 (0.0008) [2023-10-08 00:33:02,601][52060] Updated weights for policy 0, policy_version 16780 (0.0008) [2023-10-08 00:33:02,976][52060] Updated weights for policy 0, policy_version 16790 (0.0010) [2023-10-08 00:33:03,348][52060] Updated weights for policy 0, policy_version 16800 (0.0011) [2023-10-08 00:33:04,044][52059] Updated weights for policy 1, policy_version 17002 (0.0007) [2023-10-08 00:33:04,407][52059] Updated weights for policy 1, policy_version 17012 (0.0007) [2023-10-08 00:33:04,777][52059] Updated weights for policy 1, policy_version 17022 (0.0007) [2023-10-08 00:33:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 34635776. Throughput: 0: 1686.0, 1: 1749.7. Samples: 8662872. Policy #0 lag: (min: 31.0, avg: 31.1, max: 39.0) [2023-10-08 00:33:06,211][50642] Avg episode reward: [(0, '15.100'), (1, '16.600')] [2023-10-08 00:33:07,372][52060] Updated weights for policy 0, policy_version 16810 (0.0007) [2023-10-08 00:33:07,740][52060] Updated weights for policy 0, policy_version 16820 (0.0008) [2023-10-08 00:33:08,105][52060] Updated weights for policy 0, policy_version 16830 (0.0009) [2023-10-08 00:33:08,725][52059] Updated weights for policy 1, policy_version 17032 (0.0008) [2023-10-08 00:33:09,098][52059] Updated weights for policy 1, policy_version 17042 (0.0010) [2023-10-08 00:33:09,462][52059] Updated weights for policy 1, policy_version 17052 (0.0008) [2023-10-08 00:33:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 34701312. Throughput: 0: 1712.2, 1: 1724.6. Samples: 8683128. Policy #0 lag: (min: 31.0, avg: 31.1, max: 39.0) [2023-10-08 00:33:11,211][50642] Avg episode reward: [(0, '14.430'), (1, '17.280')] [2023-10-08 00:33:12,283][52060] Updated weights for policy 0, policy_version 16840 (0.0010) [2023-10-08 00:33:12,654][52060] Updated weights for policy 0, policy_version 16850 (0.0010) [2023-10-08 00:33:13,015][52060] Updated weights for policy 0, policy_version 16860 (0.0010) [2023-10-08 00:33:13,372][52059] Updated weights for policy 1, policy_version 17062 (0.0009) [2023-10-08 00:33:13,732][52059] Updated weights for policy 1, policy_version 17072 (0.0010) [2023-10-08 00:33:14,096][52059] Updated weights for policy 1, policy_version 17082 (0.0010) [2023-10-08 00:33:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 34766848. Throughput: 0: 1709.3, 1: 1725.1. Samples: 8704096. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:33:16,211][50642] Avg episode reward: [(0, '15.260'), (1, '15.710')] [2023-10-08 00:33:16,974][52060] Updated weights for policy 0, policy_version 16870 (0.0010) [2023-10-08 00:33:17,343][52060] Updated weights for policy 0, policy_version 16880 (0.0010) [2023-10-08 00:33:17,712][52060] Updated weights for policy 0, policy_version 16890 (0.0009) [2023-10-08 00:33:18,125][52059] Updated weights for policy 1, policy_version 17092 (0.0007) [2023-10-08 00:33:18,534][52059] Updated weights for policy 1, policy_version 17102 (0.0009) [2023-10-08 00:33:18,901][52059] Updated weights for policy 1, policy_version 17112 (0.0008) [2023-10-08 00:33:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 34832384. Throughput: 0: 1694.7, 1: 1739.1. Samples: 8714126. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:33:21,211][50642] Avg episode reward: [(0, '14.520'), (1, '18.330')] [2023-10-08 00:33:21,673][52060] Updated weights for policy 0, policy_version 16900 (0.0008) [2023-10-08 00:33:22,051][52060] Updated weights for policy 0, policy_version 16910 (0.0009) [2023-10-08 00:33:22,421][52060] Updated weights for policy 0, policy_version 16920 (0.0009) [2023-10-08 00:33:22,763][52059] Updated weights for policy 1, policy_version 17122 (0.0008) [2023-10-08 00:33:23,117][52059] Updated weights for policy 1, policy_version 17132 (0.0009) [2023-10-08 00:33:23,480][52059] Updated weights for policy 1, policy_version 17142 (0.0009) [2023-10-08 00:33:23,846][52059] Updated weights for policy 1, policy_version 17152 (0.0010) [2023-10-08 00:33:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 34897920. Throughput: 0: 1712.8, 1: 1731.2. Samples: 8735182. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:33:26,211][50642] Avg episode reward: [(0, '15.200'), (1, '18.210')] [2023-10-08 00:33:26,377][52060] Updated weights for policy 0, policy_version 16930 (0.0008) [2023-10-08 00:33:26,778][52060] Updated weights for policy 0, policy_version 16940 (0.0008) [2023-10-08 00:33:27,151][52060] Updated weights for policy 0, policy_version 16950 (0.0009) [2023-10-08 00:33:27,519][52060] Updated weights for policy 0, policy_version 16960 (0.0009) [2023-10-08 00:33:27,768][52059] Updated weights for policy 1, policy_version 17162 (0.0008) [2023-10-08 00:33:28,137][52059] Updated weights for policy 1, policy_version 17172 (0.0008) [2023-10-08 00:33:28,506][52059] Updated weights for policy 1, policy_version 17182 (0.0009) [2023-10-08 00:33:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 34963456. Throughput: 0: 1710.3, 1: 1742.4. Samples: 8756408. Policy #0 lag: (min: 12.0, avg: 12.1, max: 18.0) [2023-10-08 00:33:31,210][50642] Avg episode reward: [(0, '14.470'), (1, '17.290')] [2023-10-08 00:33:31,372][52060] Updated weights for policy 0, policy_version 16970 (0.0011) [2023-10-08 00:33:31,746][52060] Updated weights for policy 0, policy_version 16980 (0.0009) [2023-10-08 00:33:32,108][52060] Updated weights for policy 0, policy_version 16990 (0.0009) [2023-10-08 00:33:32,477][52059] Updated weights for policy 1, policy_version 17192 (0.0010) [2023-10-08 00:33:32,839][52059] Updated weights for policy 1, policy_version 17202 (0.0007) [2023-10-08 00:33:33,204][52059] Updated weights for policy 1, policy_version 17212 (0.0008) [2023-10-08 00:33:35,998][52060] Updated weights for policy 0, policy_version 17000 (0.0009) [2023-10-08 00:33:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 35028992. Throughput: 0: 1712.7, 1: 1719.0. Samples: 8765868. Policy #0 lag: (min: 12.0, avg: 12.1, max: 18.0) [2023-10-08 00:33:36,211][50642] Avg episode reward: [(0, '15.430'), (1, '20.280')] [2023-10-08 00:33:36,212][51710] Saving new best policy, reward=20.280! [2023-10-08 00:33:36,366][52060] Updated weights for policy 0, policy_version 17010 (0.0008) [2023-10-08 00:33:36,725][52060] Updated weights for policy 0, policy_version 17020 (0.0008) [2023-10-08 00:33:37,170][52059] Updated weights for policy 1, policy_version 17222 (0.0008) [2023-10-08 00:33:37,536][52059] Updated weights for policy 1, policy_version 17232 (0.0010) [2023-10-08 00:33:37,905][52059] Updated weights for policy 1, policy_version 17242 (0.0010) [2023-10-08 00:33:40,707][52060] Updated weights for policy 0, policy_version 17030 (0.0009) [2023-10-08 00:33:41,076][52060] Updated weights for policy 0, policy_version 17040 (0.0010) [2023-10-08 00:33:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 35094528. Throughput: 0: 1719.2, 1: 1732.0. Samples: 8787462. Policy #0 lag: (min: 12.0, avg: 12.1, max: 18.0) [2023-10-08 00:33:41,211][50642] Avg episode reward: [(0, '15.300'), (1, '16.600')] [2023-10-08 00:33:41,449][52060] Updated weights for policy 0, policy_version 17050 (0.0009) [2023-10-08 00:33:41,724][52059] Updated weights for policy 1, policy_version 17252 (0.0009) [2023-10-08 00:33:42,085][52059] Updated weights for policy 1, policy_version 17262 (0.0010) [2023-10-08 00:33:42,455][52059] Updated weights for policy 1, policy_version 17272 (0.0008) [2023-10-08 00:33:45,192][52060] Updated weights for policy 0, policy_version 17060 (0.0007) [2023-10-08 00:33:45,557][52060] Updated weights for policy 0, policy_version 17070 (0.0009) [2023-10-08 00:33:45,927][52060] Updated weights for policy 0, policy_version 17080 (0.0007) [2023-10-08 00:33:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 35160064. Throughput: 0: 1710.8, 1: 1750.3. Samples: 8808446. Policy #0 lag: (min: 23.0, avg: 25.7, max: 55.0) [2023-10-08 00:33:46,211][50642] Avg episode reward: [(0, '15.110'), (1, '15.930')] [2023-10-08 00:33:46,440][52059] Updated weights for policy 1, policy_version 17282 (0.0008) [2023-10-08 00:33:46,808][52059] Updated weights for policy 1, policy_version 17292 (0.0009) [2023-10-08 00:33:47,176][52059] Updated weights for policy 1, policy_version 17302 (0.0009) [2023-10-08 00:33:47,547][52059] Updated weights for policy 1, policy_version 17312 (0.0008) [2023-10-08 00:33:49,855][52060] Updated weights for policy 0, policy_version 17090 (0.0009) [2023-10-08 00:33:50,228][52060] Updated weights for policy 0, policy_version 17100 (0.0011) [2023-10-08 00:33:50,603][52060] Updated weights for policy 0, policy_version 17110 (0.0009) [2023-10-08 00:33:50,964][52060] Updated weights for policy 0, policy_version 17120 (0.0009) [2023-10-08 00:33:51,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 35258368. Throughput: 0: 1739.8, 1: 1725.8. Samples: 8818826. Policy #0 lag: (min: 23.0, avg: 25.7, max: 55.0) [2023-10-08 00:33:51,211][50642] Avg episode reward: [(0, '16.210'), (1, '18.260')] [2023-10-08 00:33:51,451][52059] Updated weights for policy 1, policy_version 17322 (0.0009) [2023-10-08 00:33:51,815][52059] Updated weights for policy 1, policy_version 17332 (0.0008) [2023-10-08 00:33:52,181][52059] Updated weights for policy 1, policy_version 17342 (0.0008) [2023-10-08 00:33:54,921][52060] Updated weights for policy 0, policy_version 17130 (0.0007) [2023-10-08 00:33:55,288][52060] Updated weights for policy 0, policy_version 17140 (0.0007) [2023-10-08 00:33:55,661][52060] Updated weights for policy 0, policy_version 17150 (0.0010) [2023-10-08 00:33:56,157][52059] Updated weights for policy 1, policy_version 17352 (0.0007) [2023-10-08 00:33:56,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 35323904. Throughput: 0: 1734.6, 1: 1745.4. Samples: 8839728. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:33:56,211][50642] Avg episode reward: [(0, '14.290'), (1, '15.140')] [2023-10-08 00:33:56,525][52059] Updated weights for policy 1, policy_version 17362 (0.0009) [2023-10-08 00:33:56,896][52059] Updated weights for policy 1, policy_version 17372 (0.0011) [2023-10-08 00:33:59,545][52060] Updated weights for policy 0, policy_version 17160 (0.0008) [2023-10-08 00:33:59,926][52060] Updated weights for policy 0, policy_version 17170 (0.0008) [2023-10-08 00:34:00,290][52060] Updated weights for policy 0, policy_version 17180 (0.0010) [2023-10-08 00:34:00,680][52059] Updated weights for policy 1, policy_version 17382 (0.0009) [2023-10-08 00:34:01,043][52059] Updated weights for policy 1, policy_version 17392 (0.0008) [2023-10-08 00:34:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 35389440. Throughput: 0: 1717.6, 1: 1741.0. Samples: 8859736. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:34:01,211][50642] Avg episode reward: [(0, '15.630'), (1, '17.660')] [2023-10-08 00:34:01,418][52059] Updated weights for policy 1, policy_version 17402 (0.0008) [2023-10-08 00:34:04,253][52060] Updated weights for policy 0, policy_version 17190 (0.0009) [2023-10-08 00:34:04,624][52060] Updated weights for policy 0, policy_version 17200 (0.0010) [2023-10-08 00:34:05,001][52060] Updated weights for policy 0, policy_version 17210 (0.0008) [2023-10-08 00:34:05,364][52059] Updated weights for policy 1, policy_version 17412 (0.0009) [2023-10-08 00:34:05,762][52059] Updated weights for policy 1, policy_version 17422 (0.0009) [2023-10-08 00:34:06,129][52059] Updated weights for policy 1, policy_version 17432 (0.0007) [2023-10-08 00:34:06,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 35454976. Throughput: 0: 1745.2, 1: 1738.6. Samples: 8870898. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:34:06,211][50642] Avg episode reward: [(0, '14.330'), (1, '19.010')] [2023-10-08 00:34:08,856][52060] Updated weights for policy 0, policy_version 17220 (0.0009) [2023-10-08 00:34:09,220][52060] Updated weights for policy 0, policy_version 17230 (0.0008) [2023-10-08 00:34:09,593][52060] Updated weights for policy 0, policy_version 17240 (0.0009) [2023-10-08 00:34:09,975][52059] Updated weights for policy 1, policy_version 17442 (0.0008) [2023-10-08 00:34:10,336][52059] Updated weights for policy 1, policy_version 17452 (0.0008) [2023-10-08 00:34:10,703][52059] Updated weights for policy 1, policy_version 17462 (0.0008) [2023-10-08 00:34:11,072][52059] Updated weights for policy 1, policy_version 17472 (0.0009) [2023-10-08 00:34:11,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 35553280. Throughput: 0: 1719.6, 1: 1744.0. Samples: 8891044. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) [2023-10-08 00:34:11,211][50642] Avg episode reward: [(0, '14.860'), (1, '17.360')] [2023-10-08 00:34:13,587][52060] Updated weights for policy 0, policy_version 17250 (0.0008) [2023-10-08 00:34:13,990][52060] Updated weights for policy 0, policy_version 17260 (0.0010) [2023-10-08 00:34:14,361][52060] Updated weights for policy 0, policy_version 17270 (0.0007) [2023-10-08 00:34:14,732][52060] Updated weights for policy 0, policy_version 17280 (0.0007) [2023-10-08 00:34:14,924][52059] Updated weights for policy 1, policy_version 17482 (0.0010) [2023-10-08 00:34:15,297][52059] Updated weights for policy 1, policy_version 17492 (0.0010) [2023-10-08 00:34:15,672][52059] Updated weights for policy 1, policy_version 17502 (0.0010) [2023-10-08 00:34:16,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 35618816. Throughput: 0: 1715.0, 1: 1715.4. Samples: 8910778. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) [2023-10-08 00:34:16,211][50642] Avg episode reward: [(0, '15.560'), (1, '14.850')] [2023-10-08 00:34:18,693][52060] Updated weights for policy 0, policy_version 17290 (0.0010) [2023-10-08 00:34:19,061][52060] Updated weights for policy 0, policy_version 17300 (0.0009) [2023-10-08 00:34:19,428][52060] Updated weights for policy 0, policy_version 17310 (0.0007) [2023-10-08 00:34:19,573][52059] Updated weights for policy 1, policy_version 17512 (0.0009) [2023-10-08 00:34:19,945][52059] Updated weights for policy 1, policy_version 17522 (0.0010) [2023-10-08 00:34:20,310][52059] Updated weights for policy 1, policy_version 17532 (0.0007) [2023-10-08 00:34:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 35684352. Throughput: 0: 1730.3, 1: 1747.7. Samples: 8922380. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) [2023-10-08 00:34:21,211][50642] Avg episode reward: [(0, '14.370'), (1, '19.420')] [2023-10-08 00:34:23,537][52060] Updated weights for policy 0, policy_version 17320 (0.0009) [2023-10-08 00:34:23,910][52060] Updated weights for policy 0, policy_version 17330 (0.0009) [2023-10-08 00:34:24,101][52059] Updated weights for policy 1, policy_version 17542 (0.0008) [2023-10-08 00:34:24,278][52060] Updated weights for policy 0, policy_version 17340 (0.0008) [2023-10-08 00:34:24,474][52059] Updated weights for policy 1, policy_version 17552 (0.0008) [2023-10-08 00:34:24,839][52059] Updated weights for policy 1, policy_version 17562 (0.0008) [2023-10-08 00:34:26,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 35749888. Throughput: 0: 1706.9, 1: 1726.3. Samples: 8941956. Policy #0 lag: (min: 15.0, avg: 23.0, max: 47.0) [2023-10-08 00:34:26,211][50642] Avg episode reward: [(0, '16.620'), (1, '17.150')] [2023-10-08 00:34:26,213][51605] Saving new best policy, reward=16.620! [2023-10-08 00:34:28,263][52060] Updated weights for policy 0, policy_version 17350 (0.0008) [2023-10-08 00:34:28,639][52060] Updated weights for policy 0, policy_version 17360 (0.0007) [2023-10-08 00:34:28,853][52059] Updated weights for policy 1, policy_version 17572 (0.0009) [2023-10-08 00:34:29,000][52060] Updated weights for policy 0, policy_version 17370 (0.0009) [2023-10-08 00:34:29,208][52059] Updated weights for policy 1, policy_version 17582 (0.0010) [2023-10-08 00:34:29,573][52059] Updated weights for policy 1, policy_version 17592 (0.0007) [2023-10-08 00:34:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 35815424. Throughput: 0: 1715.7, 1: 1713.6. Samples: 8962766. Policy #0 lag: (min: 15.0, avg: 23.0, max: 47.0) [2023-10-08 00:34:31,211][50642] Avg episode reward: [(0, '15.210'), (1, '16.550')] [2023-10-08 00:34:32,797][52060] Updated weights for policy 0, policy_version 17380 (0.0009) [2023-10-08 00:34:33,175][52060] Updated weights for policy 0, policy_version 17390 (0.0009) [2023-10-08 00:34:33,536][52060] Updated weights for policy 0, policy_version 17400 (0.0008) [2023-10-08 00:34:33,617][52059] Updated weights for policy 1, policy_version 17602 (0.0010) [2023-10-08 00:34:33,974][52059] Updated weights for policy 1, policy_version 17612 (0.0009) [2023-10-08 00:34:34,338][52059] Updated weights for policy 1, policy_version 17622 (0.0008) [2023-10-08 00:34:34,695][52059] Updated weights for policy 1, policy_version 17632 (0.0007) [2023-10-08 00:34:36,210][50642] Fps is (10 sec: 13107.7, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 35880960. Throughput: 0: 1694.3, 1: 1736.4. Samples: 8973204. Policy #0 lag: (min: 15.0, avg: 23.0, max: 47.0) [2023-10-08 00:34:36,211][50642] Avg episode reward: [(0, '14.870'), (1, '19.910')] [2023-10-08 00:34:37,559][52060] Updated weights for policy 0, policy_version 17410 (0.0011) [2023-10-08 00:34:37,926][52060] Updated weights for policy 0, policy_version 17420 (0.0007) [2023-10-08 00:34:38,295][52060] Updated weights for policy 0, policy_version 17430 (0.0007) [2023-10-08 00:34:38,674][52060] Updated weights for policy 0, policy_version 17440 (0.0008) [2023-10-08 00:34:38,789][52059] Updated weights for policy 1, policy_version 17642 (0.0007) [2023-10-08 00:34:39,161][52059] Updated weights for policy 1, policy_version 17652 (0.0009) [2023-10-08 00:34:39,519][52059] Updated weights for policy 1, policy_version 17662 (0.0010) [2023-10-08 00:34:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 35946496. Throughput: 0: 1701.7, 1: 1713.5. Samples: 8993410. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:34:41,211][50642] Avg episode reward: [(0, '14.930'), (1, '15.570')] [2023-10-08 00:34:42,564][52060] Updated weights for policy 0, policy_version 17450 (0.0009) [2023-10-08 00:34:42,929][52060] Updated weights for policy 0, policy_version 17460 (0.0010) [2023-10-08 00:34:43,301][52060] Updated weights for policy 0, policy_version 17470 (0.0008) [2023-10-08 00:34:43,430][52059] Updated weights for policy 1, policy_version 17672 (0.0009) [2023-10-08 00:34:43,796][52059] Updated weights for policy 1, policy_version 17682 (0.0008) [2023-10-08 00:34:44,175][52059] Updated weights for policy 1, policy_version 17692 (0.0010) [2023-10-08 00:34:46,210][50642] Fps is (10 sec: 13106.7, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 36012032. Throughput: 0: 1728.1, 1: 1717.3. Samples: 9014778. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:34:46,211][50642] Avg episode reward: [(0, '14.550'), (1, '16.060')] [2023-10-08 00:34:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000017472_17891328.pth... [2023-10-08 00:34:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000017696_18120704.pth... [2023-10-08 00:34:46,250][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000015872_16252928.pth [2023-10-08 00:34:46,253][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000016064_16449536.pth [2023-10-08 00:34:47,276][52060] Updated weights for policy 0, policy_version 17480 (0.0009) [2023-10-08 00:34:47,649][52060] Updated weights for policy 0, policy_version 17490 (0.0009) [2023-10-08 00:34:48,020][52060] Updated weights for policy 0, policy_version 17500 (0.0009) [2023-10-08 00:34:48,172][52059] Updated weights for policy 1, policy_version 17702 (0.0008) [2023-10-08 00:34:48,539][52059] Updated weights for policy 1, policy_version 17712 (0.0010) [2023-10-08 00:34:48,908][52059] Updated weights for policy 1, policy_version 17722 (0.0007) [2023-10-08 00:34:51,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 36077568. Throughput: 0: 1697.3, 1: 1710.4. Samples: 9024242. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:34:51,211][50642] Avg episode reward: [(0, '15.450'), (1, '19.230')] [2023-10-08 00:34:51,964][52060] Updated weights for policy 0, policy_version 17510 (0.0008) [2023-10-08 00:34:52,343][52060] Updated weights for policy 0, policy_version 17520 (0.0007) [2023-10-08 00:34:52,711][52060] Updated weights for policy 0, policy_version 17530 (0.0008) [2023-10-08 00:34:52,800][52059] Updated weights for policy 1, policy_version 17732 (0.0008) [2023-10-08 00:34:53,166][52059] Updated weights for policy 1, policy_version 17742 (0.0007) [2023-10-08 00:34:53,535][52059] Updated weights for policy 1, policy_version 17752 (0.0007) [2023-10-08 00:34:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 36143104. Throughput: 0: 1722.2, 1: 1705.2. Samples: 9045276. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:34:56,211][50642] Avg episode reward: [(0, '14.470'), (1, '15.450')] [2023-10-08 00:34:56,769][52060] Updated weights for policy 0, policy_version 17540 (0.0009) [2023-10-08 00:34:57,143][52060] Updated weights for policy 0, policy_version 17550 (0.0010) [2023-10-08 00:34:57,517][52060] Updated weights for policy 0, policy_version 17560 (0.0009) [2023-10-08 00:34:57,581][52059] Updated weights for policy 1, policy_version 17762 (0.0007) [2023-10-08 00:34:57,962][52059] Updated weights for policy 1, policy_version 17772 (0.0007) [2023-10-08 00:34:58,328][52059] Updated weights for policy 1, policy_version 17782 (0.0007) [2023-10-08 00:34:58,689][52059] Updated weights for policy 1, policy_version 17792 (0.0008) [2023-10-08 00:35:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 36208640. Throughput: 0: 1720.8, 1: 1733.9. Samples: 9066242. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:35:01,211][50642] Avg episode reward: [(0, '15.230'), (1, '16.040')] [2023-10-08 00:35:01,592][52060] Updated weights for policy 0, policy_version 17570 (0.0009) [2023-10-08 00:35:01,990][52060] Updated weights for policy 0, policy_version 17580 (0.0011) [2023-10-08 00:35:02,373][52060] Updated weights for policy 0, policy_version 17590 (0.0009) [2023-10-08 00:35:02,574][52059] Updated weights for policy 1, policy_version 17802 (0.0007) [2023-10-08 00:35:02,733][52060] Updated weights for policy 0, policy_version 17600 (0.0008) [2023-10-08 00:35:02,939][52059] Updated weights for policy 1, policy_version 17812 (0.0008) [2023-10-08 00:35:03,305][52059] Updated weights for policy 1, policy_version 17822 (0.0008) [2023-10-08 00:35:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 36274176. Throughput: 0: 1699.4, 1: 1699.7. Samples: 9075340. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:35:06,211][50642] Avg episode reward: [(0, '16.020'), (1, '17.970')] [2023-10-08 00:35:06,647][52060] Updated weights for policy 0, policy_version 17610 (0.0010) [2023-10-08 00:35:07,014][52060] Updated weights for policy 0, policy_version 17620 (0.0009) [2023-10-08 00:35:07,371][52059] Updated weights for policy 1, policy_version 17832 (0.0007) [2023-10-08 00:35:07,392][52060] Updated weights for policy 0, policy_version 17630 (0.0008) [2023-10-08 00:35:07,733][52059] Updated weights for policy 1, policy_version 17842 (0.0008) [2023-10-08 00:35:08,100][52059] Updated weights for policy 1, policy_version 17852 (0.0007) [2023-10-08 00:35:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 36339712. Throughput: 0: 1713.6, 1: 1719.6. Samples: 9096450. Policy #0 lag: (min: 24.0, avg: 40.8, max: 56.0) [2023-10-08 00:35:11,211][50642] Avg episode reward: [(0, '14.710'), (1, '16.350')] [2023-10-08 00:35:11,557][52060] Updated weights for policy 0, policy_version 17640 (0.0010) [2023-10-08 00:35:11,925][52060] Updated weights for policy 0, policy_version 17650 (0.0009) [2023-10-08 00:35:12,028][52059] Updated weights for policy 1, policy_version 17862 (0.0008) [2023-10-08 00:35:12,301][52060] Updated weights for policy 0, policy_version 17660 (0.0007) [2023-10-08 00:35:12,390][52059] Updated weights for policy 1, policy_version 17872 (0.0009) [2023-10-08 00:35:12,750][52059] Updated weights for policy 1, policy_version 17882 (0.0008) [2023-10-08 00:35:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 36405248. Throughput: 0: 1712.8, 1: 1727.6. Samples: 9117580. Policy #0 lag: (min: 24.0, avg: 40.8, max: 56.0) [2023-10-08 00:35:16,211][50642] Avg episode reward: [(0, '15.610'), (1, '15.040')] [2023-10-08 00:35:16,341][52060] Updated weights for policy 0, policy_version 17670 (0.0009) [2023-10-08 00:35:16,707][52060] Updated weights for policy 0, policy_version 17680 (0.0008) [2023-10-08 00:35:16,833][52059] Updated weights for policy 1, policy_version 17892 (0.0010) [2023-10-08 00:35:17,080][52060] Updated weights for policy 0, policy_version 17690 (0.0007) [2023-10-08 00:35:17,203][52059] Updated weights for policy 1, policy_version 17902 (0.0008) [2023-10-08 00:35:17,562][52059] Updated weights for policy 1, policy_version 17912 (0.0008) [2023-10-08 00:35:21,155][52060] Updated weights for policy 0, policy_version 17700 (0.0007) [2023-10-08 00:35:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 36470784. Throughput: 0: 1708.1, 1: 1707.0. Samples: 9126884. Policy #0 lag: (min: 24.0, avg: 40.8, max: 56.0) [2023-10-08 00:35:21,211][50642] Avg episode reward: [(0, '15.960'), (1, '17.230')] [2023-10-08 00:35:21,477][52059] Updated weights for policy 1, policy_version 17922 (0.0010) [2023-10-08 00:35:21,511][52060] Updated weights for policy 0, policy_version 17710 (0.0009) [2023-10-08 00:35:21,836][52059] Updated weights for policy 1, policy_version 17932 (0.0010) [2023-10-08 00:35:21,879][52060] Updated weights for policy 0, policy_version 17720 (0.0010) [2023-10-08 00:35:22,201][52059] Updated weights for policy 1, policy_version 17942 (0.0007) [2023-10-08 00:35:22,556][52059] Updated weights for policy 1, policy_version 17952 (0.0008) [2023-10-08 00:35:25,921][52060] Updated weights for policy 0, policy_version 17730 (0.0009) [2023-10-08 00:35:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 36536320. Throughput: 0: 1702.4, 1: 1737.1. Samples: 9148188. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:35:26,211][50642] Avg episode reward: [(0, '14.990'), (1, '15.220')] [2023-10-08 00:35:26,292][52060] Updated weights for policy 0, policy_version 17740 (0.0007) [2023-10-08 00:35:26,481][52059] Updated weights for policy 1, policy_version 17962 (0.0008) [2023-10-08 00:35:26,665][52060] Updated weights for policy 0, policy_version 17750 (0.0007) [2023-10-08 00:35:26,844][52059] Updated weights for policy 1, policy_version 17972 (0.0009) [2023-10-08 00:35:27,031][52060] Updated weights for policy 0, policy_version 17760 (0.0007) [2023-10-08 00:35:27,203][52059] Updated weights for policy 1, policy_version 17982 (0.0010) [2023-10-08 00:35:31,070][52060] Updated weights for policy 0, policy_version 17770 (0.0008) [2023-10-08 00:35:31,096][52059] Updated weights for policy 1, policy_version 17992 (0.0007) [2023-10-08 00:35:31,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 36601856. Throughput: 0: 1699.6, 1: 1737.6. Samples: 9169452. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:35:31,212][50642] Avg episode reward: [(0, '16.780'), (1, '16.230')] [2023-10-08 00:35:31,430][52060] Updated weights for policy 0, policy_version 17780 (0.0007) [2023-10-08 00:35:31,460][52059] Updated weights for policy 1, policy_version 18002 (0.0007) [2023-10-08 00:35:31,803][52060] Updated weights for policy 0, policy_version 17790 (0.0009) [2023-10-08 00:35:31,829][52059] Updated weights for policy 1, policy_version 18012 (0.0007) [2023-10-08 00:35:31,872][51605] Saving new best policy, reward=16.780! [2023-10-08 00:35:35,707][52060] Updated weights for policy 0, policy_version 17800 (0.0008) [2023-10-08 00:35:35,714][52059] Updated weights for policy 1, policy_version 18022 (0.0011) [2023-10-08 00:35:36,074][52060] Updated weights for policy 0, policy_version 17810 (0.0007) [2023-10-08 00:35:36,088][52059] Updated weights for policy 1, policy_version 18032 (0.0007) [2023-10-08 00:35:36,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 36667392. Throughput: 0: 1702.8, 1: 1733.2. Samples: 9178862. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:35:36,211][50642] Avg episode reward: [(0, '14.800'), (1, '18.700')] [2023-10-08 00:35:36,441][52060] Updated weights for policy 0, policy_version 17820 (0.0007) [2023-10-08 00:35:36,459][52059] Updated weights for policy 1, policy_version 18042 (0.0008) [2023-10-08 00:35:40,512][52060] Updated weights for policy 0, policy_version 17830 (0.0008) [2023-10-08 00:35:40,543][52059] Updated weights for policy 1, policy_version 18052 (0.0009) [2023-10-08 00:35:40,880][52060] Updated weights for policy 0, policy_version 17840 (0.0009) [2023-10-08 00:35:40,906][52059] Updated weights for policy 1, policy_version 18062 (0.0009) [2023-10-08 00:35:41,210][50642] Fps is (10 sec: 13107.7, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 36732928. Throughput: 0: 1700.5, 1: 1734.4. Samples: 9199846. Policy #0 lag: (min: 18.0, avg: 19.9, max: 39.0) [2023-10-08 00:35:41,211][50642] Avg episode reward: [(0, '15.340'), (1, '14.980')] [2023-10-08 00:35:41,247][52060] Updated weights for policy 0, policy_version 17850 (0.0008) [2023-10-08 00:35:41,274][52059] Updated weights for policy 1, policy_version 18072 (0.0007) [2023-10-08 00:35:45,159][52060] Updated weights for policy 0, policy_version 17860 (0.0008) [2023-10-08 00:35:45,186][52059] Updated weights for policy 1, policy_version 18082 (0.0008) [2023-10-08 00:35:45,533][52060] Updated weights for policy 0, policy_version 17870 (0.0008) [2023-10-08 00:35:45,607][52059] Updated weights for policy 1, policy_version 18092 (0.0008) [2023-10-08 00:35:45,900][52060] Updated weights for policy 0, policy_version 17880 (0.0009) [2023-10-08 00:35:45,965][52059] Updated weights for policy 1, policy_version 18102 (0.0009) [2023-10-08 00:35:46,210][50642] Fps is (10 sec: 16383.7, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 36831232. Throughput: 0: 1690.3, 1: 1716.6. Samples: 9219552. Policy #0 lag: (min: 18.0, avg: 19.9, max: 39.0) [2023-10-08 00:35:46,211][50642] Avg episode reward: [(0, '15.760'), (1, '17.760')] [2023-10-08 00:35:46,330][52059] Updated weights for policy 1, policy_version 18112 (0.0008) [2023-10-08 00:35:49,952][52060] Updated weights for policy 0, policy_version 17890 (0.0008) [2023-10-08 00:35:50,149][52059] Updated weights for policy 1, policy_version 18122 (0.0008) [2023-10-08 00:35:50,356][52060] Updated weights for policy 0, policy_version 17900 (0.0008) [2023-10-08 00:35:50,511][52059] Updated weights for policy 1, policy_version 18132 (0.0008) [2023-10-08 00:35:50,728][52060] Updated weights for policy 0, policy_version 17910 (0.0008) [2023-10-08 00:35:50,864][52059] Updated weights for policy 1, policy_version 18142 (0.0008) [2023-10-08 00:35:51,094][52060] Updated weights for policy 0, policy_version 17920 (0.0008) [2023-10-08 00:35:51,210][50642] Fps is (10 sec: 19660.6, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 36929536. Throughput: 0: 1713.2, 1: 1733.9. Samples: 9230458. Policy #0 lag: (min: 22.0, avg: 24.2, max: 54.0) [2023-10-08 00:35:51,211][50642] Avg episode reward: [(0, '15.440'), (1, '18.770')] [2023-10-08 00:35:54,871][52059] Updated weights for policy 1, policy_version 18152 (0.0007) [2023-10-08 00:35:55,015][52060] Updated weights for policy 0, policy_version 17930 (0.0008) [2023-10-08 00:35:55,234][52059] Updated weights for policy 1, policy_version 18162 (0.0007) [2023-10-08 00:35:55,381][52060] Updated weights for policy 0, policy_version 17940 (0.0008) [2023-10-08 00:35:55,589][52059] Updated weights for policy 1, policy_version 18172 (0.0007) [2023-10-08 00:35:55,751][52060] Updated weights for policy 0, policy_version 17950 (0.0009) [2023-10-08 00:35:56,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 36995072. Throughput: 0: 1706.6, 1: 1730.9. Samples: 9251138. Policy #0 lag: (min: 22.0, avg: 24.2, max: 54.0) [2023-10-08 00:35:56,211][50642] Avg episode reward: [(0, '16.540'), (1, '15.490')] [2023-10-08 00:35:59,423][52059] Updated weights for policy 1, policy_version 18182 (0.0007) [2023-10-08 00:35:59,776][52060] Updated weights for policy 0, policy_version 17960 (0.0008) [2023-10-08 00:35:59,781][52059] Updated weights for policy 1, policy_version 18192 (0.0007) [2023-10-08 00:36:00,139][52059] Updated weights for policy 1, policy_version 18202 (0.0008) [2023-10-08 00:36:00,151][52060] Updated weights for policy 0, policy_version 17970 (0.0007) [2023-10-08 00:36:00,519][52060] Updated weights for policy 0, policy_version 17980 (0.0009) [2023-10-08 00:36:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 37060608. Throughput: 0: 1686.3, 1: 1710.5. Samples: 9270436. Policy #0 lag: (min: 22.0, avg: 24.2, max: 54.0) [2023-10-08 00:36:01,211][50642] Avg episode reward: [(0, '15.940'), (1, '15.780')] [2023-10-08 00:36:04,109][52059] Updated weights for policy 1, policy_version 18212 (0.0009) [2023-10-08 00:36:04,469][52059] Updated weights for policy 1, policy_version 18222 (0.0007) [2023-10-08 00:36:04,571][52060] Updated weights for policy 0, policy_version 17990 (0.0008) [2023-10-08 00:36:04,834][52059] Updated weights for policy 1, policy_version 18232 (0.0007) [2023-10-08 00:36:04,935][52060] Updated weights for policy 0, policy_version 18000 (0.0008) [2023-10-08 00:36:05,310][52060] Updated weights for policy 0, policy_version 18010 (0.0008) [2023-10-08 00:36:06,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 37126144. Throughput: 0: 1716.0, 1: 1739.5. Samples: 9282382. Policy #0 lag: (min: 46.0, avg: 55.5, max: 56.0) [2023-10-08 00:36:06,211][50642] Avg episode reward: [(0, '15.540'), (1, '18.760')] [2023-10-08 00:36:08,904][52059] Updated weights for policy 1, policy_version 18242 (0.0007) [2023-10-08 00:36:09,230][52060] Updated weights for policy 0, policy_version 18020 (0.0009) [2023-10-08 00:36:09,267][52059] Updated weights for policy 1, policy_version 18252 (0.0008) [2023-10-08 00:36:09,591][52060] Updated weights for policy 0, policy_version 18030 (0.0007) [2023-10-08 00:36:09,628][52059] Updated weights for policy 1, policy_version 18262 (0.0007) [2023-10-08 00:36:09,968][52060] Updated weights for policy 0, policy_version 18040 (0.0009) [2023-10-08 00:36:09,994][52059] Updated weights for policy 1, policy_version 18272 (0.0007) [2023-10-08 00:36:11,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 37191680. Throughput: 0: 1704.8, 1: 1706.8. Samples: 9301708. Policy #0 lag: (min: 46.0, avg: 55.5, max: 56.0) [2023-10-08 00:36:11,211][50642] Avg episode reward: [(0, '15.310'), (1, '16.590')] [2023-10-08 00:36:13,837][52060] Updated weights for policy 0, policy_version 18050 (0.0009) [2023-10-08 00:36:14,120][52059] Updated weights for policy 1, policy_version 18282 (0.0009) [2023-10-08 00:36:14,206][52060] Updated weights for policy 0, policy_version 18060 (0.0007) [2023-10-08 00:36:14,481][52059] Updated weights for policy 1, policy_version 18292 (0.0008) [2023-10-08 00:36:14,576][52060] Updated weights for policy 0, policy_version 18070 (0.0008) [2023-10-08 00:36:14,846][52059] Updated weights for policy 1, policy_version 18302 (0.0009) [2023-10-08 00:36:14,940][52060] Updated weights for policy 0, policy_version 18080 (0.0008) [2023-10-08 00:36:16,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 37257216. Throughput: 0: 1695.3, 1: 1695.9. Samples: 9322054. Policy #0 lag: (min: 46.0, avg: 55.5, max: 56.0) [2023-10-08 00:36:16,211][50642] Avg episode reward: [(0, '15.010'), (1, '15.250')] [2023-10-08 00:36:18,674][52059] Updated weights for policy 1, policy_version 18312 (0.0009) [2023-10-08 00:36:18,975][52060] Updated weights for policy 0, policy_version 18090 (0.0008) [2023-10-08 00:36:19,034][52059] Updated weights for policy 1, policy_version 18322 (0.0008) [2023-10-08 00:36:19,338][52060] Updated weights for policy 0, policy_version 18100 (0.0007) [2023-10-08 00:36:19,406][52059] Updated weights for policy 1, policy_version 18332 (0.0009) [2023-10-08 00:36:19,702][52060] Updated weights for policy 0, policy_version 18110 (0.0010) [2023-10-08 00:36:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 37322752. Throughput: 0: 1714.2, 1: 1718.5. Samples: 9333332. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 00:36:21,211][50642] Avg episode reward: [(0, '14.430'), (1, '17.740')] [2023-10-08 00:36:23,207][52059] Updated weights for policy 1, policy_version 18342 (0.0008) [2023-10-08 00:36:23,579][52059] Updated weights for policy 1, policy_version 18352 (0.0008) [2023-10-08 00:36:23,698][52060] Updated weights for policy 0, policy_version 18120 (0.0008) [2023-10-08 00:36:23,930][52059] Updated weights for policy 1, policy_version 18362 (0.0008) [2023-10-08 00:36:24,066][52060] Updated weights for policy 0, policy_version 18130 (0.0008) [2023-10-08 00:36:24,433][52060] Updated weights for policy 0, policy_version 18140 (0.0007) [2023-10-08 00:36:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 37388288. Throughput: 0: 1686.8, 1: 1710.1. Samples: 9352708. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 00:36:26,211][50642] Avg episode reward: [(0, '15.800'), (1, '16.550')] [2023-10-08 00:36:27,977][52059] Updated weights for policy 1, policy_version 18372 (0.0008) [2023-10-08 00:36:28,344][52059] Updated weights for policy 1, policy_version 18382 (0.0007) [2023-10-08 00:36:28,431][52060] Updated weights for policy 0, policy_version 18150 (0.0008) [2023-10-08 00:36:28,716][52059] Updated weights for policy 1, policy_version 18392 (0.0007) [2023-10-08 00:36:28,805][52060] Updated weights for policy 0, policy_version 18160 (0.0009) [2023-10-08 00:36:29,174][52060] Updated weights for policy 0, policy_version 18170 (0.0009) [2023-10-08 00:36:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 37453824. Throughput: 0: 1705.2, 1: 1728.2. Samples: 9374054. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 00:36:31,211][50642] Avg episode reward: [(0, '15.250'), (1, '16.500')] [2023-10-08 00:36:32,494][52059] Updated weights for policy 1, policy_version 18402 (0.0007) [2023-10-08 00:36:32,871][52059] Updated weights for policy 1, policy_version 18412 (0.0010) [2023-10-08 00:36:33,161][52060] Updated weights for policy 0, policy_version 18180 (0.0007) [2023-10-08 00:36:33,233][52059] Updated weights for policy 1, policy_version 18422 (0.0008) [2023-10-08 00:36:33,527][52060] Updated weights for policy 0, policy_version 18190 (0.0008) [2023-10-08 00:36:33,601][52059] Updated weights for policy 1, policy_version 18432 (0.0009) [2023-10-08 00:36:33,893][52060] Updated weights for policy 0, policy_version 18200 (0.0007) [2023-10-08 00:36:36,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 37519360. Throughput: 0: 1693.4, 1: 1712.3. Samples: 9383714. Policy #0 lag: (min: 16.0, avg: 38.8, max: 48.0) [2023-10-08 00:36:36,211][50642] Avg episode reward: [(0, '15.100'), (1, '17.710')] [2023-10-08 00:36:37,717][52059] Updated weights for policy 1, policy_version 18442 (0.0008) [2023-10-08 00:36:38,028][52060] Updated weights for policy 0, policy_version 18210 (0.0008) [2023-10-08 00:36:38,076][52059] Updated weights for policy 1, policy_version 18452 (0.0009) [2023-10-08 00:36:38,430][52060] Updated weights for policy 0, policy_version 18220 (0.0008) [2023-10-08 00:36:38,437][52059] Updated weights for policy 1, policy_version 18462 (0.0009) [2023-10-08 00:36:38,805][52060] Updated weights for policy 0, policy_version 18230 (0.0010) [2023-10-08 00:36:39,176][52060] Updated weights for policy 0, policy_version 18240 (0.0011) [2023-10-08 00:36:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 37584896. Throughput: 0: 1688.9, 1: 1711.3. Samples: 9404150. Policy #0 lag: (min: 16.0, avg: 38.8, max: 48.0) [2023-10-08 00:36:41,211][50642] Avg episode reward: [(0, '16.250'), (1, '17.220')] [2023-10-08 00:36:42,512][52059] Updated weights for policy 1, policy_version 18472 (0.0010) [2023-10-08 00:36:42,881][52059] Updated weights for policy 1, policy_version 18482 (0.0009) [2023-10-08 00:36:43,123][52060] Updated weights for policy 0, policy_version 18250 (0.0008) [2023-10-08 00:36:43,234][52059] Updated weights for policy 1, policy_version 18492 (0.0007) [2023-10-08 00:36:43,487][52060] Updated weights for policy 0, policy_version 18260 (0.0009) [2023-10-08 00:36:43,865][52060] Updated weights for policy 0, policy_version 18270 (0.0009) [2023-10-08 00:36:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 37650432. Throughput: 0: 1710.0, 1: 1737.1. Samples: 9425554. Policy #0 lag: (min: 16.0, avg: 38.8, max: 48.0) [2023-10-08 00:36:46,211][50642] Avg episode reward: [(0, '14.330'), (1, '17.950')] [2023-10-08 00:36:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000018272_18710528.pth... [2023-10-08 00:36:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000018496_18939904.pth... [2023-10-08 00:36:46,250][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000016672_17072128.pth [2023-10-08 00:36:46,253][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000016896_17301504.pth [2023-10-08 00:36:47,169][52059] Updated weights for policy 1, policy_version 18502 (0.0007) [2023-10-08 00:36:47,534][52059] Updated weights for policy 1, policy_version 18512 (0.0008) [2023-10-08 00:36:47,894][52059] Updated weights for policy 1, policy_version 18522 (0.0009) [2023-10-08 00:36:47,930][52060] Updated weights for policy 0, policy_version 18280 (0.0008) [2023-10-08 00:36:48,297][52060] Updated weights for policy 0, policy_version 18290 (0.0010) [2023-10-08 00:36:48,673][52060] Updated weights for policy 0, policy_version 18300 (0.0010) [2023-10-08 00:36:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 37715968. Throughput: 0: 1680.9, 1: 1707.1. Samples: 9434844. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 00:36:51,211][50642] Avg episode reward: [(0, '15.070'), (1, '17.500')] [2023-10-08 00:36:51,739][52059] Updated weights for policy 1, policy_version 18532 (0.0007) [2023-10-08 00:36:52,111][52059] Updated weights for policy 1, policy_version 18542 (0.0009) [2023-10-08 00:36:52,467][52059] Updated weights for policy 1, policy_version 18552 (0.0008) [2023-10-08 00:36:52,805][52060] Updated weights for policy 0, policy_version 18310 (0.0009) [2023-10-08 00:36:53,163][52060] Updated weights for policy 0, policy_version 18320 (0.0009) [2023-10-08 00:36:53,537][52060] Updated weights for policy 0, policy_version 18330 (0.0007) [2023-10-08 00:36:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 37781504. Throughput: 0: 1692.7, 1: 1747.0. Samples: 9456492. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 00:36:56,211][50642] Avg episode reward: [(0, '12.590'), (1, '18.680')] [2023-10-08 00:36:56,222][52059] Updated weights for policy 1, policy_version 18562 (0.0009) [2023-10-08 00:36:56,584][52059] Updated weights for policy 1, policy_version 18572 (0.0007) [2023-10-08 00:36:56,945][52059] Updated weights for policy 1, policy_version 18582 (0.0007) [2023-10-08 00:36:57,305][52059] Updated weights for policy 1, policy_version 18592 (0.0007) [2023-10-08 00:36:57,436][52060] Updated weights for policy 0, policy_version 18340 (0.0009) [2023-10-08 00:36:57,802][52060] Updated weights for policy 0, policy_version 18350 (0.0010) [2023-10-08 00:36:58,175][52060] Updated weights for policy 0, policy_version 18360 (0.0010) [2023-10-08 00:37:01,131][52059] Updated weights for policy 1, policy_version 18602 (0.0007) [2023-10-08 00:37:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 37847040. Throughput: 0: 1703.8, 1: 1753.2. Samples: 9477618. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 00:37:01,211][50642] Avg episode reward: [(0, '13.560'), (1, '19.050')] [2023-10-08 00:37:01,500][52059] Updated weights for policy 1, policy_version 18612 (0.0007) [2023-10-08 00:37:01,863][52059] Updated weights for policy 1, policy_version 18622 (0.0008) [2023-10-08 00:37:02,257][52060] Updated weights for policy 0, policy_version 18370 (0.0009) [2023-10-08 00:37:02,627][52060] Updated weights for policy 0, policy_version 18380 (0.0009) [2023-10-08 00:37:02,990][52060] Updated weights for policy 0, policy_version 18390 (0.0008) [2023-10-08 00:37:03,359][52060] Updated weights for policy 0, policy_version 18400 (0.0008) [2023-10-08 00:37:05,844][52059] Updated weights for policy 1, policy_version 18632 (0.0008) [2023-10-08 00:37:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 37912576. Throughput: 0: 1679.1, 1: 1733.6. Samples: 9486904. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:37:06,211][50642] Avg episode reward: [(0, '13.420'), (1, '16.490')] [2023-10-08 00:37:06,213][52059] Updated weights for policy 1, policy_version 18642 (0.0007) [2023-10-08 00:37:06,571][52059] Updated weights for policy 1, policy_version 18652 (0.0007) [2023-10-08 00:37:07,294][52060] Updated weights for policy 0, policy_version 18410 (0.0007) [2023-10-08 00:37:07,665][52060] Updated weights for policy 0, policy_version 18420 (0.0007) [2023-10-08 00:37:08,036][52060] Updated weights for policy 0, policy_version 18430 (0.0007) [2023-10-08 00:37:10,369][52059] Updated weights for policy 1, policy_version 18662 (0.0008) [2023-10-08 00:37:10,733][52059] Updated weights for policy 1, policy_version 18672 (0.0009) [2023-10-08 00:37:11,097][52059] Updated weights for policy 1, policy_version 18682 (0.0008) [2023-10-08 00:37:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 37978112. Throughput: 0: 1711.1, 1: 1752.1. Samples: 9508548. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:37:11,211][50642] Avg episode reward: [(0, '13.880'), (1, '17.120')] [2023-10-08 00:37:11,813][52060] Updated weights for policy 0, policy_version 18440 (0.0010) [2023-10-08 00:37:12,184][52060] Updated weights for policy 0, policy_version 18450 (0.0011) [2023-10-08 00:37:12,555][52060] Updated weights for policy 0, policy_version 18460 (0.0011) [2023-10-08 00:37:15,143][52059] Updated weights for policy 1, policy_version 18692 (0.0008) [2023-10-08 00:37:15,504][52059] Updated weights for policy 1, policy_version 18702 (0.0009) [2023-10-08 00:37:15,876][52059] Updated weights for policy 1, policy_version 18712 (0.0010) [2023-10-08 00:37:16,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 38076416. Throughput: 0: 1706.1, 1: 1732.3. Samples: 9528782. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:37:16,211][50642] Avg episode reward: [(0, '17.120'), (1, '18.560')] [2023-10-08 00:37:16,222][51605] Saving new best policy, reward=17.120! [2023-10-08 00:37:16,653][52060] Updated weights for policy 0, policy_version 18470 (0.0009) [2023-10-08 00:37:17,038][52060] Updated weights for policy 0, policy_version 18480 (0.0010) [2023-10-08 00:37:17,400][52060] Updated weights for policy 0, policy_version 18490 (0.0008) [2023-10-08 00:37:19,842][52059] Updated weights for policy 1, policy_version 18722 (0.0007) [2023-10-08 00:37:20,239][52059] Updated weights for policy 1, policy_version 18732 (0.0010) [2023-10-08 00:37:20,605][52059] Updated weights for policy 1, policy_version 18742 (0.0009) [2023-10-08 00:37:20,961][52059] Updated weights for policy 1, policy_version 18752 (0.0010) [2023-10-08 00:37:21,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 38141952. Throughput: 0: 1696.4, 1: 1756.8. Samples: 9539106. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 00:37:21,211][50642] Avg episode reward: [(0, '13.590'), (1, '18.310')] [2023-10-08 00:37:21,575][52060] Updated weights for policy 0, policy_version 18500 (0.0007) [2023-10-08 00:37:21,949][52060] Updated weights for policy 0, policy_version 18510 (0.0009) [2023-10-08 00:37:22,330][52060] Updated weights for policy 0, policy_version 18520 (0.0009) [2023-10-08 00:37:24,816][52059] Updated weights for policy 1, policy_version 18762 (0.0007) [2023-10-08 00:37:25,184][52059] Updated weights for policy 1, policy_version 18772 (0.0007) [2023-10-08 00:37:25,551][52059] Updated weights for policy 1, policy_version 18782 (0.0008) [2023-10-08 00:37:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 38207488. Throughput: 0: 1709.8, 1: 1748.7. Samples: 9559782. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 00:37:26,211][50642] Avg episode reward: [(0, '15.290'), (1, '17.470')] [2023-10-08 00:37:26,415][52060] Updated weights for policy 0, policy_version 18530 (0.0009) [2023-10-08 00:37:26,834][52060] Updated weights for policy 0, policy_version 18540 (0.0008) [2023-10-08 00:37:27,199][52060] Updated weights for policy 0, policy_version 18550 (0.0008) [2023-10-08 00:37:27,565][52060] Updated weights for policy 0, policy_version 18560 (0.0009) [2023-10-08 00:37:29,417][52059] Updated weights for policy 1, policy_version 18792 (0.0009) [2023-10-08 00:37:29,779][52059] Updated weights for policy 1, policy_version 18802 (0.0009) [2023-10-08 00:37:30,141][52059] Updated weights for policy 1, policy_version 18812 (0.0008) [2023-10-08 00:37:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 38273024. Throughput: 0: 1703.4, 1: 1730.3. Samples: 9580070. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 00:37:31,211][50642] Avg episode reward: [(0, '14.780'), (1, '18.640')] [2023-10-08 00:37:31,642][52060] Updated weights for policy 0, policy_version 18570 (0.0009) [2023-10-08 00:37:32,003][52060] Updated weights for policy 0, policy_version 18580 (0.0009) [2023-10-08 00:37:32,375][52060] Updated weights for policy 0, policy_version 18590 (0.0008) [2023-10-08 00:37:34,087][52059] Updated weights for policy 1, policy_version 18822 (0.0007) [2023-10-08 00:37:34,444][52059] Updated weights for policy 1, policy_version 18832 (0.0007) [2023-10-08 00:37:34,822][52059] Updated weights for policy 1, policy_version 18842 (0.0010) [2023-10-08 00:37:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 38338560. Throughput: 0: 1702.3, 1: 1760.2. Samples: 9590654. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:37:36,211][50642] Avg episode reward: [(0, '14.740'), (1, '17.090')] [2023-10-08 00:37:36,282][52060] Updated weights for policy 0, policy_version 18600 (0.0009) [2023-10-08 00:37:36,648][52060] Updated weights for policy 0, policy_version 18610 (0.0008) [2023-10-08 00:37:37,011][52060] Updated weights for policy 0, policy_version 18620 (0.0008) [2023-10-08 00:37:38,645][52059] Updated weights for policy 1, policy_version 18852 (0.0009) [2023-10-08 00:37:39,002][52059] Updated weights for policy 1, policy_version 18862 (0.0008) [2023-10-08 00:37:39,367][52059] Updated weights for policy 1, policy_version 18872 (0.0008) [2023-10-08 00:37:41,036][52060] Updated weights for policy 0, policy_version 18630 (0.0009) [2023-10-08 00:37:41,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 38404096. Throughput: 0: 1708.0, 1: 1721.1. Samples: 9610800. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:37:41,211][50642] Avg episode reward: [(0, '16.550'), (1, '17.040')] [2023-10-08 00:37:41,411][52060] Updated weights for policy 0, policy_version 18640 (0.0010) [2023-10-08 00:37:41,783][52060] Updated weights for policy 0, policy_version 18650 (0.0010) [2023-10-08 00:37:43,226][52059] Updated weights for policy 1, policy_version 18882 (0.0007) [2023-10-08 00:37:43,595][52059] Updated weights for policy 1, policy_version 18892 (0.0007) [2023-10-08 00:37:43,969][52059] Updated weights for policy 1, policy_version 18902 (0.0008) [2023-10-08 00:37:44,336][52059] Updated weights for policy 1, policy_version 18912 (0.0012) [2023-10-08 00:37:45,688][52060] Updated weights for policy 0, policy_version 18660 (0.0009) [2023-10-08 00:37:46,065][52060] Updated weights for policy 0, policy_version 18670 (0.0008) [2023-10-08 00:37:46,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 38469632. Throughput: 0: 1699.0, 1: 1737.7. Samples: 9632268. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:37:46,211][50642] Avg episode reward: [(0, '15.910'), (1, '18.510')] [2023-10-08 00:37:46,437][52060] Updated weights for policy 0, policy_version 18680 (0.0007) [2023-10-08 00:37:48,323][52059] Updated weights for policy 1, policy_version 18922 (0.0010) [2023-10-08 00:37:48,691][52059] Updated weights for policy 1, policy_version 18932 (0.0009) [2023-10-08 00:37:49,061][52059] Updated weights for policy 1, policy_version 18942 (0.0009) [2023-10-08 00:37:50,349][52060] Updated weights for policy 0, policy_version 18690 (0.0008) [2023-10-08 00:37:50,718][52060] Updated weights for policy 0, policy_version 18700 (0.0009) [2023-10-08 00:37:51,089][52060] Updated weights for policy 0, policy_version 18710 (0.0009) [2023-10-08 00:37:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 38535168. Throughput: 0: 1709.5, 1: 1742.2. Samples: 9642228. Policy #0 lag: (min: 11.0, avg: 11.3, max: 22.0) [2023-10-08 00:37:51,211][50642] Avg episode reward: [(0, '15.310'), (1, '18.710')] [2023-10-08 00:37:51,458][52060] Updated weights for policy 0, policy_version 18720 (0.0008) [2023-10-08 00:37:52,944][52059] Updated weights for policy 1, policy_version 18952 (0.0008) [2023-10-08 00:37:53,308][52059] Updated weights for policy 1, policy_version 18962 (0.0007) [2023-10-08 00:37:53,675][52059] Updated weights for policy 1, policy_version 18972 (0.0008) [2023-10-08 00:37:55,535][52060] Updated weights for policy 0, policy_version 18730 (0.0010) [2023-10-08 00:37:55,901][52060] Updated weights for policy 0, policy_version 18740 (0.0008) [2023-10-08 00:37:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 38600704. Throughput: 0: 1705.5, 1: 1734.8. Samples: 9663362. Policy #0 lag: (min: 11.0, avg: 11.3, max: 22.0) [2023-10-08 00:37:56,211][50642] Avg episode reward: [(0, '16.350'), (1, '18.010')] [2023-10-08 00:37:56,273][52060] Updated weights for policy 0, policy_version 18750 (0.0009) [2023-10-08 00:37:57,491][52059] Updated weights for policy 1, policy_version 18982 (0.0007) [2023-10-08 00:37:57,860][52059] Updated weights for policy 1, policy_version 18992 (0.0007) [2023-10-08 00:37:58,219][52059] Updated weights for policy 1, policy_version 19002 (0.0008) [2023-10-08 00:38:00,158][52060] Updated weights for policy 0, policy_version 18760 (0.0008) [2023-10-08 00:38:00,534][52060] Updated weights for policy 0, policy_version 18770 (0.0009) [2023-10-08 00:38:00,899][52060] Updated weights for policy 0, policy_version 18780 (0.0008) [2023-10-08 00:38:01,210][50642] Fps is (10 sec: 16383.4, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 38699008. Throughput: 0: 1690.4, 1: 1755.4. Samples: 9683844. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:38:01,211][50642] Avg episode reward: [(0, '15.170'), (1, '17.220')] [2023-10-08 00:38:02,237][52059] Updated weights for policy 1, policy_version 19012 (0.0009) [2023-10-08 00:38:02,601][52059] Updated weights for policy 1, policy_version 19022 (0.0007) [2023-10-08 00:38:02,972][52059] Updated weights for policy 1, policy_version 19032 (0.0008) [2023-10-08 00:38:04,839][52060] Updated weights for policy 0, policy_version 18790 (0.0010) [2023-10-08 00:38:05,208][52060] Updated weights for policy 0, policy_version 18800 (0.0007) [2023-10-08 00:38:05,579][52060] Updated weights for policy 0, policy_version 18810 (0.0008) [2023-10-08 00:38:06,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 38764544. Throughput: 0: 1720.2, 1: 1727.1. Samples: 9694236. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:38:06,210][50642] Avg episode reward: [(0, '15.500'), (1, '18.820')] [2023-10-08 00:38:06,876][52059] Updated weights for policy 1, policy_version 19042 (0.0011) [2023-10-08 00:38:07,265][52059] Updated weights for policy 1, policy_version 19052 (0.0008) [2023-10-08 00:38:07,636][52059] Updated weights for policy 1, policy_version 19062 (0.0009) [2023-10-08 00:38:07,995][52059] Updated weights for policy 1, policy_version 19072 (0.0009) [2023-10-08 00:38:09,401][52060] Updated weights for policy 0, policy_version 18820 (0.0008) [2023-10-08 00:38:09,773][52060] Updated weights for policy 0, policy_version 18830 (0.0008) [2023-10-08 00:38:10,136][52060] Updated weights for policy 0, policy_version 18840 (0.0007) [2023-10-08 00:38:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 38830080. Throughput: 0: 1705.7, 1: 1738.8. Samples: 9714788. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:38:11,212][50642] Avg episode reward: [(0, '14.800'), (1, '17.640')] [2023-10-08 00:38:11,919][52059] Updated weights for policy 1, policy_version 19082 (0.0010) [2023-10-08 00:38:12,284][52059] Updated weights for policy 1, policy_version 19092 (0.0009) [2023-10-08 00:38:12,645][52059] Updated weights for policy 1, policy_version 19102 (0.0007) [2023-10-08 00:38:14,172][52060] Updated weights for policy 0, policy_version 18850 (0.0010) [2023-10-08 00:38:14,574][52060] Updated weights for policy 0, policy_version 18860 (0.0007) [2023-10-08 00:38:14,943][52060] Updated weights for policy 0, policy_version 18870 (0.0008) [2023-10-08 00:38:15,314][52060] Updated weights for policy 0, policy_version 18880 (0.0009) [2023-10-08 00:38:16,210][50642] Fps is (10 sec: 13106.6, 60 sec: 13653.3, 300 sec: 13773.6). Total num frames: 38895616. Throughput: 0: 1696.0, 1: 1757.5. Samples: 9735478. Policy #0 lag: (min: 23.0, avg: 30.4, max: 55.0) [2023-10-08 00:38:16,212][50642] Avg episode reward: [(0, '15.490'), (1, '17.470')] [2023-10-08 00:38:16,442][52059] Updated weights for policy 1, policy_version 19112 (0.0009) [2023-10-08 00:38:16,822][52059] Updated weights for policy 1, policy_version 19122 (0.0009) [2023-10-08 00:38:17,181][52059] Updated weights for policy 1, policy_version 19132 (0.0008) [2023-10-08 00:38:19,220][52060] Updated weights for policy 0, policy_version 18890 (0.0007) [2023-10-08 00:38:19,596][52060] Updated weights for policy 0, policy_version 18900 (0.0007) [2023-10-08 00:38:19,965][52060] Updated weights for policy 0, policy_version 18910 (0.0008) [2023-10-08 00:38:21,010][52059] Updated weights for policy 1, policy_version 19142 (0.0008) [2023-10-08 00:38:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 38961152. Throughput: 0: 1732.1, 1: 1728.1. Samples: 9746362. Policy #0 lag: (min: 23.0, avg: 30.4, max: 55.0) [2023-10-08 00:38:21,211][50642] Avg episode reward: [(0, '16.010'), (1, '18.530')] [2023-10-08 00:38:21,375][52059] Updated weights for policy 1, policy_version 19152 (0.0009) [2023-10-08 00:38:21,748][52059] Updated weights for policy 1, policy_version 19162 (0.0008) [2023-10-08 00:38:24,069][52060] Updated weights for policy 0, policy_version 18920 (0.0008) [2023-10-08 00:38:24,444][52060] Updated weights for policy 0, policy_version 18930 (0.0008) [2023-10-08 00:38:24,813][52060] Updated weights for policy 0, policy_version 18940 (0.0008) [2023-10-08 00:38:25,676][52059] Updated weights for policy 1, policy_version 19172 (0.0008) [2023-10-08 00:38:26,046][52059] Updated weights for policy 1, policy_version 19182 (0.0007) [2023-10-08 00:38:26,210][50642] Fps is (10 sec: 13107.7, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 39026688. Throughput: 0: 1702.9, 1: 1760.7. Samples: 9766662. Policy #0 lag: (min: 23.0, avg: 30.4, max: 55.0) [2023-10-08 00:38:26,211][50642] Avg episode reward: [(0, '16.240'), (1, '19.460')] [2023-10-08 00:38:26,409][52059] Updated weights for policy 1, policy_version 19192 (0.0007) [2023-10-08 00:38:28,600][52060] Updated weights for policy 0, policy_version 18950 (0.0008) [2023-10-08 00:38:28,962][52060] Updated weights for policy 0, policy_version 18960 (0.0009) [2023-10-08 00:38:29,333][52060] Updated weights for policy 0, policy_version 18970 (0.0009) [2023-10-08 00:38:30,337][52059] Updated weights for policy 1, policy_version 19202 (0.0009) [2023-10-08 00:38:30,697][52059] Updated weights for policy 1, policy_version 19212 (0.0008) [2023-10-08 00:38:31,065][52059] Updated weights for policy 1, policy_version 19222 (0.0009) [2023-10-08 00:38:31,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 39092224. Throughput: 0: 1706.8, 1: 1738.0. Samples: 9787284. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) [2023-10-08 00:38:31,211][50642] Avg episode reward: [(0, '15.230'), (1, '17.180')] [2023-10-08 00:38:31,426][52059] Updated weights for policy 1, policy_version 19232 (0.0010) [2023-10-08 00:38:33,375][52060] Updated weights for policy 0, policy_version 18980 (0.0008) [2023-10-08 00:38:33,747][52060] Updated weights for policy 0, policy_version 18990 (0.0008) [2023-10-08 00:38:34,114][52060] Updated weights for policy 0, policy_version 19000 (0.0009) [2023-10-08 00:38:35,470][52059] Updated weights for policy 1, policy_version 19242 (0.0009) [2023-10-08 00:38:35,832][52059] Updated weights for policy 1, policy_version 19252 (0.0008) [2023-10-08 00:38:36,197][52059] Updated weights for policy 1, policy_version 19262 (0.0008) [2023-10-08 00:38:36,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 39157760. Throughput: 0: 1718.8, 1: 1748.2. Samples: 9798244. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) [2023-10-08 00:38:36,211][50642] Avg episode reward: [(0, '15.500'), (1, '18.310')] [2023-10-08 00:38:38,106][52060] Updated weights for policy 0, policy_version 19010 (0.0008) [2023-10-08 00:38:38,480][52060] Updated weights for policy 0, policy_version 19020 (0.0008) [2023-10-08 00:38:38,845][52060] Updated weights for policy 0, policy_version 19030 (0.0008) [2023-10-08 00:38:39,213][52060] Updated weights for policy 0, policy_version 19040 (0.0009) [2023-10-08 00:38:40,016][52059] Updated weights for policy 1, policy_version 19272 (0.0010) [2023-10-08 00:38:40,381][52059] Updated weights for policy 1, policy_version 19282 (0.0008) [2023-10-08 00:38:40,745][52059] Updated weights for policy 1, policy_version 19292 (0.0009) [2023-10-08 00:38:41,210][50642] Fps is (10 sec: 16383.5, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 39256064. Throughput: 0: 1703.5, 1: 1747.0. Samples: 9818634. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) [2023-10-08 00:38:41,211][50642] Avg episode reward: [(0, '16.550'), (1, '19.010')] [2023-10-08 00:38:43,084][52060] Updated weights for policy 0, policy_version 19050 (0.0009) [2023-10-08 00:38:43,444][52060] Updated weights for policy 0, policy_version 19060 (0.0009) [2023-10-08 00:38:43,818][52060] Updated weights for policy 0, policy_version 19070 (0.0009) [2023-10-08 00:38:44,581][52059] Updated weights for policy 1, policy_version 19302 (0.0009) [2023-10-08 00:38:44,941][52059] Updated weights for policy 1, policy_version 19312 (0.0009) [2023-10-08 00:38:45,311][52059] Updated weights for policy 1, policy_version 19322 (0.0009) [2023-10-08 00:38:46,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 39321600. Throughput: 0: 1725.9, 1: 1723.5. Samples: 9839068. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:38:46,211][50642] Avg episode reward: [(0, '15.910'), (1, '16.870')] [2023-10-08 00:38:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000019072_19529728.pth... [2023-10-08 00:38:46,218][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000019328_19791872.pth... [2023-10-08 00:38:46,248][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000017696_18120704.pth [2023-10-08 00:38:46,258][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000017472_17891328.pth [2023-10-08 00:38:47,869][52060] Updated weights for policy 0, policy_version 19080 (0.0008) [2023-10-08 00:38:48,234][52060] Updated weights for policy 0, policy_version 19090 (0.0009) [2023-10-08 00:38:48,612][52060] Updated weights for policy 0, policy_version 19100 (0.0011) [2023-10-08 00:38:49,364][52059] Updated weights for policy 1, policy_version 19332 (0.0009) [2023-10-08 00:38:49,724][52059] Updated weights for policy 1, policy_version 19342 (0.0007) [2023-10-08 00:38:50,081][52059] Updated weights for policy 1, policy_version 19352 (0.0007) [2023-10-08 00:38:51,210][50642] Fps is (10 sec: 13107.7, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 39387136. Throughput: 0: 1694.5, 1: 1755.3. Samples: 9849478. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:38:51,211][50642] Avg episode reward: [(0, '16.960'), (1, '18.590')] [2023-10-08 00:38:52,566][52060] Updated weights for policy 0, policy_version 19110 (0.0009) [2023-10-08 00:38:52,938][52060] Updated weights for policy 0, policy_version 19120 (0.0011) [2023-10-08 00:38:53,311][52060] Updated weights for policy 0, policy_version 19130 (0.0009) [2023-10-08 00:38:53,936][52059] Updated weights for policy 1, policy_version 19362 (0.0008) [2023-10-08 00:38:54,299][52059] Updated weights for policy 1, policy_version 19372 (0.0007) [2023-10-08 00:38:54,661][52059] Updated weights for policy 1, policy_version 19382 (0.0007) [2023-10-08 00:38:55,025][52059] Updated weights for policy 1, policy_version 19392 (0.0007) [2023-10-08 00:38:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 39452672. Throughput: 0: 1712.3, 1: 1735.8. Samples: 9869952. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:38:56,211][50642] Avg episode reward: [(0, '15.090'), (1, '20.960')] [2023-10-08 00:38:56,212][51710] Saving new best policy, reward=20.960! [2023-10-08 00:38:57,166][52060] Updated weights for policy 0, policy_version 19140 (0.0011) [2023-10-08 00:38:57,525][52060] Updated weights for policy 0, policy_version 19150 (0.0009) [2023-10-08 00:38:57,894][52060] Updated weights for policy 0, policy_version 19160 (0.0007) [2023-10-08 00:38:58,996][52059] Updated weights for policy 1, policy_version 19402 (0.0009) [2023-10-08 00:38:59,366][52059] Updated weights for policy 1, policy_version 19412 (0.0010) [2023-10-08 00:38:59,725][52059] Updated weights for policy 1, policy_version 19422 (0.0011) [2023-10-08 00:39:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 39518208. Throughput: 0: 1735.3, 1: 1720.7. Samples: 9890998. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:39:01,211][50642] Avg episode reward: [(0, '14.520'), (1, '17.040')] [2023-10-08 00:39:01,882][52060] Updated weights for policy 0, policy_version 19170 (0.0008) [2023-10-08 00:39:02,282][52060] Updated weights for policy 0, policy_version 19180 (0.0007) [2023-10-08 00:39:02,648][52060] Updated weights for policy 0, policy_version 19190 (0.0008) [2023-10-08 00:39:03,015][52060] Updated weights for policy 0, policy_version 19200 (0.0008) [2023-10-08 00:39:03,675][52059] Updated weights for policy 1, policy_version 19432 (0.0009) [2023-10-08 00:39:04,048][52059] Updated weights for policy 1, policy_version 19442 (0.0008) [2023-10-08 00:39:04,417][52059] Updated weights for policy 1, policy_version 19452 (0.0008) [2023-10-08 00:39:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 39583744. Throughput: 0: 1702.7, 1: 1740.6. Samples: 9901310. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:39:06,211][50642] Avg episode reward: [(0, '16.690'), (1, '14.880')] [2023-10-08 00:39:06,777][52060] Updated weights for policy 0, policy_version 19210 (0.0008) [2023-10-08 00:39:07,148][52060] Updated weights for policy 0, policy_version 19220 (0.0007) [2023-10-08 00:39:07,508][52060] Updated weights for policy 0, policy_version 19230 (0.0008) [2023-10-08 00:39:08,213][52059] Updated weights for policy 1, policy_version 19462 (0.0007) [2023-10-08 00:39:08,572][52059] Updated weights for policy 1, policy_version 19472 (0.0007) [2023-10-08 00:39:08,950][52059] Updated weights for policy 1, policy_version 19482 (0.0009) [2023-10-08 00:39:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 39649280. Throughput: 0: 1733.7, 1: 1722.0. Samples: 9922166. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:39:11,211][50642] Avg episode reward: [(0, '15.590'), (1, '18.510')] [2023-10-08 00:39:11,522][52060] Updated weights for policy 0, policy_version 19240 (0.0008) [2023-10-08 00:39:11,890][52060] Updated weights for policy 0, policy_version 19250 (0.0007) [2023-10-08 00:39:12,259][52060] Updated weights for policy 0, policy_version 19260 (0.0009) [2023-10-08 00:39:12,892][52059] Updated weights for policy 1, policy_version 19492 (0.0009) [2023-10-08 00:39:13,246][52059] Updated weights for policy 1, policy_version 19502 (0.0007) [2023-10-08 00:39:13,626][52059] Updated weights for policy 1, policy_version 19512 (0.0009) [2023-10-08 00:39:16,169][52060] Updated weights for policy 0, policy_version 19270 (0.0008) [2023-10-08 00:39:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 39714816. Throughput: 0: 1735.4, 1: 1733.4. Samples: 9943378. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:39:16,211][50642] Avg episode reward: [(0, '17.290'), (1, '14.900')] [2023-10-08 00:39:16,524][52060] Updated weights for policy 0, policy_version 19280 (0.0007) [2023-10-08 00:39:16,898][52060] Updated weights for policy 0, policy_version 19290 (0.0010) [2023-10-08 00:39:17,119][51605] Saving new best policy, reward=17.290! [2023-10-08 00:39:17,538][52059] Updated weights for policy 1, policy_version 19522 (0.0008) [2023-10-08 00:39:17,902][52059] Updated weights for policy 1, policy_version 19532 (0.0007) [2023-10-08 00:39:18,264][52059] Updated weights for policy 1, policy_version 19542 (0.0008) [2023-10-08 00:39:18,637][52059] Updated weights for policy 1, policy_version 19552 (0.0007) [2023-10-08 00:39:20,945][52060] Updated weights for policy 0, policy_version 19300 (0.0009) [2023-10-08 00:39:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 39780352. Throughput: 0: 1714.0, 1: 1718.4. Samples: 9952700. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:39:21,211][50642] Avg episode reward: [(0, '15.350'), (1, '13.490')] [2023-10-08 00:39:21,316][52060] Updated weights for policy 0, policy_version 19310 (0.0009) [2023-10-08 00:39:21,689][52060] Updated weights for policy 0, policy_version 19320 (0.0010) [2023-10-08 00:39:22,441][52059] Updated weights for policy 1, policy_version 19562 (0.0008) [2023-10-08 00:39:22,807][52059] Updated weights for policy 1, policy_version 19572 (0.0008) [2023-10-08 00:39:23,174][52059] Updated weights for policy 1, policy_version 19582 (0.0009) [2023-10-08 00:39:25,619][52060] Updated weights for policy 0, policy_version 19330 (0.0009) [2023-10-08 00:39:25,992][52060] Updated weights for policy 0, policy_version 19340 (0.0009) [2023-10-08 00:39:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 39845888. Throughput: 0: 1729.1, 1: 1729.5. Samples: 9974272. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:39:26,211][50642] Avg episode reward: [(0, '14.650'), (1, '17.810')] [2023-10-08 00:39:26,363][52060] Updated weights for policy 0, policy_version 19350 (0.0007) [2023-10-08 00:39:26,736][52060] Updated weights for policy 0, policy_version 19360 (0.0008) [2023-10-08 00:39:27,018][52059] Updated weights for policy 1, policy_version 19592 (0.0007) [2023-10-08 00:39:27,389][52059] Updated weights for policy 1, policy_version 19602 (0.0008) [2023-10-08 00:39:27,752][52059] Updated weights for policy 1, policy_version 19612 (0.0007) [2023-10-08 00:39:30,736][52060] Updated weights for policy 0, policy_version 19370 (0.0008) [2023-10-08 00:39:31,104][52060] Updated weights for policy 0, policy_version 19380 (0.0008) [2023-10-08 00:39:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 39911424. Throughput: 0: 1710.4, 1: 1752.3. Samples: 9994890. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:39:31,211][50642] Avg episode reward: [(0, '16.260'), (1, '16.690')] [2023-10-08 00:39:31,471][52060] Updated weights for policy 0, policy_version 19390 (0.0008) [2023-10-08 00:39:31,732][52059] Updated weights for policy 1, policy_version 19622 (0.0010) [2023-10-08 00:39:32,095][52059] Updated weights for policy 1, policy_version 19632 (0.0008) [2023-10-08 00:39:32,463][52059] Updated weights for policy 1, policy_version 19642 (0.0007) [2023-10-08 00:39:35,274][52060] Updated weights for policy 0, policy_version 19400 (0.0007) [2023-10-08 00:39:35,645][52060] Updated weights for policy 0, policy_version 19410 (0.0007) [2023-10-08 00:39:36,020][52060] Updated weights for policy 0, policy_version 19420 (0.0008) [2023-10-08 00:39:36,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 40009728. Throughput: 0: 1732.0, 1: 1729.2. Samples: 10005232. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:39:36,211][50642] Avg episode reward: [(0, '14.270'), (1, '15.150')] [2023-10-08 00:39:36,344][52059] Updated weights for policy 1, policy_version 19652 (0.0007) [2023-10-08 00:39:36,717][52059] Updated weights for policy 1, policy_version 19662 (0.0009) [2023-10-08 00:39:37,071][52059] Updated weights for policy 1, policy_version 19672 (0.0009) [2023-10-08 00:39:39,923][52060] Updated weights for policy 0, policy_version 19430 (0.0008) [2023-10-08 00:39:40,286][52060] Updated weights for policy 0, policy_version 19440 (0.0007) [2023-10-08 00:39:40,660][52060] Updated weights for policy 0, policy_version 19450 (0.0008) [2023-10-08 00:39:41,139][52059] Updated weights for policy 1, policy_version 19682 (0.0009) [2023-10-08 00:39:41,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 40075264. Throughput: 0: 1730.9, 1: 1747.0. Samples: 10026458. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:39:41,211][50642] Avg episode reward: [(0, '15.770'), (1, '17.480')] [2023-10-08 00:39:41,532][52059] Updated weights for policy 1, policy_version 19692 (0.0008) [2023-10-08 00:39:41,895][52059] Updated weights for policy 1, policy_version 19702 (0.0007) [2023-10-08 00:39:42,255][52059] Updated weights for policy 1, policy_version 19712 (0.0007) [2023-10-08 00:39:44,671][52060] Updated weights for policy 0, policy_version 19460 (0.0008) [2023-10-08 00:39:45,040][52060] Updated weights for policy 0, policy_version 19470 (0.0007) [2023-10-08 00:39:45,412][52060] Updated weights for policy 0, policy_version 19480 (0.0008) [2023-10-08 00:39:46,104][52059] Updated weights for policy 1, policy_version 19722 (0.0008) [2023-10-08 00:39:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 40140800. Throughput: 0: 1699.3, 1: 1754.1. Samples: 10046402. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:39:46,211][50642] Avg episode reward: [(0, '15.490'), (1, '16.670')] [2023-10-08 00:39:46,470][52059] Updated weights for policy 1, policy_version 19732 (0.0007) [2023-10-08 00:39:46,836][52059] Updated weights for policy 1, policy_version 19742 (0.0007) [2023-10-08 00:39:49,447][52060] Updated weights for policy 0, policy_version 19490 (0.0008) [2023-10-08 00:39:49,852][52060] Updated weights for policy 0, policy_version 19500 (0.0009) [2023-10-08 00:39:50,222][52060] Updated weights for policy 0, policy_version 19510 (0.0011) [2023-10-08 00:39:50,598][52060] Updated weights for policy 0, policy_version 19520 (0.0009) [2023-10-08 00:39:50,737][52059] Updated weights for policy 1, policy_version 19752 (0.0008) [2023-10-08 00:39:51,100][52059] Updated weights for policy 1, policy_version 19762 (0.0008) [2023-10-08 00:39:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 40206336. Throughput: 0: 1728.8, 1: 1737.2. Samples: 10057282. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) [2023-10-08 00:39:51,211][50642] Avg episode reward: [(0, '15.800'), (1, '14.180')] [2023-10-08 00:39:51,454][52059] Updated weights for policy 1, policy_version 19772 (0.0008) [2023-10-08 00:39:54,460][52060] Updated weights for policy 0, policy_version 19530 (0.0008) [2023-10-08 00:39:54,830][52060] Updated weights for policy 0, policy_version 19540 (0.0007) [2023-10-08 00:39:55,204][52060] Updated weights for policy 0, policy_version 19550 (0.0008) [2023-10-08 00:39:55,284][52059] Updated weights for policy 1, policy_version 19782 (0.0008) [2023-10-08 00:39:55,638][52059] Updated weights for policy 1, policy_version 19792 (0.0008) [2023-10-08 00:39:55,999][52059] Updated weights for policy 1, policy_version 19802 (0.0008) [2023-10-08 00:39:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 40271872. Throughput: 0: 1704.2, 1: 1758.1. Samples: 10077972. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) [2023-10-08 00:39:56,211][50642] Avg episode reward: [(0, '17.060'), (1, '17.360')] [2023-10-08 00:39:59,243][52060] Updated weights for policy 0, policy_version 19560 (0.0008) [2023-10-08 00:39:59,607][52060] Updated weights for policy 0, policy_version 19570 (0.0007) [2023-10-08 00:39:59,978][52060] Updated weights for policy 0, policy_version 19580 (0.0007) [2023-10-08 00:40:00,045][52059] Updated weights for policy 1, policy_version 19812 (0.0010) [2023-10-08 00:40:00,411][52059] Updated weights for policy 1, policy_version 19822 (0.0010) [2023-10-08 00:40:00,776][52059] Updated weights for policy 1, policy_version 19832 (0.0009) [2023-10-08 00:40:01,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 40370176. Throughput: 0: 1696.1, 1: 1731.6. Samples: 10097624. Policy #0 lag: (min: 26.0, avg: 26.0, max: 26.0) [2023-10-08 00:40:01,211][50642] Avg episode reward: [(0, '15.220'), (1, '15.360')] [2023-10-08 00:40:04,110][52060] Updated weights for policy 0, policy_version 19590 (0.0010) [2023-10-08 00:40:04,474][52060] Updated weights for policy 0, policy_version 19600 (0.0010) [2023-10-08 00:40:04,789][52059] Updated weights for policy 1, policy_version 19842 (0.0009) [2023-10-08 00:40:04,858][52060] Updated weights for policy 0, policy_version 19610 (0.0009) [2023-10-08 00:40:05,159][52059] Updated weights for policy 1, policy_version 19852 (0.0009) [2023-10-08 00:40:05,527][52059] Updated weights for policy 1, policy_version 19862 (0.0008) [2023-10-08 00:40:05,896][52059] Updated weights for policy 1, policy_version 19872 (0.0008) [2023-10-08 00:40:06,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 40435712. Throughput: 0: 1728.4, 1: 1748.4. Samples: 10109158. Policy #0 lag: (min: 24.0, avg: 51.4, max: 56.0) [2023-10-08 00:40:06,211][50642] Avg episode reward: [(0, '15.400'), (1, '18.000')] [2023-10-08 00:40:08,887][52060] Updated weights for policy 0, policy_version 19620 (0.0008) [2023-10-08 00:40:09,261][52060] Updated weights for policy 0, policy_version 19630 (0.0008) [2023-10-08 00:40:09,627][52060] Updated weights for policy 0, policy_version 19640 (0.0007) [2023-10-08 00:40:09,734][52059] Updated weights for policy 1, policy_version 19882 (0.0010) [2023-10-08 00:40:10,101][52059] Updated weights for policy 1, policy_version 19892 (0.0009) [2023-10-08 00:40:10,473][52059] Updated weights for policy 1, policy_version 19902 (0.0012) [2023-10-08 00:40:11,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 40501248. Throughput: 0: 1703.1, 1: 1733.1. Samples: 10128900. Policy #0 lag: (min: 24.0, avg: 51.4, max: 56.0) [2023-10-08 00:40:11,211][50642] Avg episode reward: [(0, '16.340'), (1, '17.460')] [2023-10-08 00:40:13,749][52060] Updated weights for policy 0, policy_version 19650 (0.0007) [2023-10-08 00:40:14,117][52060] Updated weights for policy 0, policy_version 19660 (0.0008) [2023-10-08 00:40:14,360][52059] Updated weights for policy 1, policy_version 19912 (0.0008) [2023-10-08 00:40:14,494][52060] Updated weights for policy 0, policy_version 19670 (0.0007) [2023-10-08 00:40:14,722][52059] Updated weights for policy 1, policy_version 19922 (0.0007) [2023-10-08 00:40:14,854][52060] Updated weights for policy 0, policy_version 19680 (0.0007) [2023-10-08 00:40:15,079][52059] Updated weights for policy 1, policy_version 19932 (0.0009) [2023-10-08 00:40:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 40566784. Throughput: 0: 1714.4, 1: 1716.0. Samples: 10149260. Policy #0 lag: (min: 24.0, avg: 51.4, max: 56.0) [2023-10-08 00:40:16,211][50642] Avg episode reward: [(0, '15.380'), (1, '16.020')] [2023-10-08 00:40:18,773][52060] Updated weights for policy 0, policy_version 19690 (0.0007) [2023-10-08 00:40:18,998][52059] Updated weights for policy 1, policy_version 19942 (0.0008) [2023-10-08 00:40:19,149][52060] Updated weights for policy 0, policy_version 19700 (0.0008) [2023-10-08 00:40:19,365][52059] Updated weights for policy 1, policy_version 19952 (0.0008) [2023-10-08 00:40:19,510][52060] Updated weights for policy 0, policy_version 19710 (0.0010) [2023-10-08 00:40:19,727][52059] Updated weights for policy 1, policy_version 19962 (0.0009) [2023-10-08 00:40:21,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 40632320. Throughput: 0: 1715.8, 1: 1740.2. Samples: 10160752. Policy #0 lag: (min: 2.0, avg: 15.2, max: 34.0) [2023-10-08 00:40:21,211][50642] Avg episode reward: [(0, '16.340'), (1, '16.340')] [2023-10-08 00:40:23,485][52060] Updated weights for policy 0, policy_version 19720 (0.0007) [2023-10-08 00:40:23,794][52059] Updated weights for policy 1, policy_version 19972 (0.0007) [2023-10-08 00:40:23,853][52060] Updated weights for policy 0, policy_version 19730 (0.0009) [2023-10-08 00:40:24,161][52059] Updated weights for policy 1, policy_version 19982 (0.0007) [2023-10-08 00:40:24,222][52060] Updated weights for policy 0, policy_version 19740 (0.0007) [2023-10-08 00:40:24,524][52059] Updated weights for policy 1, policy_version 19992 (0.0007) [2023-10-08 00:40:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 40697856. Throughput: 0: 1694.1, 1: 1716.6. Samples: 10179938. Policy #0 lag: (min: 2.0, avg: 15.2, max: 34.0) [2023-10-08 00:40:26,211][50642] Avg episode reward: [(0, '16.010'), (1, '19.590')] [2023-10-08 00:40:28,042][52060] Updated weights for policy 0, policy_version 19750 (0.0009) [2023-10-08 00:40:28,404][52060] Updated weights for policy 0, policy_version 19760 (0.0008) [2023-10-08 00:40:28,538][52059] Updated weights for policy 1, policy_version 20002 (0.0007) [2023-10-08 00:40:28,773][52060] Updated weights for policy 0, policy_version 19770 (0.0008) [2023-10-08 00:40:28,941][52059] Updated weights for policy 1, policy_version 20012 (0.0009) [2023-10-08 00:40:29,310][52059] Updated weights for policy 1, policy_version 20022 (0.0007) [2023-10-08 00:40:29,671][52059] Updated weights for policy 1, policy_version 20032 (0.0007) [2023-10-08 00:40:31,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 40763392. Throughput: 0: 1723.5, 1: 1717.4. Samples: 10201242. Policy #0 lag: (min: 2.0, avg: 15.2, max: 34.0) [2023-10-08 00:40:31,211][50642] Avg episode reward: [(0, '15.310'), (1, '15.550')] [2023-10-08 00:40:32,748][52060] Updated weights for policy 0, policy_version 19780 (0.0009) [2023-10-08 00:40:33,126][52060] Updated weights for policy 0, policy_version 19790 (0.0007) [2023-10-08 00:40:33,493][52060] Updated weights for policy 0, policy_version 19800 (0.0007) [2023-10-08 00:40:33,528][52059] Updated weights for policy 1, policy_version 20042 (0.0007) [2023-10-08 00:40:33,890][52059] Updated weights for policy 1, policy_version 20052 (0.0008) [2023-10-08 00:40:34,256][52059] Updated weights for policy 1, policy_version 20062 (0.0009) [2023-10-08 00:40:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 40828928. Throughput: 0: 1691.6, 1: 1724.2. Samples: 10210996. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 00:40:36,211][50642] Avg episode reward: [(0, '16.630'), (1, '15.930')] [2023-10-08 00:40:37,577][52060] Updated weights for policy 0, policy_version 19810 (0.0009) [2023-10-08 00:40:37,984][52060] Updated weights for policy 0, policy_version 19820 (0.0007) [2023-10-08 00:40:38,206][52059] Updated weights for policy 1, policy_version 20072 (0.0010) [2023-10-08 00:40:38,353][52060] Updated weights for policy 0, policy_version 19830 (0.0007) [2023-10-08 00:40:38,568][52059] Updated weights for policy 1, policy_version 20082 (0.0010) [2023-10-08 00:40:38,723][52060] Updated weights for policy 0, policy_version 19840 (0.0008) [2023-10-08 00:40:38,932][52059] Updated weights for policy 1, policy_version 20092 (0.0009) [2023-10-08 00:40:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 40894464. Throughput: 0: 1703.4, 1: 1707.3. Samples: 10231454. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 00:40:41,211][50642] Avg episode reward: [(0, '15.060'), (1, '18.010')] [2023-10-08 00:40:42,594][52060] Updated weights for policy 0, policy_version 19850 (0.0009) [2023-10-08 00:40:42,957][52059] Updated weights for policy 1, policy_version 20102 (0.0008) [2023-10-08 00:40:42,961][52060] Updated weights for policy 0, policy_version 19860 (0.0007) [2023-10-08 00:40:43,322][52059] Updated weights for policy 1, policy_version 20112 (0.0008) [2023-10-08 00:40:43,326][52060] Updated weights for policy 0, policy_version 19870 (0.0007) [2023-10-08 00:40:43,699][52059] Updated weights for policy 1, policy_version 20122 (0.0007) [2023-10-08 00:40:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 40960000. Throughput: 0: 1715.3, 1: 1729.5. Samples: 10252640. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 00:40:46,211][50642] Avg episode reward: [(0, '15.700'), (1, '18.000')] [2023-10-08 00:40:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000019872_20348928.pth... [2023-10-08 00:40:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000020128_20611072.pth... [2023-10-08 00:40:46,249][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000018272_18710528.pth [2023-10-08 00:40:46,251][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000018496_18939904.pth [2023-10-08 00:40:47,287][52060] Updated weights for policy 0, policy_version 19880 (0.0010) [2023-10-08 00:40:47,507][52059] Updated weights for policy 1, policy_version 20132 (0.0009) [2023-10-08 00:40:47,650][52060] Updated weights for policy 0, policy_version 19890 (0.0009) [2023-10-08 00:40:47,868][52059] Updated weights for policy 1, policy_version 20142 (0.0007) [2023-10-08 00:40:48,011][52060] Updated weights for policy 0, policy_version 19900 (0.0008) [2023-10-08 00:40:48,234][52059] Updated weights for policy 1, policy_version 20152 (0.0008) [2023-10-08 00:40:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 41025536. Throughput: 0: 1686.0, 1: 1713.6. Samples: 10262138. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 00:40:51,211][50642] Avg episode reward: [(0, '15.960'), (1, '15.200')] [2023-10-08 00:40:52,011][52060] Updated weights for policy 0, policy_version 19910 (0.0010) [2023-10-08 00:40:52,339][52059] Updated weights for policy 1, policy_version 20162 (0.0009) [2023-10-08 00:40:52,375][52060] Updated weights for policy 0, policy_version 19920 (0.0010) [2023-10-08 00:40:52,709][52059] Updated weights for policy 1, policy_version 20172 (0.0007) [2023-10-08 00:40:52,744][52060] Updated weights for policy 0, policy_version 19930 (0.0008) [2023-10-08 00:40:53,070][52059] Updated weights for policy 1, policy_version 20182 (0.0009) [2023-10-08 00:40:53,435][52059] Updated weights for policy 1, policy_version 20192 (0.0009) [2023-10-08 00:40:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 41091072. Throughput: 0: 1709.6, 1: 1724.9. Samples: 10283450. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:40:56,211][50642] Avg episode reward: [(0, '15.450'), (1, '19.110')] [2023-10-08 00:40:56,796][52060] Updated weights for policy 0, policy_version 19940 (0.0008) [2023-10-08 00:40:57,158][52060] Updated weights for policy 0, policy_version 19950 (0.0007) [2023-10-08 00:40:57,323][52059] Updated weights for policy 1, policy_version 20202 (0.0007) [2023-10-08 00:40:57,534][52060] Updated weights for policy 0, policy_version 19960 (0.0008) [2023-10-08 00:40:57,695][52059] Updated weights for policy 1, policy_version 20212 (0.0007) [2023-10-08 00:40:58,062][52059] Updated weights for policy 1, policy_version 20222 (0.0009) [2023-10-08 00:41:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 41156608. Throughput: 0: 1714.4, 1: 1745.9. Samples: 10304974. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:41:01,211][50642] Avg episode reward: [(0, '17.500'), (1, '17.050')] [2023-10-08 00:41:01,223][51605] Saving new best policy, reward=17.500! [2023-10-08 00:41:01,487][52060] Updated weights for policy 0, policy_version 19970 (0.0008) [2023-10-08 00:41:01,852][52060] Updated weights for policy 0, policy_version 19980 (0.0009) [2023-10-08 00:41:01,943][52059] Updated weights for policy 1, policy_version 20232 (0.0008) [2023-10-08 00:41:02,222][52060] Updated weights for policy 0, policy_version 19990 (0.0008) [2023-10-08 00:41:02,314][52059] Updated weights for policy 1, policy_version 20242 (0.0009) [2023-10-08 00:41:02,593][52060] Updated weights for policy 0, policy_version 20000 (0.0007) [2023-10-08 00:41:02,686][52059] Updated weights for policy 1, policy_version 20252 (0.0009) [2023-10-08 00:41:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 41222144. Throughput: 0: 1695.6, 1: 1717.4. Samples: 10314334. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:41:06,211][50642] Avg episode reward: [(0, '15.680'), (1, '16.570')] [2023-10-08 00:41:06,471][52060] Updated weights for policy 0, policy_version 20010 (0.0007) [2023-10-08 00:41:06,520][52059] Updated weights for policy 1, policy_version 20262 (0.0008) [2023-10-08 00:41:06,848][52060] Updated weights for policy 0, policy_version 20020 (0.0008) [2023-10-08 00:41:06,889][52059] Updated weights for policy 1, policy_version 20272 (0.0007) [2023-10-08 00:41:07,210][52060] Updated weights for policy 0, policy_version 20030 (0.0008) [2023-10-08 00:41:07,256][52059] Updated weights for policy 1, policy_version 20282 (0.0009) [2023-10-08 00:41:11,000][52059] Updated weights for policy 1, policy_version 20292 (0.0010) [2023-10-08 00:41:11,200][52060] Updated weights for policy 0, policy_version 20040 (0.0008) [2023-10-08 00:41:11,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 41287680. Throughput: 0: 1717.8, 1: 1747.9. Samples: 10335894. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:41:11,211][50642] Avg episode reward: [(0, '16.050'), (1, '19.280')] [2023-10-08 00:41:11,364][52059] Updated weights for policy 1, policy_version 20302 (0.0007) [2023-10-08 00:41:11,573][52060] Updated weights for policy 0, policy_version 20050 (0.0009) [2023-10-08 00:41:11,723][52059] Updated weights for policy 1, policy_version 20312 (0.0008) [2023-10-08 00:41:11,945][52060] Updated weights for policy 0, policy_version 20060 (0.0007) [2023-10-08 00:41:15,718][52059] Updated weights for policy 1, policy_version 20322 (0.0008) [2023-10-08 00:41:15,912][52060] Updated weights for policy 0, policy_version 20070 (0.0008) [2023-10-08 00:41:16,135][52059] Updated weights for policy 1, policy_version 20332 (0.0009) [2023-10-08 00:41:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 41353216. Throughput: 0: 1708.4, 1: 1745.2. Samples: 10356658. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 00:41:16,211][50642] Avg episode reward: [(0, '15.450'), (1, '16.570')] [2023-10-08 00:41:16,285][52060] Updated weights for policy 0, policy_version 20080 (0.0007) [2023-10-08 00:41:16,501][52059] Updated weights for policy 1, policy_version 20342 (0.0007) [2023-10-08 00:41:16,647][52060] Updated weights for policy 0, policy_version 20090 (0.0009) [2023-10-08 00:41:16,864][52059] Updated weights for policy 1, policy_version 20352 (0.0008) [2023-10-08 00:41:20,746][52060] Updated weights for policy 0, policy_version 20100 (0.0009) [2023-10-08 00:41:20,823][52059] Updated weights for policy 1, policy_version 20362 (0.0008) [2023-10-08 00:41:21,116][52060] Updated weights for policy 0, policy_version 20110 (0.0008) [2023-10-08 00:41:21,187][52059] Updated weights for policy 1, policy_version 20372 (0.0007) [2023-10-08 00:41:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 41418752. Throughput: 0: 1714.9, 1: 1737.7. Samples: 10366362. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 00:41:21,211][50642] Avg episode reward: [(0, '14.730'), (1, '15.220')] [2023-10-08 00:41:21,489][52060] Updated weights for policy 0, policy_version 20120 (0.0010) [2023-10-08 00:41:21,548][52059] Updated weights for policy 1, policy_version 20382 (0.0007) [2023-10-08 00:41:25,529][52059] Updated weights for policy 1, policy_version 20392 (0.0009) [2023-10-08 00:41:25,766][52060] Updated weights for policy 0, policy_version 20130 (0.0009) [2023-10-08 00:41:25,901][52059] Updated weights for policy 1, policy_version 20402 (0.0008) [2023-10-08 00:41:26,180][52060] Updated weights for policy 0, policy_version 20140 (0.0008) [2023-10-08 00:41:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 41484288. Throughput: 0: 1720.6, 1: 1746.2. Samples: 10387460. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 00:41:26,211][50642] Avg episode reward: [(0, '16.220'), (1, '18.880')] [2023-10-08 00:41:26,266][52059] Updated weights for policy 1, policy_version 20412 (0.0008) [2023-10-08 00:41:26,550][52060] Updated weights for policy 0, policy_version 20150 (0.0008) [2023-10-08 00:41:26,914][52060] Updated weights for policy 0, policy_version 20160 (0.0010) [2023-10-08 00:41:30,205][52059] Updated weights for policy 1, policy_version 20422 (0.0009) [2023-10-08 00:41:30,574][52059] Updated weights for policy 1, policy_version 20432 (0.0011) [2023-10-08 00:41:30,886][52060] Updated weights for policy 0, policy_version 20170 (0.0008) [2023-10-08 00:41:30,928][52059] Updated weights for policy 1, policy_version 20442 (0.0008) [2023-10-08 00:41:31,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 41582592. Throughput: 0: 1704.1, 1: 1731.0. Samples: 10407220. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 00:41:31,211][50642] Avg episode reward: [(0, '15.820'), (1, '18.970')] [2023-10-08 00:41:31,258][52060] Updated weights for policy 0, policy_version 20180 (0.0008) [2023-10-08 00:41:31,631][52060] Updated weights for policy 0, policy_version 20190 (0.0009) [2023-10-08 00:41:34,832][52059] Updated weights for policy 1, policy_version 20452 (0.0009) [2023-10-08 00:41:35,187][52059] Updated weights for policy 1, policy_version 20462 (0.0009) [2023-10-08 00:41:35,515][52060] Updated weights for policy 0, policy_version 20200 (0.0008) [2023-10-08 00:41:35,550][52059] Updated weights for policy 1, policy_version 20472 (0.0009) [2023-10-08 00:41:35,890][52060] Updated weights for policy 0, policy_version 20210 (0.0009) [2023-10-08 00:41:36,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 41648128. Throughput: 0: 1710.9, 1: 1749.5. Samples: 10417854. Policy #0 lag: (min: 10.0, avg: 14.1, max: 42.0) [2023-10-08 00:41:36,211][50642] Avg episode reward: [(0, '15.280'), (1, '15.460')] [2023-10-08 00:41:36,262][52060] Updated weights for policy 0, policy_version 20220 (0.0009) [2023-10-08 00:41:39,487][52059] Updated weights for policy 1, policy_version 20482 (0.0007) [2023-10-08 00:41:39,851][52059] Updated weights for policy 1, policy_version 20492 (0.0010) [2023-10-08 00:41:40,105][52060] Updated weights for policy 0, policy_version 20230 (0.0010) [2023-10-08 00:41:40,213][52059] Updated weights for policy 1, policy_version 20502 (0.0007) [2023-10-08 00:41:40,466][52060] Updated weights for policy 0, policy_version 20240 (0.0008) [2023-10-08 00:41:40,586][52059] Updated weights for policy 1, policy_version 20512 (0.0008) [2023-10-08 00:41:40,839][52060] Updated weights for policy 0, policy_version 20250 (0.0011) [2023-10-08 00:41:41,210][50642] Fps is (10 sec: 16384.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 41746432. Throughput: 0: 1717.2, 1: 1736.7. Samples: 10438876. Policy #0 lag: (min: 10.0, avg: 14.1, max: 42.0) [2023-10-08 00:41:41,211][50642] Avg episode reward: [(0, '15.740'), (1, '18.010')] [2023-10-08 00:41:44,578][52059] Updated weights for policy 1, policy_version 20522 (0.0010) [2023-10-08 00:41:44,883][52060] Updated weights for policy 0, policy_version 20260 (0.0008) [2023-10-08 00:41:44,944][52059] Updated weights for policy 1, policy_version 20532 (0.0009) [2023-10-08 00:41:45,252][52060] Updated weights for policy 0, policy_version 20270 (0.0008) [2023-10-08 00:41:45,305][52059] Updated weights for policy 1, policy_version 20542 (0.0007) [2023-10-08 00:41:45,619][52060] Updated weights for policy 0, policy_version 20280 (0.0008) [2023-10-08 00:41:46,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 41811968. Throughput: 0: 1687.4, 1: 1713.1. Samples: 10457996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 00:41:46,211][50642] Avg episode reward: [(0, '15.750'), (1, '20.840')] [2023-10-08 00:41:49,251][52059] Updated weights for policy 1, policy_version 20552 (0.0008) [2023-10-08 00:41:49,426][52060] Updated weights for policy 0, policy_version 20290 (0.0009) [2023-10-08 00:41:49,626][52059] Updated weights for policy 1, policy_version 20562 (0.0008) [2023-10-08 00:41:49,797][52060] Updated weights for policy 0, policy_version 20300 (0.0008) [2023-10-08 00:41:49,990][52059] Updated weights for policy 1, policy_version 20572 (0.0008) [2023-10-08 00:41:50,157][52060] Updated weights for policy 0, policy_version 20310 (0.0007) [2023-10-08 00:41:50,532][52060] Updated weights for policy 0, policy_version 20320 (0.0010) [2023-10-08 00:41:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 41877504. Throughput: 0: 1714.6, 1: 1742.5. Samples: 10469902. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 00:41:51,211][50642] Avg episode reward: [(0, '16.480'), (1, '15.660')] [2023-10-08 00:41:53,797][52059] Updated weights for policy 1, policy_version 20582 (0.0009) [2023-10-08 00:41:54,165][52059] Updated weights for policy 1, policy_version 20592 (0.0010) [2023-10-08 00:41:54,518][52060] Updated weights for policy 0, policy_version 20330 (0.0007) [2023-10-08 00:41:54,529][52059] Updated weights for policy 1, policy_version 20602 (0.0008) [2023-10-08 00:41:54,892][52060] Updated weights for policy 0, policy_version 20340 (0.0010) [2023-10-08 00:41:55,260][52060] Updated weights for policy 0, policy_version 20350 (0.0008) [2023-10-08 00:41:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 41943040. Throughput: 0: 1696.2, 1: 1714.4. Samples: 10489374. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 00:41:56,211][50642] Avg episode reward: [(0, '16.160'), (1, '18.100')] [2023-10-08 00:41:58,512][52059] Updated weights for policy 1, policy_version 20612 (0.0008) [2023-10-08 00:41:58,880][52059] Updated weights for policy 1, policy_version 20622 (0.0009) [2023-10-08 00:41:59,243][52059] Updated weights for policy 1, policy_version 20632 (0.0009) [2023-10-08 00:41:59,312][52060] Updated weights for policy 0, policy_version 20360 (0.0008) [2023-10-08 00:41:59,679][52060] Updated weights for policy 0, policy_version 20370 (0.0007) [2023-10-08 00:42:00,045][52060] Updated weights for policy 0, policy_version 20380 (0.0008) [2023-10-08 00:42:01,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 42008576. Throughput: 0: 1691.1, 1: 1718.3. Samples: 10510082. Policy #0 lag: (min: 18.0, avg: 25.0, max: 50.0) [2023-10-08 00:42:01,211][50642] Avg episode reward: [(0, '16.120'), (1, '18.710')] [2023-10-08 00:42:03,099][52059] Updated weights for policy 1, policy_version 20642 (0.0008) [2023-10-08 00:42:03,511][52059] Updated weights for policy 1, policy_version 20652 (0.0009) [2023-10-08 00:42:03,881][52059] Updated weights for policy 1, policy_version 20662 (0.0008) [2023-10-08 00:42:04,089][52060] Updated weights for policy 0, policy_version 20390 (0.0010) [2023-10-08 00:42:04,253][52059] Updated weights for policy 1, policy_version 20672 (0.0007) [2023-10-08 00:42:04,456][52060] Updated weights for policy 0, policy_version 20400 (0.0007) [2023-10-08 00:42:04,827][52060] Updated weights for policy 0, policy_version 20410 (0.0007) [2023-10-08 00:42:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 42074112. Throughput: 0: 1715.5, 1: 1727.9. Samples: 10521314. Policy #0 lag: (min: 18.0, avg: 25.0, max: 50.0) [2023-10-08 00:42:06,211][50642] Avg episode reward: [(0, '16.260'), (1, '15.590')] [2023-10-08 00:42:08,170][52059] Updated weights for policy 1, policy_version 20682 (0.0009) [2023-10-08 00:42:08,534][52059] Updated weights for policy 1, policy_version 20692 (0.0009) [2023-10-08 00:42:08,783][52060] Updated weights for policy 0, policy_version 20420 (0.0007) [2023-10-08 00:42:08,901][52059] Updated weights for policy 1, policy_version 20702 (0.0007) [2023-10-08 00:42:09,157][52060] Updated weights for policy 0, policy_version 20430 (0.0010) [2023-10-08 00:42:09,523][52060] Updated weights for policy 0, policy_version 20440 (0.0010) [2023-10-08 00:42:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 42139648. Throughput: 0: 1693.3, 1: 1719.3. Samples: 10541030. Policy #0 lag: (min: 18.0, avg: 25.0, max: 50.0) [2023-10-08 00:42:11,211][50642] Avg episode reward: [(0, '15.480'), (1, '17.960')] [2023-10-08 00:42:12,911][52059] Updated weights for policy 1, policy_version 20712 (0.0011) [2023-10-08 00:42:13,263][52059] Updated weights for policy 1, policy_version 20722 (0.0009) [2023-10-08 00:42:13,590][52060] Updated weights for policy 0, policy_version 20450 (0.0007) [2023-10-08 00:42:13,629][52059] Updated weights for policy 1, policy_version 20732 (0.0009) [2023-10-08 00:42:13,985][52060] Updated weights for policy 0, policy_version 20460 (0.0007) [2023-10-08 00:42:14,353][52060] Updated weights for policy 0, policy_version 20470 (0.0007) [2023-10-08 00:42:14,720][52060] Updated weights for policy 0, policy_version 20480 (0.0007) [2023-10-08 00:42:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 42205184. Throughput: 0: 1699.4, 1: 1733.1. Samples: 10561682. Policy #0 lag: (min: 18.0, avg: 25.0, max: 50.0) [2023-10-08 00:42:16,211][50642] Avg episode reward: [(0, '16.860'), (1, '18.280')] [2023-10-08 00:42:17,714][52059] Updated weights for policy 1, policy_version 20742 (0.0008) [2023-10-08 00:42:18,083][52059] Updated weights for policy 1, policy_version 20752 (0.0009) [2023-10-08 00:42:18,446][52059] Updated weights for policy 1, policy_version 20762 (0.0008) [2023-10-08 00:42:18,680][52060] Updated weights for policy 0, policy_version 20490 (0.0007) [2023-10-08 00:42:19,044][52060] Updated weights for policy 0, policy_version 20500 (0.0007) [2023-10-08 00:42:19,416][52060] Updated weights for policy 0, policy_version 20510 (0.0008) [2023-10-08 00:42:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 42270720. Throughput: 0: 1711.5, 1: 1709.1. Samples: 10571778. Policy #0 lag: (min: 30.0, avg: 38.0, max: 62.0) [2023-10-08 00:42:21,211][50642] Avg episode reward: [(0, '15.850'), (1, '16.440')] [2023-10-08 00:42:22,496][52059] Updated weights for policy 1, policy_version 20772 (0.0009) [2023-10-08 00:42:22,859][52059] Updated weights for policy 1, policy_version 20782 (0.0009) [2023-10-08 00:42:23,220][52059] Updated weights for policy 1, policy_version 20792 (0.0009) [2023-10-08 00:42:23,435][52060] Updated weights for policy 0, policy_version 20520 (0.0008) [2023-10-08 00:42:23,814][52060] Updated weights for policy 0, policy_version 20530 (0.0008) [2023-10-08 00:42:24,184][52060] Updated weights for policy 0, policy_version 20540 (0.0008) [2023-10-08 00:42:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 42336256. Throughput: 0: 1689.8, 1: 1713.0. Samples: 10592000. Policy #0 lag: (min: 30.0, avg: 38.0, max: 62.0) [2023-10-08 00:42:26,211][50642] Avg episode reward: [(0, '15.790'), (1, '18.180')] [2023-10-08 00:42:27,216][52059] Updated weights for policy 1, policy_version 20802 (0.0008) [2023-10-08 00:42:27,591][52059] Updated weights for policy 1, policy_version 20812 (0.0009) [2023-10-08 00:42:27,949][52059] Updated weights for policy 1, policy_version 20822 (0.0007) [2023-10-08 00:42:28,186][52060] Updated weights for policy 0, policy_version 20550 (0.0007) [2023-10-08 00:42:28,321][52059] Updated weights for policy 1, policy_version 20832 (0.0007) [2023-10-08 00:42:28,565][52060] Updated weights for policy 0, policy_version 20560 (0.0007) [2023-10-08 00:42:28,936][52060] Updated weights for policy 0, policy_version 20570 (0.0007) [2023-10-08 00:42:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 42401792. Throughput: 0: 1718.3, 1: 1732.9. Samples: 10613298. Policy #0 lag: (min: 30.0, avg: 38.0, max: 62.0) [2023-10-08 00:42:31,211][50642] Avg episode reward: [(0, '16.470'), (1, '20.430')] [2023-10-08 00:42:32,213][52059] Updated weights for policy 1, policy_version 20842 (0.0007) [2023-10-08 00:42:32,584][52059] Updated weights for policy 1, policy_version 20852 (0.0007) [2023-10-08 00:42:32,813][52060] Updated weights for policy 0, policy_version 20580 (0.0007) [2023-10-08 00:42:32,944][52059] Updated weights for policy 1, policy_version 20862 (0.0007) [2023-10-08 00:42:33,170][52060] Updated weights for policy 0, policy_version 20590 (0.0007) [2023-10-08 00:42:33,539][52060] Updated weights for policy 0, policy_version 20600 (0.0008) [2023-10-08 00:42:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 42467328. Throughput: 0: 1692.5, 1: 1706.3. Samples: 10622846. Policy #0 lag: (min: 30.0, avg: 38.0, max: 62.0) [2023-10-08 00:42:36,211][50642] Avg episode reward: [(0, '15.770'), (1, '16.520')] [2023-10-08 00:42:37,005][52059] Updated weights for policy 1, policy_version 20872 (0.0009) [2023-10-08 00:42:37,381][52059] Updated weights for policy 1, policy_version 20882 (0.0009) [2023-10-08 00:42:37,459][52060] Updated weights for policy 0, policy_version 20610 (0.0008) [2023-10-08 00:42:37,744][52059] Updated weights for policy 1, policy_version 20892 (0.0008) [2023-10-08 00:42:37,834][52060] Updated weights for policy 0, policy_version 20620 (0.0007) [2023-10-08 00:42:38,199][52060] Updated weights for policy 0, policy_version 20630 (0.0009) [2023-10-08 00:42:38,560][52060] Updated weights for policy 0, policy_version 20640 (0.0009) [2023-10-08 00:42:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13773.7). Total num frames: 42532864. Throughput: 0: 1709.9, 1: 1723.8. Samples: 10643892. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 00:42:41,211][50642] Avg episode reward: [(0, '17.870'), (1, '17.740')] [2023-10-08 00:42:41,213][51605] Saving new best policy, reward=17.870! [2023-10-08 00:42:41,720][52059] Updated weights for policy 1, policy_version 20902 (0.0008) [2023-10-08 00:42:42,089][52059] Updated weights for policy 1, policy_version 20912 (0.0008) [2023-10-08 00:42:42,449][52059] Updated weights for policy 1, policy_version 20922 (0.0007) [2023-10-08 00:42:42,651][52060] Updated weights for policy 0, policy_version 20650 (0.0007) [2023-10-08 00:42:43,023][52060] Updated weights for policy 0, policy_version 20660 (0.0010) [2023-10-08 00:42:43,396][52060] Updated weights for policy 0, policy_version 20670 (0.0010) [2023-10-08 00:42:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 42598400. Throughput: 0: 1718.8, 1: 1728.8. Samples: 10665228. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 00:42:46,211][50642] Avg episode reward: [(0, '15.980'), (1, '20.710')] [2023-10-08 00:42:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000020672_21168128.pth... [2023-10-08 00:42:46,255][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000019072_19529728.pth [2023-10-08 00:42:46,335][52059] Updated weights for policy 1, policy_version 20932 (0.0009) [2023-10-08 00:42:46,695][52059] Updated weights for policy 1, policy_version 20942 (0.0009) [2023-10-08 00:42:47,063][52059] Updated weights for policy 1, policy_version 20952 (0.0008) [2023-10-08 00:42:47,343][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000020960_21463040.pth... [2023-10-08 00:42:47,369][52060] Updated weights for policy 0, policy_version 20680 (0.0008) [2023-10-08 00:42:47,372][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000019328_19791872.pth [2023-10-08 00:42:47,738][52060] Updated weights for policy 0, policy_version 20690 (0.0007) [2023-10-08 00:42:48,101][52060] Updated weights for policy 0, policy_version 20700 (0.0008) [2023-10-08 00:42:51,058][52059] Updated weights for policy 1, policy_version 20962 (0.0007) [2023-10-08 00:42:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 42663936. Throughput: 0: 1687.0, 1: 1716.3. Samples: 10674464. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 00:42:51,211][50642] Avg episode reward: [(0, '16.380'), (1, '17.340')] [2023-10-08 00:42:51,451][52059] Updated weights for policy 1, policy_version 20972 (0.0007) [2023-10-08 00:42:51,812][52059] Updated weights for policy 1, policy_version 20982 (0.0008) [2023-10-08 00:42:52,065][52060] Updated weights for policy 0, policy_version 20710 (0.0010) [2023-10-08 00:42:52,173][52059] Updated weights for policy 1, policy_version 20992 (0.0009) [2023-10-08 00:42:52,425][52060] Updated weights for policy 0, policy_version 20720 (0.0009) [2023-10-08 00:42:52,802][52060] Updated weights for policy 0, policy_version 20730 (0.0010) [2023-10-08 00:42:55,990][52059] Updated weights for policy 1, policy_version 21002 (0.0009) [2023-10-08 00:42:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 42729472. Throughput: 0: 1711.7, 1: 1722.1. Samples: 10695552. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 00:42:56,211][50642] Avg episode reward: [(0, '17.540'), (1, '17.230')] [2023-10-08 00:42:56,358][52059] Updated weights for policy 1, policy_version 21012 (0.0008) [2023-10-08 00:42:56,684][52060] Updated weights for policy 0, policy_version 20740 (0.0007) [2023-10-08 00:42:56,726][52059] Updated weights for policy 1, policy_version 21022 (0.0010) [2023-10-08 00:42:57,056][52060] Updated weights for policy 0, policy_version 20750 (0.0007) [2023-10-08 00:42:57,428][52060] Updated weights for policy 0, policy_version 20760 (0.0009) [2023-10-08 00:43:00,614][52059] Updated weights for policy 1, policy_version 21032 (0.0011) [2023-10-08 00:43:00,980][52059] Updated weights for policy 1, policy_version 21042 (0.0009) [2023-10-08 00:43:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 42795008. Throughput: 0: 1725.0, 1: 1716.0. Samples: 10716530. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:43:01,211][50642] Avg episode reward: [(0, '15.420'), (1, '19.810')] [2023-10-08 00:43:01,338][52059] Updated weights for policy 1, policy_version 21052 (0.0010) [2023-10-08 00:43:01,390][52060] Updated weights for policy 0, policy_version 20770 (0.0009) [2023-10-08 00:43:01,796][52060] Updated weights for policy 0, policy_version 20780 (0.0009) [2023-10-08 00:43:02,165][52060] Updated weights for policy 0, policy_version 20790 (0.0011) [2023-10-08 00:43:02,532][52060] Updated weights for policy 0, policy_version 20800 (0.0008) [2023-10-08 00:43:05,220][52059] Updated weights for policy 1, policy_version 21062 (0.0007) [2023-10-08 00:43:05,581][52059] Updated weights for policy 1, policy_version 21072 (0.0008) [2023-10-08 00:43:05,949][52059] Updated weights for policy 1, policy_version 21082 (0.0009) [2023-10-08 00:43:06,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 42893312. Throughput: 0: 1700.8, 1: 1734.0. Samples: 10726344. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:43:06,211][50642] Avg episode reward: [(0, '15.910'), (1, '18.970')] [2023-10-08 00:43:06,387][52060] Updated weights for policy 0, policy_version 20810 (0.0009) [2023-10-08 00:43:06,751][52060] Updated weights for policy 0, policy_version 20820 (0.0009) [2023-10-08 00:43:07,131][52060] Updated weights for policy 0, policy_version 20830 (0.0008) [2023-10-08 00:43:09,801][52059] Updated weights for policy 1, policy_version 21092 (0.0009) [2023-10-08 00:43:10,164][52059] Updated weights for policy 1, policy_version 21102 (0.0009) [2023-10-08 00:43:10,522][52059] Updated weights for policy 1, policy_version 21112 (0.0009) [2023-10-08 00:43:11,059][52060] Updated weights for policy 0, policy_version 20840 (0.0010) [2023-10-08 00:43:11,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 42958848. Throughput: 0: 1720.5, 1: 1737.1. Samples: 10747594. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:43:11,211][50642] Avg episode reward: [(0, '16.490'), (1, '16.340')] [2023-10-08 00:43:11,424][52060] Updated weights for policy 0, policy_version 20850 (0.0008) [2023-10-08 00:43:11,793][52060] Updated weights for policy 0, policy_version 20860 (0.0008) [2023-10-08 00:43:14,487][52059] Updated weights for policy 1, policy_version 21122 (0.0009) [2023-10-08 00:43:14,863][52059] Updated weights for policy 1, policy_version 21132 (0.0010) [2023-10-08 00:43:15,226][52059] Updated weights for policy 1, policy_version 21142 (0.0011) [2023-10-08 00:43:15,589][52059] Updated weights for policy 1, policy_version 21152 (0.0008) [2023-10-08 00:43:15,721][52060] Updated weights for policy 0, policy_version 20870 (0.0008) [2023-10-08 00:43:16,089][52060] Updated weights for policy 0, policy_version 20880 (0.0008) [2023-10-08 00:43:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 43024384. Throughput: 0: 1715.9, 1: 1712.2. Samples: 10767562. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:43:16,211][50642] Avg episode reward: [(0, '15.910'), (1, '18.110')] [2023-10-08 00:43:16,467][52060] Updated weights for policy 0, policy_version 20890 (0.0009) [2023-10-08 00:43:19,515][52059] Updated weights for policy 1, policy_version 21162 (0.0007) [2023-10-08 00:43:19,880][52059] Updated weights for policy 1, policy_version 21172 (0.0007) [2023-10-08 00:43:20,238][52059] Updated weights for policy 1, policy_version 21182 (0.0008) [2023-10-08 00:43:20,560][52060] Updated weights for policy 0, policy_version 20900 (0.0008) [2023-10-08 00:43:20,923][52060] Updated weights for policy 0, policy_version 20910 (0.0008) [2023-10-08 00:43:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 43089920. Throughput: 0: 1719.4, 1: 1742.4. Samples: 10778628. Policy #0 lag: (min: 3.0, avg: 8.4, max: 35.0) [2023-10-08 00:43:21,211][50642] Avg episode reward: [(0, '18.230'), (1, '18.020')] [2023-10-08 00:43:21,296][52060] Updated weights for policy 0, policy_version 20920 (0.0007) [2023-10-08 00:43:21,587][51605] Saving new best policy, reward=18.230! [2023-10-08 00:43:24,370][52059] Updated weights for policy 1, policy_version 21192 (0.0007) [2023-10-08 00:43:24,736][52059] Updated weights for policy 1, policy_version 21202 (0.0009) [2023-10-08 00:43:25,091][52059] Updated weights for policy 1, policy_version 21212 (0.0009) [2023-10-08 00:43:25,456][52060] Updated weights for policy 0, policy_version 20930 (0.0008) [2023-10-08 00:43:25,824][52060] Updated weights for policy 0, policy_version 20940 (0.0008) [2023-10-08 00:43:26,191][52060] Updated weights for policy 0, policy_version 20950 (0.0007) [2023-10-08 00:43:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 43155456. Throughput: 0: 1718.4, 1: 1724.6. Samples: 10798826. Policy #0 lag: (min: 3.0, avg: 8.4, max: 35.0) [2023-10-08 00:43:26,211][50642] Avg episode reward: [(0, '15.360'), (1, '18.320')] [2023-10-08 00:43:26,563][52060] Updated weights for policy 0, policy_version 20960 (0.0007) [2023-10-08 00:43:28,943][52059] Updated weights for policy 1, policy_version 21222 (0.0009) [2023-10-08 00:43:29,300][52059] Updated weights for policy 1, policy_version 21232 (0.0008) [2023-10-08 00:43:29,665][52059] Updated weights for policy 1, policy_version 21242 (0.0009) [2023-10-08 00:43:30,435][52060] Updated weights for policy 0, policy_version 20970 (0.0008) [2023-10-08 00:43:30,799][52060] Updated weights for policy 0, policy_version 20980 (0.0007) [2023-10-08 00:43:31,175][52060] Updated weights for policy 0, policy_version 20990 (0.0008) [2023-10-08 00:43:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 43220992. Throughput: 0: 1705.4, 1: 1715.9. Samples: 10819188. Policy #0 lag: (min: 3.0, avg: 8.4, max: 35.0) [2023-10-08 00:43:31,211][50642] Avg episode reward: [(0, '16.010'), (1, '19.030')] [2023-10-08 00:43:33,596][52059] Updated weights for policy 1, policy_version 21252 (0.0008) [2023-10-08 00:43:33,964][52059] Updated weights for policy 1, policy_version 21262 (0.0009) [2023-10-08 00:43:34,324][52059] Updated weights for policy 1, policy_version 21272 (0.0009) [2023-10-08 00:43:34,890][52060] Updated weights for policy 0, policy_version 21000 (0.0008) [2023-10-08 00:43:35,257][52060] Updated weights for policy 0, policy_version 21010 (0.0008) [2023-10-08 00:43:35,632][52060] Updated weights for policy 0, policy_version 21020 (0.0009) [2023-10-08 00:43:36,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 43319296. Throughput: 0: 1731.0, 1: 1736.0. Samples: 10830480. Policy #0 lag: (min: 1.0, avg: 6.7, max: 33.0) [2023-10-08 00:43:36,211][50642] Avg episode reward: [(0, '17.150'), (1, '16.380')] [2023-10-08 00:43:38,158][52059] Updated weights for policy 1, policy_version 21282 (0.0009) [2023-10-08 00:43:38,579][52059] Updated weights for policy 1, policy_version 21292 (0.0008) [2023-10-08 00:43:38,955][52059] Updated weights for policy 1, policy_version 21302 (0.0008) [2023-10-08 00:43:39,318][52059] Updated weights for policy 1, policy_version 21312 (0.0007) [2023-10-08 00:43:39,530][52060] Updated weights for policy 0, policy_version 21030 (0.0009) [2023-10-08 00:43:39,909][52060] Updated weights for policy 0, policy_version 21040 (0.0010) [2023-10-08 00:43:40,270][52060] Updated weights for policy 0, policy_version 21050 (0.0010) [2023-10-08 00:43:41,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 43384832. Throughput: 0: 1723.5, 1: 1719.6. Samples: 10850494. Policy #0 lag: (min: 1.0, avg: 6.7, max: 33.0) [2023-10-08 00:43:41,211][50642] Avg episode reward: [(0, '14.650'), (1, '19.800')] [2023-10-08 00:43:43,161][52059] Updated weights for policy 1, policy_version 21322 (0.0007) [2023-10-08 00:43:43,525][52059] Updated weights for policy 1, policy_version 21332 (0.0007) [2023-10-08 00:43:43,892][52059] Updated weights for policy 1, policy_version 21342 (0.0007) [2023-10-08 00:43:44,337][52060] Updated weights for policy 0, policy_version 21060 (0.0010) [2023-10-08 00:43:44,692][52060] Updated weights for policy 0, policy_version 21070 (0.0010) [2023-10-08 00:43:45,062][52060] Updated weights for policy 0, policy_version 21080 (0.0009) [2023-10-08 00:43:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 43450368. Throughput: 0: 1698.4, 1: 1736.5. Samples: 10871102. Policy #0 lag: (min: 1.0, avg: 6.7, max: 33.0) [2023-10-08 00:43:46,211][50642] Avg episode reward: [(0, '16.900'), (1, '20.830')] [2023-10-08 00:43:47,849][52059] Updated weights for policy 1, policy_version 21352 (0.0009) [2023-10-08 00:43:48,222][52059] Updated weights for policy 1, policy_version 21362 (0.0007) [2023-10-08 00:43:48,581][52059] Updated weights for policy 1, policy_version 21372 (0.0009) [2023-10-08 00:43:49,325][52060] Updated weights for policy 0, policy_version 21090 (0.0010) [2023-10-08 00:43:49,732][52060] Updated weights for policy 0, policy_version 21100 (0.0007) [2023-10-08 00:43:50,104][52060] Updated weights for policy 0, policy_version 21110 (0.0007) [2023-10-08 00:43:50,474][52060] Updated weights for policy 0, policy_version 21120 (0.0007) [2023-10-08 00:43:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 43515904. Throughput: 0: 1731.8, 1: 1719.7. Samples: 10881660. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 00:43:51,211][50642] Avg episode reward: [(0, '15.880'), (1, '18.890')] [2023-10-08 00:43:52,462][52059] Updated weights for policy 1, policy_version 21382 (0.0010) [2023-10-08 00:43:52,838][52059] Updated weights for policy 1, policy_version 21392 (0.0008) [2023-10-08 00:43:53,200][52059] Updated weights for policy 1, policy_version 21402 (0.0007) [2023-10-08 00:43:54,414][52060] Updated weights for policy 0, policy_version 21130 (0.0010) [2023-10-08 00:43:54,772][52060] Updated weights for policy 0, policy_version 21140 (0.0008) [2023-10-08 00:43:55,148][52060] Updated weights for policy 0, policy_version 21150 (0.0007) [2023-10-08 00:43:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 43581440. Throughput: 0: 1705.7, 1: 1726.0. Samples: 10902022. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 00:43:56,211][50642] Avg episode reward: [(0, '16.750'), (1, '18.260')] [2023-10-08 00:43:57,123][52059] Updated weights for policy 1, policy_version 21412 (0.0009) [2023-10-08 00:43:57,499][52059] Updated weights for policy 1, policy_version 21422 (0.0008) [2023-10-08 00:43:57,865][52059] Updated weights for policy 1, policy_version 21432 (0.0010) [2023-10-08 00:43:59,062][52060] Updated weights for policy 0, policy_version 21160 (0.0009) [2023-10-08 00:43:59,436][52060] Updated weights for policy 0, policy_version 21170 (0.0009) [2023-10-08 00:43:59,806][52060] Updated weights for policy 0, policy_version 21180 (0.0010) [2023-10-08 00:44:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 43646976. Throughput: 0: 1703.9, 1: 1750.5. Samples: 10923010. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 00:44:01,211][50642] Avg episode reward: [(0, '17.380'), (1, '19.820')] [2023-10-08 00:44:01,733][52059] Updated weights for policy 1, policy_version 21442 (0.0010) [2023-10-08 00:44:02,095][52059] Updated weights for policy 1, policy_version 21452 (0.0009) [2023-10-08 00:44:02,464][52059] Updated weights for policy 1, policy_version 21462 (0.0009) [2023-10-08 00:44:02,824][52059] Updated weights for policy 1, policy_version 21472 (0.0008) [2023-10-08 00:44:03,648][52060] Updated weights for policy 0, policy_version 21190 (0.0008) [2023-10-08 00:44:04,021][52060] Updated weights for policy 0, policy_version 21200 (0.0007) [2023-10-08 00:44:04,394][52060] Updated weights for policy 0, policy_version 21210 (0.0009) [2023-10-08 00:44:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 43712512. Throughput: 0: 1721.3, 1: 1717.0. Samples: 10933350. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 00:44:06,211][50642] Avg episode reward: [(0, '14.460'), (1, '18.010')] [2023-10-08 00:44:06,725][52059] Updated weights for policy 1, policy_version 21482 (0.0008) [2023-10-08 00:44:07,078][52059] Updated weights for policy 1, policy_version 21492 (0.0007) [2023-10-08 00:44:07,447][52059] Updated weights for policy 1, policy_version 21502 (0.0007) [2023-10-08 00:44:08,376][52060] Updated weights for policy 0, policy_version 21220 (0.0008) [2023-10-08 00:44:08,751][52060] Updated weights for policy 0, policy_version 21230 (0.0008) [2023-10-08 00:44:09,111][52060] Updated weights for policy 0, policy_version 21240 (0.0008) [2023-10-08 00:44:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 43778048. Throughput: 0: 1701.8, 1: 1743.4. Samples: 10953860. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:44:11,212][50642] Avg episode reward: [(0, '16.440'), (1, '17.820')] [2023-10-08 00:44:11,430][52059] Updated weights for policy 1, policy_version 21512 (0.0008) [2023-10-08 00:44:11,806][52059] Updated weights for policy 1, policy_version 21522 (0.0010) [2023-10-08 00:44:12,170][52059] Updated weights for policy 1, policy_version 21532 (0.0010) [2023-10-08 00:44:13,023][52060] Updated weights for policy 0, policy_version 21250 (0.0009) [2023-10-08 00:44:13,398][52060] Updated weights for policy 0, policy_version 21260 (0.0009) [2023-10-08 00:44:13,774][52060] Updated weights for policy 0, policy_version 21270 (0.0009) [2023-10-08 00:44:14,143][52060] Updated weights for policy 0, policy_version 21280 (0.0008) [2023-10-08 00:44:16,053][52059] Updated weights for policy 1, policy_version 21542 (0.0009) [2023-10-08 00:44:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 43843584. Throughput: 0: 1720.2, 1: 1744.1. Samples: 10975084. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:44:16,211][50642] Avg episode reward: [(0, '15.910'), (1, '18.990')] [2023-10-08 00:44:16,416][52059] Updated weights for policy 1, policy_version 21552 (0.0007) [2023-10-08 00:44:16,781][52059] Updated weights for policy 1, policy_version 21562 (0.0007) [2023-10-08 00:44:18,152][52060] Updated weights for policy 0, policy_version 21290 (0.0010) [2023-10-08 00:44:18,516][52060] Updated weights for policy 0, policy_version 21300 (0.0010) [2023-10-08 00:44:18,883][52060] Updated weights for policy 0, policy_version 21310 (0.0009) [2023-10-08 00:44:20,733][52059] Updated weights for policy 1, policy_version 21572 (0.0007) [2023-10-08 00:44:21,082][52059] Updated weights for policy 1, policy_version 21582 (0.0010) [2023-10-08 00:44:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 43909120. Throughput: 0: 1703.7, 1: 1727.2. Samples: 10984872. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:44:21,211][50642] Avg episode reward: [(0, '16.280'), (1, '17.590')] [2023-10-08 00:44:21,459][52059] Updated weights for policy 1, policy_version 21592 (0.0011) [2023-10-08 00:44:22,821][52060] Updated weights for policy 0, policy_version 21320 (0.0009) [2023-10-08 00:44:23,180][52060] Updated weights for policy 0, policy_version 21330 (0.0011) [2023-10-08 00:44:23,553][52060] Updated weights for policy 0, policy_version 21340 (0.0008) [2023-10-08 00:44:25,484][52059] Updated weights for policy 1, policy_version 21602 (0.0010) [2023-10-08 00:44:25,879][52059] Updated weights for policy 1, policy_version 21612 (0.0007) [2023-10-08 00:44:26,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 43974656. Throughput: 0: 1706.5, 1: 1751.2. Samples: 11006092. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:44:26,211][50642] Avg episode reward: [(0, '17.780'), (1, '17.710')] [2023-10-08 00:44:26,246][52059] Updated weights for policy 1, policy_version 21622 (0.0009) [2023-10-08 00:44:26,617][52059] Updated weights for policy 1, policy_version 21632 (0.0009) [2023-10-08 00:44:27,521][52060] Updated weights for policy 0, policy_version 21350 (0.0008) [2023-10-08 00:44:27,887][52060] Updated weights for policy 0, policy_version 21360 (0.0008) [2023-10-08 00:44:28,262][52060] Updated weights for policy 0, policy_version 21370 (0.0009) [2023-10-08 00:44:30,396][52059] Updated weights for policy 1, policy_version 21642 (0.0010) [2023-10-08 00:44:30,755][52059] Updated weights for policy 1, policy_version 21652 (0.0008) [2023-10-08 00:44:31,138][52059] Updated weights for policy 1, policy_version 21662 (0.0009) [2023-10-08 00:44:31,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 44072960. Throughput: 0: 1726.0, 1: 1725.4. Samples: 11026416. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 00:44:31,213][50642] Avg episode reward: [(0, '14.800'), (1, '19.210')] [2023-10-08 00:44:32,157][52060] Updated weights for policy 0, policy_version 21380 (0.0009) [2023-10-08 00:44:32,524][52060] Updated weights for policy 0, policy_version 21390 (0.0010) [2023-10-08 00:44:32,900][52060] Updated weights for policy 0, policy_version 21400 (0.0010) [2023-10-08 00:44:35,044][52059] Updated weights for policy 1, policy_version 21672 (0.0010) [2023-10-08 00:44:35,396][52059] Updated weights for policy 1, policy_version 21682 (0.0011) [2023-10-08 00:44:35,767][52059] Updated weights for policy 1, policy_version 21692 (0.0007) [2023-10-08 00:44:36,210][50642] Fps is (10 sec: 16384.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 44138496. Throughput: 0: 1699.5, 1: 1749.4. Samples: 11036860. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 00:44:36,211][50642] Avg episode reward: [(0, '15.900'), (1, '18.990')] [2023-10-08 00:44:36,884][52060] Updated weights for policy 0, policy_version 21410 (0.0009) [2023-10-08 00:44:37,290][52060] Updated weights for policy 0, policy_version 21420 (0.0010) [2023-10-08 00:44:37,662][52060] Updated weights for policy 0, policy_version 21430 (0.0008) [2023-10-08 00:44:38,018][52060] Updated weights for policy 0, policy_version 21440 (0.0008) [2023-10-08 00:44:39,675][52059] Updated weights for policy 1, policy_version 21702 (0.0008) [2023-10-08 00:44:40,040][52059] Updated weights for policy 1, policy_version 21712 (0.0008) [2023-10-08 00:44:40,396][52059] Updated weights for policy 1, policy_version 21722 (0.0009) [2023-10-08 00:44:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 44204032. Throughput: 0: 1726.2, 1: 1737.3. Samples: 11057880. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 00:44:41,211][50642] Avg episode reward: [(0, '15.740'), (1, '16.550')] [2023-10-08 00:44:41,916][52060] Updated weights for policy 0, policy_version 21450 (0.0009) [2023-10-08 00:44:42,293][52060] Updated weights for policy 0, policy_version 21460 (0.0009) [2023-10-08 00:44:42,658][52060] Updated weights for policy 0, policy_version 21470 (0.0009) [2023-10-08 00:44:44,375][52059] Updated weights for policy 1, policy_version 21732 (0.0008) [2023-10-08 00:44:44,745][52059] Updated weights for policy 1, policy_version 21742 (0.0009) [2023-10-08 00:44:45,101][52059] Updated weights for policy 1, policy_version 21752 (0.0008) [2023-10-08 00:44:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 44269568. Throughput: 0: 1729.3, 1: 1719.6. Samples: 11078210. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 00:44:46,211][50642] Avg episode reward: [(0, '16.080'), (1, '18.070')] [2023-10-08 00:44:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000021472_21987328.pth... [2023-10-08 00:44:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000021760_22282240.pth... [2023-10-08 00:44:46,250][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000019872_20348928.pth [2023-10-08 00:44:46,251][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000020128_20611072.pth [2023-10-08 00:44:46,841][52060] Updated weights for policy 0, policy_version 21480 (0.0009) [2023-10-08 00:44:47,221][52060] Updated weights for policy 0, policy_version 21490 (0.0009) [2023-10-08 00:44:47,585][52060] Updated weights for policy 0, policy_version 21500 (0.0009) [2023-10-08 00:44:48,905][52059] Updated weights for policy 1, policy_version 21762 (0.0009) [2023-10-08 00:44:49,264][52059] Updated weights for policy 1, policy_version 21772 (0.0009) [2023-10-08 00:44:49,633][52059] Updated weights for policy 1, policy_version 21782 (0.0010) [2023-10-08 00:44:49,991][52059] Updated weights for policy 1, policy_version 21792 (0.0009) [2023-10-08 00:44:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 44335104. Throughput: 0: 1703.7, 1: 1752.3. Samples: 11088870. Policy #0 lag: (min: 15.0, avg: 22.8, max: 47.0) [2023-10-08 00:44:51,211][50642] Avg episode reward: [(0, '17.150'), (1, '20.790')] [2023-10-08 00:44:51,678][52060] Updated weights for policy 0, policy_version 21510 (0.0009) [2023-10-08 00:44:52,040][52060] Updated weights for policy 0, policy_version 21520 (0.0010) [2023-10-08 00:44:52,406][52060] Updated weights for policy 0, policy_version 21530 (0.0007) [2023-10-08 00:44:53,826][52059] Updated weights for policy 1, policy_version 21802 (0.0010) [2023-10-08 00:44:54,190][52059] Updated weights for policy 1, policy_version 21812 (0.0010) [2023-10-08 00:44:54,551][52059] Updated weights for policy 1, policy_version 21822 (0.0009) [2023-10-08 00:44:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 44400640. Throughput: 0: 1721.9, 1: 1722.9. Samples: 11108878. Policy #0 lag: (min: 15.0, avg: 22.8, max: 47.0) [2023-10-08 00:44:56,211][50642] Avg episode reward: [(0, '15.100'), (1, '18.370')] [2023-10-08 00:44:56,521][52060] Updated weights for policy 0, policy_version 21540 (0.0008) [2023-10-08 00:44:56,887][52060] Updated weights for policy 0, policy_version 21550 (0.0007) [2023-10-08 00:44:57,263][52060] Updated weights for policy 0, policy_version 21560 (0.0007) [2023-10-08 00:44:58,497][52059] Updated weights for policy 1, policy_version 21832 (0.0007) [2023-10-08 00:44:58,870][52059] Updated weights for policy 1, policy_version 21842 (0.0009) [2023-10-08 00:44:59,240][52059] Updated weights for policy 1, policy_version 21852 (0.0007) [2023-10-08 00:45:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 44466176. Throughput: 0: 1717.1, 1: 1730.7. Samples: 11130240. Policy #0 lag: (min: 15.0, avg: 22.8, max: 47.0) [2023-10-08 00:45:01,211][50642] Avg episode reward: [(0, '16.000'), (1, '17.130')] [2023-10-08 00:45:01,245][52060] Updated weights for policy 0, policy_version 21570 (0.0007) [2023-10-08 00:45:01,609][52060] Updated weights for policy 0, policy_version 21580 (0.0008) [2023-10-08 00:45:01,982][52060] Updated weights for policy 0, policy_version 21590 (0.0009) [2023-10-08 00:45:02,349][52060] Updated weights for policy 0, policy_version 21600 (0.0010) [2023-10-08 00:45:03,180][52059] Updated weights for policy 1, policy_version 21862 (0.0009) [2023-10-08 00:45:03,543][52059] Updated weights for policy 1, policy_version 21872 (0.0009) [2023-10-08 00:45:03,911][52059] Updated weights for policy 1, policy_version 21882 (0.0007) [2023-10-08 00:45:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 44531712. Throughput: 0: 1710.7, 1: 1735.6. Samples: 11139956. Policy #0 lag: (min: 15.0, avg: 22.8, max: 47.0) [2023-10-08 00:45:06,211][50642] Avg episode reward: [(0, '16.250'), (1, '20.710')] [2023-10-08 00:45:06,321][52060] Updated weights for policy 0, policy_version 21610 (0.0007) [2023-10-08 00:45:06,696][52060] Updated weights for policy 0, policy_version 21620 (0.0008) [2023-10-08 00:45:07,059][52060] Updated weights for policy 0, policy_version 21630 (0.0009) [2023-10-08 00:45:07,928][52059] Updated weights for policy 1, policy_version 21892 (0.0008) [2023-10-08 00:45:08,296][52059] Updated weights for policy 1, policy_version 21902 (0.0009) [2023-10-08 00:45:08,663][52059] Updated weights for policy 1, policy_version 21912 (0.0007) [2023-10-08 00:45:11,065][52060] Updated weights for policy 0, policy_version 21640 (0.0008) [2023-10-08 00:45:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 44597248. Throughput: 0: 1710.1, 1: 1720.7. Samples: 11160476. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 00:45:11,211][50642] Avg episode reward: [(0, '14.290'), (1, '17.230')] [2023-10-08 00:45:11,436][52060] Updated weights for policy 0, policy_version 21650 (0.0009) [2023-10-08 00:45:11,811][52060] Updated weights for policy 0, policy_version 21660 (0.0009) [2023-10-08 00:45:12,753][52059] Updated weights for policy 1, policy_version 21922 (0.0008) [2023-10-08 00:45:13,170][52059] Updated weights for policy 1, policy_version 21932 (0.0009) [2023-10-08 00:45:13,546][52059] Updated weights for policy 1, policy_version 21942 (0.0010) [2023-10-08 00:45:13,898][52059] Updated weights for policy 1, policy_version 21952 (0.0008) [2023-10-08 00:45:15,647][52060] Updated weights for policy 0, policy_version 21670 (0.0008) [2023-10-08 00:45:16,023][52060] Updated weights for policy 0, policy_version 21680 (0.0010) [2023-10-08 00:45:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 44662784. Throughput: 0: 1706.0, 1: 1734.8. Samples: 11181250. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 00:45:16,211][50642] Avg episode reward: [(0, '17.710'), (1, '16.450')] [2023-10-08 00:45:16,392][52060] Updated weights for policy 0, policy_version 21690 (0.0007) [2023-10-08 00:45:17,830][52059] Updated weights for policy 1, policy_version 21962 (0.0008) [2023-10-08 00:45:18,190][52059] Updated weights for policy 1, policy_version 21972 (0.0009) [2023-10-08 00:45:18,565][52059] Updated weights for policy 1, policy_version 21982 (0.0007) [2023-10-08 00:45:20,484][52060] Updated weights for policy 0, policy_version 21700 (0.0008) [2023-10-08 00:45:20,846][52060] Updated weights for policy 0, policy_version 21710 (0.0008) [2023-10-08 00:45:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 44728320. Throughput: 0: 1713.2, 1: 1714.7. Samples: 11191114. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 00:45:21,211][50642] Avg episode reward: [(0, '15.520'), (1, '21.290')] [2023-10-08 00:45:21,211][52060] Updated weights for policy 0, policy_version 21720 (0.0010) [2023-10-08 00:45:21,212][51710] Saving new best policy, reward=21.290! [2023-10-08 00:45:22,634][52059] Updated weights for policy 1, policy_version 21992 (0.0007) [2023-10-08 00:45:22,997][52059] Updated weights for policy 1, policy_version 22002 (0.0008) [2023-10-08 00:45:23,357][52059] Updated weights for policy 1, policy_version 22012 (0.0009) [2023-10-08 00:45:25,213][52060] Updated weights for policy 0, policy_version 21730 (0.0008) [2023-10-08 00:45:25,628][52060] Updated weights for policy 0, policy_version 21740 (0.0008) [2023-10-08 00:45:25,992][52060] Updated weights for policy 0, policy_version 21750 (0.0008) [2023-10-08 00:45:26,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 44793856. Throughput: 0: 1710.3, 1: 1722.2. Samples: 11212344. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 00:45:26,211][50642] Avg episode reward: [(0, '15.870'), (1, '19.680')] [2023-10-08 00:45:26,353][52060] Updated weights for policy 0, policy_version 21760 (0.0007) [2023-10-08 00:45:27,261][52059] Updated weights for policy 1, policy_version 22022 (0.0008) [2023-10-08 00:45:27,622][52059] Updated weights for policy 1, policy_version 22032 (0.0008) [2023-10-08 00:45:27,974][52059] Updated weights for policy 1, policy_version 22042 (0.0008) [2023-10-08 00:45:30,285][52060] Updated weights for policy 0, policy_version 21770 (0.0009) [2023-10-08 00:45:30,648][52060] Updated weights for policy 0, policy_version 21780 (0.0010) [2023-10-08 00:45:31,022][52060] Updated weights for policy 0, policy_version 21790 (0.0010) [2023-10-08 00:45:31,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 44892160. Throughput: 0: 1686.7, 1: 1745.9. Samples: 11232676. Policy #0 lag: (min: 31.0, avg: 41.7, max: 63.0) [2023-10-08 00:45:31,211][50642] Avg episode reward: [(0, '17.200'), (1, '17.010')] [2023-10-08 00:45:31,892][52059] Updated weights for policy 1, policy_version 22052 (0.0008) [2023-10-08 00:45:32,261][52059] Updated weights for policy 1, policy_version 22062 (0.0007) [2023-10-08 00:45:32,622][52059] Updated weights for policy 1, policy_version 22072 (0.0007) [2023-10-08 00:45:34,985][52060] Updated weights for policy 0, policy_version 21800 (0.0008) [2023-10-08 00:45:35,354][52060] Updated weights for policy 0, policy_version 21810 (0.0008) [2023-10-08 00:45:35,723][52060] Updated weights for policy 0, policy_version 21820 (0.0008) [2023-10-08 00:45:36,210][50642] Fps is (10 sec: 16384.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 44957696. Throughput: 0: 1712.8, 1: 1712.4. Samples: 11243000. Policy #0 lag: (min: 31.0, avg: 41.7, max: 63.0) [2023-10-08 00:45:36,211][50642] Avg episode reward: [(0, '14.320'), (1, '19.300')] [2023-10-08 00:45:36,360][52059] Updated weights for policy 1, policy_version 22082 (0.0008) [2023-10-08 00:45:36,724][52059] Updated weights for policy 1, policy_version 22092 (0.0009) [2023-10-08 00:45:37,101][52059] Updated weights for policy 1, policy_version 22102 (0.0010) [2023-10-08 00:45:37,460][52059] Updated weights for policy 1, policy_version 22112 (0.0009) [2023-10-08 00:45:39,579][52060] Updated weights for policy 0, policy_version 21830 (0.0009) [2023-10-08 00:45:39,947][52060] Updated weights for policy 0, policy_version 21840 (0.0009) [2023-10-08 00:45:40,315][52060] Updated weights for policy 0, policy_version 21850 (0.0008) [2023-10-08 00:45:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 45023232. Throughput: 0: 1706.3, 1: 1735.6. Samples: 11263760. Policy #0 lag: (min: 31.0, avg: 41.7, max: 63.0) [2023-10-08 00:45:41,211][50642] Avg episode reward: [(0, '16.560'), (1, '18.650')] [2023-10-08 00:45:41,539][52059] Updated weights for policy 1, policy_version 22122 (0.0010) [2023-10-08 00:45:41,899][52059] Updated weights for policy 1, policy_version 22132 (0.0009) [2023-10-08 00:45:42,264][52059] Updated weights for policy 1, policy_version 22142 (0.0011) [2023-10-08 00:45:44,530][52060] Updated weights for policy 0, policy_version 21860 (0.0010) [2023-10-08 00:45:44,891][52060] Updated weights for policy 0, policy_version 21870 (0.0009) [2023-10-08 00:45:45,266][52060] Updated weights for policy 0, policy_version 21880 (0.0008) [2023-10-08 00:45:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 45088768. Throughput: 0: 1678.5, 1: 1729.3. Samples: 11283590. Policy #0 lag: (min: 1.0, avg: 11.2, max: 33.0) [2023-10-08 00:45:46,211][50642] Avg episode reward: [(0, '15.260'), (1, '16.490')] [2023-10-08 00:45:46,313][52059] Updated weights for policy 1, policy_version 22152 (0.0009) [2023-10-08 00:45:46,686][52059] Updated weights for policy 1, policy_version 22162 (0.0010) [2023-10-08 00:45:47,055][52059] Updated weights for policy 1, policy_version 22172 (0.0007) [2023-10-08 00:45:49,074][52060] Updated weights for policy 0, policy_version 21890 (0.0010) [2023-10-08 00:45:49,447][52060] Updated weights for policy 0, policy_version 21900 (0.0008) [2023-10-08 00:45:49,820][52060] Updated weights for policy 0, policy_version 21910 (0.0008) [2023-10-08 00:45:50,181][52060] Updated weights for policy 0, policy_version 21920 (0.0009) [2023-10-08 00:45:50,989][52059] Updated weights for policy 1, policy_version 22182 (0.0008) [2023-10-08 00:45:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 45154304. Throughput: 0: 1711.8, 1: 1720.4. Samples: 11294406. Policy #0 lag: (min: 1.0, avg: 11.2, max: 33.0) [2023-10-08 00:45:51,211][50642] Avg episode reward: [(0, '14.390'), (1, '17.210')] [2023-10-08 00:45:51,352][52059] Updated weights for policy 1, policy_version 22192 (0.0010) [2023-10-08 00:45:51,721][52059] Updated weights for policy 1, policy_version 22202 (0.0010) [2023-10-08 00:45:54,248][52060] Updated weights for policy 0, policy_version 21930 (0.0008) [2023-10-08 00:45:54,611][52060] Updated weights for policy 0, policy_version 21940 (0.0008) [2023-10-08 00:45:54,988][52060] Updated weights for policy 0, policy_version 21950 (0.0008) [2023-10-08 00:45:55,719][52059] Updated weights for policy 1, policy_version 22212 (0.0010) [2023-10-08 00:45:56,082][52059] Updated weights for policy 1, policy_version 22222 (0.0010) [2023-10-08 00:45:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 45219840. Throughput: 0: 1692.9, 1: 1731.8. Samples: 11314588. Policy #0 lag: (min: 1.0, avg: 11.2, max: 33.0) [2023-10-08 00:45:56,211][50642] Avg episode reward: [(0, '17.990'), (1, '20.160')] [2023-10-08 00:45:56,439][52059] Updated weights for policy 1, policy_version 22232 (0.0009) [2023-10-08 00:45:59,074][52060] Updated weights for policy 0, policy_version 21960 (0.0008) [2023-10-08 00:45:59,437][52060] Updated weights for policy 0, policy_version 21970 (0.0009) [2023-10-08 00:45:59,806][52060] Updated weights for policy 0, policy_version 21980 (0.0008) [2023-10-08 00:46:00,708][52059] Updated weights for policy 1, policy_version 22242 (0.0010) [2023-10-08 00:46:01,116][52059] Updated weights for policy 1, policy_version 22252 (0.0008) [2023-10-08 00:46:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 45285376. Throughput: 0: 1691.4, 1: 1727.7. Samples: 11335110. Policy #0 lag: (min: 1.0, avg: 11.2, max: 33.0) [2023-10-08 00:46:01,211][50642] Avg episode reward: [(0, '14.590'), (1, '16.940')] [2023-10-08 00:46:01,476][52059] Updated weights for policy 1, policy_version 22262 (0.0007) [2023-10-08 00:46:01,848][52059] Updated weights for policy 1, policy_version 22272 (0.0009) [2023-10-08 00:46:03,729][52060] Updated weights for policy 0, policy_version 21990 (0.0008) [2023-10-08 00:46:04,105][52060] Updated weights for policy 0, policy_version 22000 (0.0009) [2023-10-08 00:46:04,467][52060] Updated weights for policy 0, policy_version 22010 (0.0010) [2023-10-08 00:46:05,657][52059] Updated weights for policy 1, policy_version 22282 (0.0008) [2023-10-08 00:46:06,019][52059] Updated weights for policy 1, policy_version 22292 (0.0007) [2023-10-08 00:46:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 45350912. Throughput: 0: 1705.6, 1: 1734.2. Samples: 11345906. Policy #0 lag: (min: 31.0, avg: 31.1, max: 37.0) [2023-10-08 00:46:06,211][50642] Avg episode reward: [(0, '16.330'), (1, '17.700')] [2023-10-08 00:46:06,389][52059] Updated weights for policy 1, policy_version 22302 (0.0008) [2023-10-08 00:46:08,461][52060] Updated weights for policy 0, policy_version 22020 (0.0010) [2023-10-08 00:46:08,836][52060] Updated weights for policy 0, policy_version 22030 (0.0008) [2023-10-08 00:46:09,197][52060] Updated weights for policy 0, policy_version 22040 (0.0010) [2023-10-08 00:46:10,302][52059] Updated weights for policy 1, policy_version 22312 (0.0008) [2023-10-08 00:46:10,670][52059] Updated weights for policy 1, policy_version 22322 (0.0010) [2023-10-08 00:46:11,035][52059] Updated weights for policy 1, policy_version 22332 (0.0010) [2023-10-08 00:46:11,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 45449216. Throughput: 0: 1684.6, 1: 1732.6. Samples: 11366120. Policy #0 lag: (min: 31.0, avg: 31.1, max: 37.0) [2023-10-08 00:46:11,211][50642] Avg episode reward: [(0, '17.680'), (1, '21.950')] [2023-10-08 00:46:11,213][51710] Saving new best policy, reward=21.950! [2023-10-08 00:46:13,307][52060] Updated weights for policy 0, policy_version 22050 (0.0010) [2023-10-08 00:46:13,709][52060] Updated weights for policy 0, policy_version 22060 (0.0008) [2023-10-08 00:46:14,082][52060] Updated weights for policy 0, policy_version 22070 (0.0009) [2023-10-08 00:46:14,450][52060] Updated weights for policy 0, policy_version 22080 (0.0010) [2023-10-08 00:46:14,872][52059] Updated weights for policy 1, policy_version 22342 (0.0008) [2023-10-08 00:46:15,249][52059] Updated weights for policy 1, policy_version 22352 (0.0007) [2023-10-08 00:46:15,607][52059] Updated weights for policy 1, policy_version 22362 (0.0009) [2023-10-08 00:46:16,210][50642] Fps is (10 sec: 16384.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 45514752. Throughput: 0: 1714.0, 1: 1701.2. Samples: 11386356. Policy #0 lag: (min: 31.0, avg: 31.1, max: 37.0) [2023-10-08 00:46:16,211][50642] Avg episode reward: [(0, '14.430'), (1, '17.550')] [2023-10-08 00:46:18,259][52060] Updated weights for policy 0, policy_version 22090 (0.0009) [2023-10-08 00:46:18,631][52060] Updated weights for policy 0, policy_version 22100 (0.0010) [2023-10-08 00:46:19,005][52060] Updated weights for policy 0, policy_version 22110 (0.0007) [2023-10-08 00:46:19,563][52059] Updated weights for policy 1, policy_version 22372 (0.0010) [2023-10-08 00:46:19,931][52059] Updated weights for policy 1, policy_version 22382 (0.0009) [2023-10-08 00:46:20,291][52059] Updated weights for policy 1, policy_version 22392 (0.0009) [2023-10-08 00:46:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 45580288. Throughput: 0: 1699.8, 1: 1729.3. Samples: 11397310. Policy #0 lag: (min: 31.0, avg: 31.1, max: 37.0) [2023-10-08 00:46:21,211][50642] Avg episode reward: [(0, '17.990'), (1, '16.120')] [2023-10-08 00:46:22,919][52060] Updated weights for policy 0, policy_version 22120 (0.0009) [2023-10-08 00:46:23,290][52060] Updated weights for policy 0, policy_version 22130 (0.0010) [2023-10-08 00:46:23,656][52060] Updated weights for policy 0, policy_version 22140 (0.0008) [2023-10-08 00:46:24,388][52059] Updated weights for policy 1, policy_version 22402 (0.0007) [2023-10-08 00:46:24,758][52059] Updated weights for policy 1, policy_version 22412 (0.0010) [2023-10-08 00:46:25,116][52059] Updated weights for policy 1, policy_version 22422 (0.0008) [2023-10-08 00:46:25,480][52059] Updated weights for policy 1, policy_version 22432 (0.0009) [2023-10-08 00:46:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 45645824. Throughput: 0: 1700.9, 1: 1713.7. Samples: 11417416. Policy #0 lag: (min: 2.0, avg: 6.3, max: 34.0) [2023-10-08 00:46:26,211][50642] Avg episode reward: [(0, '14.460'), (1, '20.350')] [2023-10-08 00:46:27,591][52060] Updated weights for policy 0, policy_version 22150 (0.0008) [2023-10-08 00:46:27,967][52060] Updated weights for policy 0, policy_version 22160 (0.0008) [2023-10-08 00:46:28,335][52060] Updated weights for policy 0, policy_version 22170 (0.0008) [2023-10-08 00:46:29,325][52059] Updated weights for policy 1, policy_version 22442 (0.0009) [2023-10-08 00:46:29,684][52059] Updated weights for policy 1, policy_version 22452 (0.0008) [2023-10-08 00:46:30,051][52059] Updated weights for policy 1, policy_version 22462 (0.0007) [2023-10-08 00:46:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 45711360. Throughput: 0: 1737.3, 1: 1702.7. Samples: 11438390. Policy #0 lag: (min: 2.0, avg: 6.3, max: 34.0) [2023-10-08 00:46:31,211][50642] Avg episode reward: [(0, '15.240'), (1, '17.000')] [2023-10-08 00:46:32,200][52060] Updated weights for policy 0, policy_version 22180 (0.0008) [2023-10-08 00:46:32,577][52060] Updated weights for policy 0, policy_version 22190 (0.0007) [2023-10-08 00:46:32,958][52060] Updated weights for policy 0, policy_version 22200 (0.0008) [2023-10-08 00:46:33,942][52059] Updated weights for policy 1, policy_version 22472 (0.0008) [2023-10-08 00:46:34,322][52059] Updated weights for policy 1, policy_version 22482 (0.0010) [2023-10-08 00:46:34,688][52059] Updated weights for policy 1, policy_version 22492 (0.0008) [2023-10-08 00:46:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 45776896. Throughput: 0: 1703.9, 1: 1729.0. Samples: 11448884. Policy #0 lag: (min: 2.0, avg: 6.3, max: 34.0) [2023-10-08 00:46:36,211][50642] Avg episode reward: [(0, '18.340'), (1, '17.280')] [2023-10-08 00:46:36,211][51605] Saving new best policy, reward=18.340! [2023-10-08 00:46:36,927][52060] Updated weights for policy 0, policy_version 22210 (0.0007) [2023-10-08 00:46:37,291][52060] Updated weights for policy 0, policy_version 22220 (0.0007) [2023-10-08 00:46:37,673][52060] Updated weights for policy 0, policy_version 22230 (0.0009) [2023-10-08 00:46:38,033][52060] Updated weights for policy 0, policy_version 22240 (0.0008) [2023-10-08 00:46:38,500][52059] Updated weights for policy 1, policy_version 22502 (0.0009) [2023-10-08 00:46:38,858][52059] Updated weights for policy 1, policy_version 22512 (0.0009) [2023-10-08 00:46:39,222][52059] Updated weights for policy 1, policy_version 22522 (0.0011) [2023-10-08 00:46:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 45842432. Throughput: 0: 1725.3, 1: 1708.5. Samples: 11469108. Policy #0 lag: (min: 2.0, avg: 6.3, max: 34.0) [2023-10-08 00:46:41,211][50642] Avg episode reward: [(0, '14.500'), (1, '17.560')] [2023-10-08 00:46:41,896][52060] Updated weights for policy 0, policy_version 22250 (0.0010) [2023-10-08 00:46:42,277][52060] Updated weights for policy 0, policy_version 22260 (0.0009) [2023-10-08 00:46:42,638][52060] Updated weights for policy 0, policy_version 22270 (0.0010) [2023-10-08 00:46:43,263][52059] Updated weights for policy 1, policy_version 22532 (0.0011) [2023-10-08 00:46:43,633][52059] Updated weights for policy 1, policy_version 22542 (0.0008) [2023-10-08 00:46:44,003][52059] Updated weights for policy 1, policy_version 22552 (0.0007) [2023-10-08 00:46:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 45907968. Throughput: 0: 1738.9, 1: 1717.4. Samples: 11490642. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:46:46,211][50642] Avg episode reward: [(0, '16.610'), (1, '15.560')] [2023-10-08 00:46:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000022560_23101440.pth... [2023-10-08 00:46:46,248][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000020960_21463040.pth [2023-10-08 00:46:46,488][52060] Updated weights for policy 0, policy_version 22280 (0.0007) [2023-10-08 00:46:46,856][52060] Updated weights for policy 0, policy_version 22290 (0.0011) [2023-10-08 00:46:47,223][52060] Updated weights for policy 0, policy_version 22300 (0.0007) [2023-10-08 00:46:47,370][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000022304_22839296.pth... [2023-10-08 00:46:47,398][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000020672_21168128.pth [2023-10-08 00:46:47,763][52059] Updated weights for policy 1, policy_version 22562 (0.0008) [2023-10-08 00:46:48,128][52059] Updated weights for policy 1, policy_version 22572 (0.0009) [2023-10-08 00:46:48,496][52059] Updated weights for policy 1, policy_version 22582 (0.0009) [2023-10-08 00:46:48,849][52059] Updated weights for policy 1, policy_version 22592 (0.0010) [2023-10-08 00:46:51,054][52060] Updated weights for policy 0, policy_version 22310 (0.0008) [2023-10-08 00:46:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 45973504. Throughput: 0: 1715.4, 1: 1711.7. Samples: 11500126. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:46:51,211][50642] Avg episode reward: [(0, '16.370'), (1, '15.760')] [2023-10-08 00:46:51,412][52060] Updated weights for policy 0, policy_version 22320 (0.0008) [2023-10-08 00:46:51,777][52060] Updated weights for policy 0, policy_version 22330 (0.0010) [2023-10-08 00:46:52,815][52059] Updated weights for policy 1, policy_version 22602 (0.0009) [2023-10-08 00:46:53,174][52059] Updated weights for policy 1, policy_version 22612 (0.0009) [2023-10-08 00:46:53,542][52059] Updated weights for policy 1, policy_version 22622 (0.0007) [2023-10-08 00:46:55,862][52060] Updated weights for policy 0, policy_version 22340 (0.0010) [2023-10-08 00:46:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 46039040. Throughput: 0: 1741.9, 1: 1713.5. Samples: 11521612. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:46:56,211][50642] Avg episode reward: [(0, '15.010'), (1, '16.630')] [2023-10-08 00:46:56,228][52060] Updated weights for policy 0, policy_version 22350 (0.0008) [2023-10-08 00:46:56,615][52060] Updated weights for policy 0, policy_version 22360 (0.0008) [2023-10-08 00:46:57,376][52059] Updated weights for policy 1, policy_version 22632 (0.0009) [2023-10-08 00:46:57,737][52059] Updated weights for policy 1, policy_version 22642 (0.0008) [2023-10-08 00:46:58,103][52059] Updated weights for policy 1, policy_version 22652 (0.0008) [2023-10-08 00:47:00,735][52060] Updated weights for policy 0, policy_version 22370 (0.0011) [2023-10-08 00:47:01,124][52060] Updated weights for policy 0, policy_version 22380 (0.0011) [2023-10-08 00:47:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 46104576. Throughput: 0: 1732.3, 1: 1748.0. Samples: 11542970. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:47:01,211][50642] Avg episode reward: [(0, '16.420'), (1, '16.460')] [2023-10-08 00:47:01,499][52060] Updated weights for policy 0, policy_version 22390 (0.0010) [2023-10-08 00:47:01,872][52060] Updated weights for policy 0, policy_version 22400 (0.0008) [2023-10-08 00:47:01,875][52059] Updated weights for policy 1, policy_version 22662 (0.0008) [2023-10-08 00:47:02,248][52059] Updated weights for policy 1, policy_version 22672 (0.0009) [2023-10-08 00:47:02,621][52059] Updated weights for policy 1, policy_version 22682 (0.0010) [2023-10-08 00:47:05,918][52060] Updated weights for policy 0, policy_version 22410 (0.0009) [2023-10-08 00:47:06,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 46170112. Throughput: 0: 1728.7, 1: 1720.9. Samples: 11552540. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 00:47:06,211][50642] Avg episode reward: [(0, '14.640'), (1, '18.850')] [2023-10-08 00:47:06,295][52060] Updated weights for policy 0, policy_version 22420 (0.0011) [2023-10-08 00:47:06,611][52059] Updated weights for policy 1, policy_version 22692 (0.0010) [2023-10-08 00:47:06,667][52060] Updated weights for policy 0, policy_version 22430 (0.0008) [2023-10-08 00:47:06,972][52059] Updated weights for policy 1, policy_version 22702 (0.0008) [2023-10-08 00:47:07,344][52059] Updated weights for policy 1, policy_version 22712 (0.0007) [2023-10-08 00:47:10,571][52060] Updated weights for policy 0, policy_version 22440 (0.0008) [2023-10-08 00:47:10,945][52060] Updated weights for policy 0, policy_version 22450 (0.0007) [2023-10-08 00:47:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 46235648. Throughput: 0: 1734.8, 1: 1743.3. Samples: 11573934. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 00:47:11,211][50642] Avg episode reward: [(0, '15.390'), (1, '20.090')] [2023-10-08 00:47:11,295][52059] Updated weights for policy 1, policy_version 22722 (0.0009) [2023-10-08 00:47:11,316][52060] Updated weights for policy 0, policy_version 22460 (0.0007) [2023-10-08 00:47:11,663][52059] Updated weights for policy 1, policy_version 22732 (0.0008) [2023-10-08 00:47:12,032][52059] Updated weights for policy 1, policy_version 22742 (0.0009) [2023-10-08 00:47:12,397][52059] Updated weights for policy 1, policy_version 22752 (0.0009) [2023-10-08 00:47:15,308][52060] Updated weights for policy 0, policy_version 22470 (0.0009) [2023-10-08 00:47:15,684][52060] Updated weights for policy 0, policy_version 22480 (0.0008) [2023-10-08 00:47:16,050][52060] Updated weights for policy 0, policy_version 22490 (0.0008) [2023-10-08 00:47:16,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 46301184. Throughput: 0: 1709.6, 1: 1760.9. Samples: 11594564. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 00:47:16,211][50642] Avg episode reward: [(0, '17.220'), (1, '16.550')] [2023-10-08 00:47:16,243][52059] Updated weights for policy 1, policy_version 22762 (0.0009) [2023-10-08 00:47:16,604][52059] Updated weights for policy 1, policy_version 22772 (0.0007) [2023-10-08 00:47:16,970][52059] Updated weights for policy 1, policy_version 22782 (0.0007) [2023-10-08 00:47:19,989][52060] Updated weights for policy 0, policy_version 22500 (0.0007) [2023-10-08 00:47:20,357][52060] Updated weights for policy 0, policy_version 22510 (0.0009) [2023-10-08 00:47:20,734][52060] Updated weights for policy 0, policy_version 22520 (0.0008) [2023-10-08 00:47:20,849][52059] Updated weights for policy 1, policy_version 22792 (0.0007) [2023-10-08 00:47:21,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 46399488. Throughput: 0: 1729.2, 1: 1736.0. Samples: 11604818. Policy #0 lag: (min: 9.0, avg: 22.9, max: 41.0) [2023-10-08 00:47:21,211][50642] Avg episode reward: [(0, '15.480'), (1, '18.440')] [2023-10-08 00:47:21,229][52059] Updated weights for policy 1, policy_version 22802 (0.0010) [2023-10-08 00:47:21,582][52059] Updated weights for policy 1, policy_version 22812 (0.0010) [2023-10-08 00:47:24,599][52060] Updated weights for policy 0, policy_version 22530 (0.0008) [2023-10-08 00:47:24,973][52060] Updated weights for policy 0, policy_version 22540 (0.0007) [2023-10-08 00:47:25,335][52060] Updated weights for policy 0, policy_version 22550 (0.0008) [2023-10-08 00:47:25,573][52059] Updated weights for policy 1, policy_version 22822 (0.0009) [2023-10-08 00:47:25,701][52060] Updated weights for policy 0, policy_version 22560 (0.0009) [2023-10-08 00:47:25,935][52059] Updated weights for policy 1, policy_version 22832 (0.0009) [2023-10-08 00:47:26,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 46465024. Throughput: 0: 1726.0, 1: 1758.8. Samples: 11625924. Policy #0 lag: (min: 9.0, avg: 22.9, max: 41.0) [2023-10-08 00:47:26,211][50642] Avg episode reward: [(0, '16.810'), (1, '21.260')] [2023-10-08 00:47:26,306][52059] Updated weights for policy 1, policy_version 22842 (0.0009) [2023-10-08 00:47:29,437][52060] Updated weights for policy 0, policy_version 22570 (0.0007) [2023-10-08 00:47:29,822][52060] Updated weights for policy 0, policy_version 22580 (0.0008) [2023-10-08 00:47:30,191][52060] Updated weights for policy 0, policy_version 22590 (0.0008) [2023-10-08 00:47:30,271][52059] Updated weights for policy 1, policy_version 22852 (0.0010) [2023-10-08 00:47:30,646][52059] Updated weights for policy 1, policy_version 22862 (0.0010) [2023-10-08 00:47:31,007][52059] Updated weights for policy 1, policy_version 22872 (0.0009) [2023-10-08 00:47:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 46530560. Throughput: 0: 1698.0, 1: 1741.3. Samples: 11645408. Policy #0 lag: (min: 9.0, avg: 22.9, max: 41.0) [2023-10-08 00:47:31,211][50642] Avg episode reward: [(0, '15.990'), (1, '18.450')] [2023-10-08 00:47:34,004][52060] Updated weights for policy 0, policy_version 22600 (0.0008) [2023-10-08 00:47:34,366][52060] Updated weights for policy 0, policy_version 22610 (0.0009) [2023-10-08 00:47:34,735][52060] Updated weights for policy 0, policy_version 22620 (0.0008) [2023-10-08 00:47:34,988][52059] Updated weights for policy 1, policy_version 22882 (0.0010) [2023-10-08 00:47:35,382][52059] Updated weights for policy 1, policy_version 22892 (0.0008) [2023-10-08 00:47:35,749][52059] Updated weights for policy 1, policy_version 22902 (0.0010) [2023-10-08 00:47:36,113][52059] Updated weights for policy 1, policy_version 22912 (0.0008) [2023-10-08 00:47:36,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 46628864. Throughput: 0: 1729.9, 1: 1755.6. Samples: 11656976. Policy #0 lag: (min: 9.0, avg: 22.9, max: 41.0) [2023-10-08 00:47:36,211][50642] Avg episode reward: [(0, '16.040'), (1, '18.260')] [2023-10-08 00:47:38,853][52060] Updated weights for policy 0, policy_version 22630 (0.0008) [2023-10-08 00:47:39,229][52060] Updated weights for policy 0, policy_version 22640 (0.0008) [2023-10-08 00:47:39,607][52060] Updated weights for policy 0, policy_version 22650 (0.0007) [2023-10-08 00:47:39,974][52059] Updated weights for policy 1, policy_version 22922 (0.0008) [2023-10-08 00:47:40,345][52059] Updated weights for policy 1, policy_version 22932 (0.0008) [2023-10-08 00:47:40,714][52059] Updated weights for policy 1, policy_version 22942 (0.0010) [2023-10-08 00:47:41,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 46694400. Throughput: 0: 1703.7, 1: 1748.1. Samples: 11676946. Policy #0 lag: (min: 7.0, avg: 10.2, max: 39.0) [2023-10-08 00:47:41,211][50642] Avg episode reward: [(0, '17.110'), (1, '19.590')] [2023-10-08 00:47:43,866][52060] Updated weights for policy 0, policy_version 22660 (0.0008) [2023-10-08 00:47:44,234][52060] Updated weights for policy 0, policy_version 22670 (0.0007) [2023-10-08 00:47:44,509][52059] Updated weights for policy 1, policy_version 22952 (0.0009) [2023-10-08 00:47:44,609][52060] Updated weights for policy 0, policy_version 22680 (0.0009) [2023-10-08 00:47:44,861][52059] Updated weights for policy 1, policy_version 22962 (0.0009) [2023-10-08 00:47:45,225][52059] Updated weights for policy 1, policy_version 22972 (0.0009) [2023-10-08 00:47:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.8). Total num frames: 46759936. Throughput: 0: 1703.7, 1: 1720.6. Samples: 11697064. Policy #0 lag: (min: 7.0, avg: 10.2, max: 39.0) [2023-10-08 00:47:46,211][50642] Avg episode reward: [(0, '15.620'), (1, '17.510')] [2023-10-08 00:47:48,684][52060] Updated weights for policy 0, policy_version 22690 (0.0007) [2023-10-08 00:47:49,080][52059] Updated weights for policy 1, policy_version 22982 (0.0007) [2023-10-08 00:47:49,086][52060] Updated weights for policy 0, policy_version 22700 (0.0007) [2023-10-08 00:47:49,434][52059] Updated weights for policy 1, policy_version 22992 (0.0007) [2023-10-08 00:47:49,459][52060] Updated weights for policy 0, policy_version 22710 (0.0008) [2023-10-08 00:47:49,802][52059] Updated weights for policy 1, policy_version 23002 (0.0011) [2023-10-08 00:47:49,824][52060] Updated weights for policy 0, policy_version 22720 (0.0009) [2023-10-08 00:47:51,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 46825472. Throughput: 0: 1719.1, 1: 1754.0. Samples: 11708830. Policy #0 lag: (min: 7.0, avg: 10.2, max: 39.0) [2023-10-08 00:47:51,211][50642] Avg episode reward: [(0, '16.930'), (1, '16.630')] [2023-10-08 00:47:53,767][52059] Updated weights for policy 1, policy_version 23012 (0.0011) [2023-10-08 00:47:53,817][52060] Updated weights for policy 0, policy_version 22730 (0.0010) [2023-10-08 00:47:54,129][52059] Updated weights for policy 1, policy_version 23022 (0.0009) [2023-10-08 00:47:54,185][52060] Updated weights for policy 0, policy_version 22740 (0.0008) [2023-10-08 00:47:54,491][52059] Updated weights for policy 1, policy_version 23032 (0.0008) [2023-10-08 00:47:54,558][52060] Updated weights for policy 0, policy_version 22750 (0.0009) [2023-10-08 00:47:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 46891008. Throughput: 0: 1692.9, 1: 1720.3. Samples: 11727530. Policy #0 lag: (min: 7.0, avg: 10.2, max: 39.0) [2023-10-08 00:47:56,211][50642] Avg episode reward: [(0, '16.550'), (1, '17.820')] [2023-10-08 00:47:58,570][52059] Updated weights for policy 1, policy_version 23042 (0.0008) [2023-10-08 00:47:58,618][52060] Updated weights for policy 0, policy_version 22760 (0.0007) [2023-10-08 00:47:58,940][52059] Updated weights for policy 1, policy_version 23052 (0.0009) [2023-10-08 00:47:58,989][52060] Updated weights for policy 0, policy_version 22770 (0.0008) [2023-10-08 00:47:59,308][52059] Updated weights for policy 1, policy_version 23062 (0.0007) [2023-10-08 00:47:59,357][52060] Updated weights for policy 0, policy_version 22780 (0.0009) [2023-10-08 00:47:59,680][52059] Updated weights for policy 1, policy_version 23072 (0.0009) [2023-10-08 00:48:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 46956544. Throughput: 0: 1708.1, 1: 1715.7. Samples: 11748636. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 00:48:01,211][50642] Avg episode reward: [(0, '15.670'), (1, '17.750')] [2023-10-08 00:48:03,344][52060] Updated weights for policy 0, policy_version 22790 (0.0009) [2023-10-08 00:48:03,563][52059] Updated weights for policy 1, policy_version 23082 (0.0007) [2023-10-08 00:48:03,702][52060] Updated weights for policy 0, policy_version 22800 (0.0007) [2023-10-08 00:48:03,928][52059] Updated weights for policy 1, policy_version 23092 (0.0010) [2023-10-08 00:48:04,067][52060] Updated weights for policy 0, policy_version 22810 (0.0007) [2023-10-08 00:48:04,285][52059] Updated weights for policy 1, policy_version 23102 (0.0008) [2023-10-08 00:48:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 47022080. Throughput: 0: 1701.1, 1: 1732.8. Samples: 11759342. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 00:48:06,211][50642] Avg episode reward: [(0, '16.930'), (1, '18.360')] [2023-10-08 00:48:07,919][52060] Updated weights for policy 0, policy_version 22820 (0.0009) [2023-10-08 00:48:08,134][52059] Updated weights for policy 1, policy_version 23112 (0.0008) [2023-10-08 00:48:08,279][52060] Updated weights for policy 0, policy_version 22830 (0.0008) [2023-10-08 00:48:08,495][52059] Updated weights for policy 1, policy_version 23122 (0.0008) [2023-10-08 00:48:08,642][52060] Updated weights for policy 0, policy_version 22840 (0.0009) [2023-10-08 00:48:08,859][52059] Updated weights for policy 1, policy_version 23132 (0.0007) [2023-10-08 00:48:11,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 47087616. Throughput: 0: 1699.9, 1: 1719.3. Samples: 11779786. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 00:48:11,211][50642] Avg episode reward: [(0, '15.400'), (1, '20.340')] [2023-10-08 00:48:12,584][52060] Updated weights for policy 0, policy_version 22850 (0.0009) [2023-10-08 00:48:12,948][52059] Updated weights for policy 1, policy_version 23142 (0.0010) [2023-10-08 00:48:12,949][52060] Updated weights for policy 0, policy_version 22860 (0.0007) [2023-10-08 00:48:13,306][52059] Updated weights for policy 1, policy_version 23152 (0.0007) [2023-10-08 00:48:13,319][52060] Updated weights for policy 0, policy_version 22870 (0.0008) [2023-10-08 00:48:13,676][52059] Updated weights for policy 1, policy_version 23162 (0.0008) [2023-10-08 00:48:13,687][52060] Updated weights for policy 0, policy_version 22880 (0.0008) [2023-10-08 00:48:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 47153152. Throughput: 0: 1718.4, 1: 1731.2. Samples: 11800644. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 00:48:16,211][50642] Avg episode reward: [(0, '17.400'), (1, '17.430')] [2023-10-08 00:48:17,561][52059] Updated weights for policy 1, policy_version 23172 (0.0008) [2023-10-08 00:48:17,722][52060] Updated weights for policy 0, policy_version 22890 (0.0007) [2023-10-08 00:48:17,928][52059] Updated weights for policy 1, policy_version 23182 (0.0007) [2023-10-08 00:48:18,084][52060] Updated weights for policy 0, policy_version 22900 (0.0008) [2023-10-08 00:48:18,289][52059] Updated weights for policy 1, policy_version 23192 (0.0007) [2023-10-08 00:48:18,450][52060] Updated weights for policy 0, policy_version 22910 (0.0007) [2023-10-08 00:48:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 47218688. Throughput: 0: 1684.1, 1: 1718.0. Samples: 11810074. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) [2023-10-08 00:48:21,211][50642] Avg episode reward: [(0, '15.860'), (1, '15.900')] [2023-10-08 00:48:22,261][52059] Updated weights for policy 1, policy_version 23202 (0.0007) [2023-10-08 00:48:22,346][52060] Updated weights for policy 0, policy_version 22920 (0.0008) [2023-10-08 00:48:22,622][52059] Updated weights for policy 1, policy_version 23212 (0.0007) [2023-10-08 00:48:22,712][52060] Updated weights for policy 0, policy_version 22930 (0.0010) [2023-10-08 00:48:22,996][52059] Updated weights for policy 1, policy_version 23222 (0.0008) [2023-10-08 00:48:23,080][52060] Updated weights for policy 0, policy_version 22940 (0.0010) [2023-10-08 00:48:23,362][52059] Updated weights for policy 1, policy_version 23232 (0.0009) [2023-10-08 00:48:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 47284224. Throughput: 0: 1704.2, 1: 1724.4. Samples: 11831232. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) [2023-10-08 00:48:26,211][50642] Avg episode reward: [(0, '15.640'), (1, '20.180')] [2023-10-08 00:48:26,933][52060] Updated weights for policy 0, policy_version 22950 (0.0009) [2023-10-08 00:48:27,302][52060] Updated weights for policy 0, policy_version 22960 (0.0008) [2023-10-08 00:48:27,375][52059] Updated weights for policy 1, policy_version 23242 (0.0007) [2023-10-08 00:48:27,667][52060] Updated weights for policy 0, policy_version 22970 (0.0008) [2023-10-08 00:48:27,753][52059] Updated weights for policy 1, policy_version 23252 (0.0008) [2023-10-08 00:48:28,116][52059] Updated weights for policy 1, policy_version 23262 (0.0008) [2023-10-08 00:48:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 47349760. Throughput: 0: 1714.1, 1: 1741.5. Samples: 11852568. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) [2023-10-08 00:48:31,211][50642] Avg episode reward: [(0, '17.730'), (1, '16.930')] [2023-10-08 00:48:31,734][52060] Updated weights for policy 0, policy_version 22980 (0.0008) [2023-10-08 00:48:31,980][52059] Updated weights for policy 1, policy_version 23272 (0.0008) [2023-10-08 00:48:32,100][52060] Updated weights for policy 0, policy_version 22990 (0.0008) [2023-10-08 00:48:32,344][52059] Updated weights for policy 1, policy_version 23282 (0.0008) [2023-10-08 00:48:32,462][52060] Updated weights for policy 0, policy_version 23000 (0.0008) [2023-10-08 00:48:32,710][52059] Updated weights for policy 1, policy_version 23292 (0.0008) [2023-10-08 00:48:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 47415296. Throughput: 0: 1689.7, 1: 1708.3. Samples: 11861740. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) [2023-10-08 00:48:36,211][50642] Avg episode reward: [(0, '14.770'), (1, '16.400')] [2023-10-08 00:48:36,567][52060] Updated weights for policy 0, policy_version 23010 (0.0007) [2023-10-08 00:48:36,795][52059] Updated weights for policy 1, policy_version 23302 (0.0007) [2023-10-08 00:48:36,972][52060] Updated weights for policy 0, policy_version 23020 (0.0008) [2023-10-08 00:48:37,150][52059] Updated weights for policy 1, policy_version 23312 (0.0007) [2023-10-08 00:48:37,345][52060] Updated weights for policy 0, policy_version 23030 (0.0008) [2023-10-08 00:48:37,513][52059] Updated weights for policy 1, policy_version 23322 (0.0007) [2023-10-08 00:48:37,716][52060] Updated weights for policy 0, policy_version 23040 (0.0008) [2023-10-08 00:48:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 47480832. Throughput: 0: 1710.9, 1: 1742.1. Samples: 11882918. Policy #0 lag: (min: 2.0, avg: 2.1, max: 8.0) [2023-10-08 00:48:41,211][50642] Avg episode reward: [(0, '16.220'), (1, '18.890')] [2023-10-08 00:48:41,457][52059] Updated weights for policy 1, policy_version 23332 (0.0007) [2023-10-08 00:48:41,745][52060] Updated weights for policy 0, policy_version 23050 (0.0008) [2023-10-08 00:48:41,818][52059] Updated weights for policy 1, policy_version 23342 (0.0008) [2023-10-08 00:48:42,110][52060] Updated weights for policy 0, policy_version 23060 (0.0010) [2023-10-08 00:48:42,171][52059] Updated weights for policy 1, policy_version 23352 (0.0009) [2023-10-08 00:48:42,480][52060] Updated weights for policy 0, policy_version 23070 (0.0007) [2023-10-08 00:48:46,144][52059] Updated weights for policy 1, policy_version 23362 (0.0009) [2023-10-08 00:48:46,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 47546368. Throughput: 0: 1714.8, 1: 1742.6. Samples: 11904218. Policy #0 lag: (min: 2.0, avg: 2.1, max: 8.0) [2023-10-08 00:48:46,211][50642] Avg episode reward: [(0, '16.300'), (1, '19.080')] [2023-10-08 00:48:46,398][52060] Updated weights for policy 0, policy_version 23080 (0.0007) [2023-10-08 00:48:46,502][52059] Updated weights for policy 1, policy_version 23372 (0.0007) [2023-10-08 00:48:46,771][52060] Updated weights for policy 0, policy_version 23090 (0.0008) [2023-10-08 00:48:46,867][52059] Updated weights for policy 1, policy_version 23382 (0.0007) [2023-10-08 00:48:47,136][52060] Updated weights for policy 0, policy_version 23100 (0.0008) [2023-10-08 00:48:47,235][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000023392_23953408.pth... [2023-10-08 00:48:47,238][52059] Updated weights for policy 1, policy_version 23392 (0.0008) [2023-10-08 00:48:47,273][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000021760_22282240.pth [2023-10-08 00:48:47,282][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000023104_23658496.pth... [2023-10-08 00:48:47,310][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000021472_21987328.pth [2023-10-08 00:48:51,151][52060] Updated weights for policy 0, policy_version 23110 (0.0008) [2023-10-08 00:48:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 47611904. Throughput: 0: 1702.2, 1: 1724.7. Samples: 11913556. Policy #0 lag: (min: 2.0, avg: 2.1, max: 8.0) [2023-10-08 00:48:51,211][50642] Avg episode reward: [(0, '15.710'), (1, '16.150')] [2023-10-08 00:48:51,267][52059] Updated weights for policy 1, policy_version 23402 (0.0009) [2023-10-08 00:48:51,519][52060] Updated weights for policy 0, policy_version 23120 (0.0008) [2023-10-08 00:48:51,621][52059] Updated weights for policy 1, policy_version 23412 (0.0007) [2023-10-08 00:48:51,889][52060] Updated weights for policy 0, policy_version 23130 (0.0008) [2023-10-08 00:48:51,991][52059] Updated weights for policy 1, policy_version 23422 (0.0007) [2023-10-08 00:48:55,716][52059] Updated weights for policy 1, policy_version 23432 (0.0008) [2023-10-08 00:48:56,053][52060] Updated weights for policy 0, policy_version 23140 (0.0007) [2023-10-08 00:48:56,076][52059] Updated weights for policy 1, policy_version 23442 (0.0008) [2023-10-08 00:48:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 47677440. Throughput: 0: 1703.2, 1: 1737.1. Samples: 11934602. Policy #0 lag: (min: 2.0, avg: 2.1, max: 8.0) [2023-10-08 00:48:56,211][50642] Avg episode reward: [(0, '17.730'), (1, '18.590')] [2023-10-08 00:48:56,425][52060] Updated weights for policy 0, policy_version 23150 (0.0007) [2023-10-08 00:48:56,441][52059] Updated weights for policy 1, policy_version 23452 (0.0007) [2023-10-08 00:48:56,793][52060] Updated weights for policy 0, policy_version 23160 (0.0007) [2023-10-08 00:49:00,230][52059] Updated weights for policy 1, policy_version 23462 (0.0007) [2023-10-08 00:49:00,597][52059] Updated weights for policy 1, policy_version 23472 (0.0008) [2023-10-08 00:49:00,715][52060] Updated weights for policy 0, policy_version 23170 (0.0008) [2023-10-08 00:49:00,963][52059] Updated weights for policy 1, policy_version 23482 (0.0010) [2023-10-08 00:49:01,083][52060] Updated weights for policy 0, policy_version 23180 (0.0008) [2023-10-08 00:49:01,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 47775744. Throughput: 0: 1700.0, 1: 1723.4. Samples: 11954700. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:49:01,211][50642] Avg episode reward: [(0, '14.920'), (1, '19.160')] [2023-10-08 00:49:01,463][52060] Updated weights for policy 0, policy_version 23190 (0.0009) [2023-10-08 00:49:01,832][52060] Updated weights for policy 0, policy_version 23200 (0.0009) [2023-10-08 00:49:04,896][52059] Updated weights for policy 1, policy_version 23492 (0.0009) [2023-10-08 00:49:05,261][52059] Updated weights for policy 1, policy_version 23502 (0.0009) [2023-10-08 00:49:05,613][52059] Updated weights for policy 1, policy_version 23512 (0.0008) [2023-10-08 00:49:05,820][52060] Updated weights for policy 0, policy_version 23210 (0.0008) [2023-10-08 00:49:06,176][52060] Updated weights for policy 0, policy_version 23220 (0.0010) [2023-10-08 00:49:06,210][50642] Fps is (10 sec: 16384.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 47841280. Throughput: 0: 1705.2, 1: 1742.7. Samples: 11965228. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:49:06,211][50642] Avg episode reward: [(0, '16.000'), (1, '15.760')] [2023-10-08 00:49:06,545][52060] Updated weights for policy 0, policy_version 23230 (0.0011) [2023-10-08 00:49:09,531][52059] Updated weights for policy 1, policy_version 23522 (0.0008) [2023-10-08 00:49:09,898][52059] Updated weights for policy 1, policy_version 23532 (0.0009) [2023-10-08 00:49:10,271][52059] Updated weights for policy 1, policy_version 23542 (0.0008) [2023-10-08 00:49:10,630][52059] Updated weights for policy 1, policy_version 23552 (0.0008) [2023-10-08 00:49:10,716][52060] Updated weights for policy 0, policy_version 23240 (0.0008) [2023-10-08 00:49:11,101][52060] Updated weights for policy 0, policy_version 23250 (0.0009) [2023-10-08 00:49:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 47906816. Throughput: 0: 1705.2, 1: 1738.5. Samples: 11986200. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:49:11,211][50642] Avg episode reward: [(0, '18.180'), (1, '16.990')] [2023-10-08 00:49:11,475][52060] Updated weights for policy 0, policy_version 23260 (0.0009) [2023-10-08 00:49:14,481][52059] Updated weights for policy 1, policy_version 23562 (0.0009) [2023-10-08 00:49:14,843][52059] Updated weights for policy 1, policy_version 23572 (0.0009) [2023-10-08 00:49:15,218][52059] Updated weights for policy 1, policy_version 23582 (0.0010) [2023-10-08 00:49:15,505][52060] Updated weights for policy 0, policy_version 23270 (0.0010) [2023-10-08 00:49:15,868][52060] Updated weights for policy 0, policy_version 23280 (0.0010) [2023-10-08 00:49:16,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 47972352. Throughput: 0: 1691.5, 1: 1725.6. Samples: 12006338. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:49:16,211][50642] Avg episode reward: [(0, '15.120'), (1, '20.150')] [2023-10-08 00:49:16,235][52060] Updated weights for policy 0, policy_version 23290 (0.0007) [2023-10-08 00:49:19,190][52059] Updated weights for policy 1, policy_version 23592 (0.0010) [2023-10-08 00:49:19,552][52059] Updated weights for policy 1, policy_version 23602 (0.0008) [2023-10-08 00:49:19,922][52059] Updated weights for policy 1, policy_version 23612 (0.0007) [2023-10-08 00:49:20,213][52060] Updated weights for policy 0, policy_version 23300 (0.0008) [2023-10-08 00:49:20,580][52060] Updated weights for policy 0, policy_version 23310 (0.0009) [2023-10-08 00:49:20,950][52060] Updated weights for policy 0, policy_version 23320 (0.0009) [2023-10-08 00:49:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 48037888. Throughput: 0: 1708.4, 1: 1758.8. Samples: 12017768. Policy #0 lag: (min: 27.0, avg: 27.7, max: 44.0) [2023-10-08 00:49:21,211][50642] Avg episode reward: [(0, '16.070'), (1, '16.390')] [2023-10-08 00:49:23,721][52059] Updated weights for policy 1, policy_version 23622 (0.0007) [2023-10-08 00:49:24,090][52059] Updated weights for policy 1, policy_version 23632 (0.0007) [2023-10-08 00:49:24,458][52059] Updated weights for policy 1, policy_version 23642 (0.0009) [2023-10-08 00:49:24,830][52060] Updated weights for policy 0, policy_version 23330 (0.0009) [2023-10-08 00:49:25,232][52060] Updated weights for policy 0, policy_version 23340 (0.0010) [2023-10-08 00:49:25,590][52060] Updated weights for policy 0, policy_version 23350 (0.0010) [2023-10-08 00:49:25,957][52060] Updated weights for policy 0, policy_version 23360 (0.0011) [2023-10-08 00:49:26,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 48136192. Throughput: 0: 1715.1, 1: 1727.2. Samples: 12037824. Policy #0 lag: (min: 27.0, avg: 27.7, max: 44.0) [2023-10-08 00:49:26,211][50642] Avg episode reward: [(0, '16.080'), (1, '16.820')] [2023-10-08 00:49:28,492][52059] Updated weights for policy 1, policy_version 23652 (0.0009) [2023-10-08 00:49:28,865][52059] Updated weights for policy 1, policy_version 23662 (0.0007) [2023-10-08 00:49:29,232][52059] Updated weights for policy 1, policy_version 23672 (0.0007) [2023-10-08 00:49:29,689][52060] Updated weights for policy 0, policy_version 23370 (0.0008) [2023-10-08 00:49:30,061][52060] Updated weights for policy 0, policy_version 23380 (0.0009) [2023-10-08 00:49:30,436][52060] Updated weights for policy 0, policy_version 23390 (0.0009) [2023-10-08 00:49:31,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 48201728. Throughput: 0: 1687.7, 1: 1728.2. Samples: 12057934. Policy #0 lag: (min: 27.0, avg: 27.7, max: 44.0) [2023-10-08 00:49:31,211][50642] Avg episode reward: [(0, '14.540'), (1, '21.890')] [2023-10-08 00:49:33,040][52059] Updated weights for policy 1, policy_version 23682 (0.0007) [2023-10-08 00:49:33,407][52059] Updated weights for policy 1, policy_version 23692 (0.0008) [2023-10-08 00:49:33,782][52059] Updated weights for policy 1, policy_version 23702 (0.0008) [2023-10-08 00:49:34,143][52059] Updated weights for policy 1, policy_version 23712 (0.0009) [2023-10-08 00:49:34,396][52060] Updated weights for policy 0, policy_version 23400 (0.0010) [2023-10-08 00:49:34,774][52060] Updated weights for policy 0, policy_version 23410 (0.0008) [2023-10-08 00:49:35,150][52060] Updated weights for policy 0, policy_version 23420 (0.0008) [2023-10-08 00:49:36,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 48267264. Throughput: 0: 1718.1, 1: 1738.3. Samples: 12069092. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:49:36,211][50642] Avg episode reward: [(0, '16.810'), (1, '16.840')] [2023-10-08 00:49:38,015][52059] Updated weights for policy 1, policy_version 23722 (0.0008) [2023-10-08 00:49:38,373][52059] Updated weights for policy 1, policy_version 23732 (0.0008) [2023-10-08 00:49:38,739][52059] Updated weights for policy 1, policy_version 23742 (0.0009) [2023-10-08 00:49:39,177][52060] Updated weights for policy 0, policy_version 23430 (0.0010) [2023-10-08 00:49:39,551][52060] Updated weights for policy 0, policy_version 23440 (0.0008) [2023-10-08 00:49:39,919][52060] Updated weights for policy 0, policy_version 23450 (0.0008) [2023-10-08 00:49:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 48332800. Throughput: 0: 1703.9, 1: 1730.0. Samples: 12089128. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:49:41,211][50642] Avg episode reward: [(0, '15.290'), (1, '17.440')] [2023-10-08 00:49:42,722][52059] Updated weights for policy 1, policy_version 23752 (0.0008) [2023-10-08 00:49:43,083][52059] Updated weights for policy 1, policy_version 23762 (0.0007) [2023-10-08 00:49:43,444][52059] Updated weights for policy 1, policy_version 23772 (0.0007) [2023-10-08 00:49:43,825][52060] Updated weights for policy 0, policy_version 23460 (0.0007) [2023-10-08 00:49:44,199][52060] Updated weights for policy 0, policy_version 23470 (0.0007) [2023-10-08 00:49:44,557][52060] Updated weights for policy 0, policy_version 23480 (0.0009) [2023-10-08 00:49:46,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 48398336. Throughput: 0: 1701.2, 1: 1752.5. Samples: 12110118. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:49:46,212][50642] Avg episode reward: [(0, '16.090'), (1, '20.600')] [2023-10-08 00:49:47,181][52059] Updated weights for policy 1, policy_version 23782 (0.0007) [2023-10-08 00:49:47,547][52059] Updated weights for policy 1, policy_version 23792 (0.0009) [2023-10-08 00:49:47,913][52059] Updated weights for policy 1, policy_version 23802 (0.0009) [2023-10-08 00:49:48,736][52060] Updated weights for policy 0, policy_version 23490 (0.0007) [2023-10-08 00:49:49,102][52060] Updated weights for policy 0, policy_version 23500 (0.0008) [2023-10-08 00:49:49,472][52060] Updated weights for policy 0, policy_version 23510 (0.0009) [2023-10-08 00:49:49,839][52060] Updated weights for policy 0, policy_version 23520 (0.0008) [2023-10-08 00:49:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 48463872. Throughput: 0: 1722.3, 1: 1732.4. Samples: 12120688. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:49:51,211][50642] Avg episode reward: [(0, '16.200'), (1, '18.490')] [2023-10-08 00:49:51,681][52059] Updated weights for policy 1, policy_version 23812 (0.0010) [2023-10-08 00:49:52,047][52059] Updated weights for policy 1, policy_version 23822 (0.0009) [2023-10-08 00:49:52,408][52059] Updated weights for policy 1, policy_version 23832 (0.0008) [2023-10-08 00:49:53,939][52060] Updated weights for policy 0, policy_version 23530 (0.0010) [2023-10-08 00:49:54,318][52060] Updated weights for policy 0, policy_version 23540 (0.0009) [2023-10-08 00:49:54,695][52060] Updated weights for policy 0, policy_version 23550 (0.0008) [2023-10-08 00:49:56,210][50642] Fps is (10 sec: 13107.7, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 48529408. Throughput: 0: 1696.0, 1: 1743.3. Samples: 12140966. Policy #0 lag: (min: 1.0, avg: 7.8, max: 33.0) [2023-10-08 00:49:56,211][50642] Avg episode reward: [(0, '15.480'), (1, '16.450')] [2023-10-08 00:49:56,336][52059] Updated weights for policy 1, policy_version 23842 (0.0008) [2023-10-08 00:49:56,710][52059] Updated weights for policy 1, policy_version 23852 (0.0008) [2023-10-08 00:49:57,078][52059] Updated weights for policy 1, policy_version 23862 (0.0009) [2023-10-08 00:49:57,441][52059] Updated weights for policy 1, policy_version 23872 (0.0008) [2023-10-08 00:49:58,636][52060] Updated weights for policy 0, policy_version 23560 (0.0009) [2023-10-08 00:49:59,002][52060] Updated weights for policy 0, policy_version 23570 (0.0007) [2023-10-08 00:49:59,376][52060] Updated weights for policy 0, policy_version 23580 (0.0007) [2023-10-08 00:50:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 48594944. Throughput: 0: 1708.6, 1: 1754.8. Samples: 12162194. Policy #0 lag: (min: 1.0, avg: 7.8, max: 33.0) [2023-10-08 00:50:01,211][50642] Avg episode reward: [(0, '18.100'), (1, '19.370')] [2023-10-08 00:50:01,490][52059] Updated weights for policy 1, policy_version 23882 (0.0008) [2023-10-08 00:50:01,863][52059] Updated weights for policy 1, policy_version 23892 (0.0009) [2023-10-08 00:50:02,213][52059] Updated weights for policy 1, policy_version 23902 (0.0011) [2023-10-08 00:50:03,357][52060] Updated weights for policy 0, policy_version 23590 (0.0008) [2023-10-08 00:50:03,719][52060] Updated weights for policy 0, policy_version 23600 (0.0008) [2023-10-08 00:50:04,087][52060] Updated weights for policy 0, policy_version 23610 (0.0010) [2023-10-08 00:50:06,150][52059] Updated weights for policy 1, policy_version 23912 (0.0009) [2023-10-08 00:50:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 48660480. Throughput: 0: 1707.8, 1: 1724.4. Samples: 12172218. Policy #0 lag: (min: 1.0, avg: 7.8, max: 33.0) [2023-10-08 00:50:06,211][50642] Avg episode reward: [(0, '15.490'), (1, '19.580')] [2023-10-08 00:50:06,524][52059] Updated weights for policy 1, policy_version 23922 (0.0009) [2023-10-08 00:50:06,886][52059] Updated weights for policy 1, policy_version 23932 (0.0007) [2023-10-08 00:50:07,985][52060] Updated weights for policy 0, policy_version 23620 (0.0008) [2023-10-08 00:50:08,349][52060] Updated weights for policy 0, policy_version 23630 (0.0008) [2023-10-08 00:50:08,708][52060] Updated weights for policy 0, policy_version 23640 (0.0007) [2023-10-08 00:50:10,531][52059] Updated weights for policy 1, policy_version 23942 (0.0007) [2023-10-08 00:50:10,903][52059] Updated weights for policy 1, policy_version 23952 (0.0010) [2023-10-08 00:50:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 48726016. Throughput: 0: 1693.6, 1: 1760.5. Samples: 12193256. Policy #0 lag: (min: 1.0, avg: 7.8, max: 33.0) [2023-10-08 00:50:11,211][50642] Avg episode reward: [(0, '16.480'), (1, '18.550')] [2023-10-08 00:50:11,271][52059] Updated weights for policy 1, policy_version 23962 (0.0011) [2023-10-08 00:50:12,907][52060] Updated weights for policy 0, policy_version 23650 (0.0009) [2023-10-08 00:50:13,306][52060] Updated weights for policy 0, policy_version 23660 (0.0009) [2023-10-08 00:50:13,665][52060] Updated weights for policy 0, policy_version 23670 (0.0008) [2023-10-08 00:50:14,035][52060] Updated weights for policy 0, policy_version 23680 (0.0008) [2023-10-08 00:50:15,121][52059] Updated weights for policy 1, policy_version 23972 (0.0007) [2023-10-08 00:50:15,485][52059] Updated weights for policy 1, policy_version 23982 (0.0007) [2023-10-08 00:50:15,854][52059] Updated weights for policy 1, policy_version 23992 (0.0010) [2023-10-08 00:50:16,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 48824320. Throughput: 0: 1712.3, 1: 1742.5. Samples: 12213396. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:50:16,211][50642] Avg episode reward: [(0, '16.020'), (1, '19.120')] [2023-10-08 00:50:17,946][52060] Updated weights for policy 0, policy_version 23690 (0.0009) [2023-10-08 00:50:18,316][52060] Updated weights for policy 0, policy_version 23700 (0.0009) [2023-10-08 00:50:18,687][52060] Updated weights for policy 0, policy_version 23710 (0.0009) [2023-10-08 00:50:19,811][52059] Updated weights for policy 1, policy_version 24002 (0.0008) [2023-10-08 00:50:20,176][52059] Updated weights for policy 1, policy_version 24012 (0.0008) [2023-10-08 00:50:20,540][52059] Updated weights for policy 1, policy_version 24022 (0.0011) [2023-10-08 00:50:20,901][52059] Updated weights for policy 1, policy_version 24032 (0.0009) [2023-10-08 00:50:21,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 48889856. Throughput: 0: 1683.1, 1: 1756.5. Samples: 12223874. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:50:21,211][50642] Avg episode reward: [(0, '15.490'), (1, '19.970')] [2023-10-08 00:50:22,725][52060] Updated weights for policy 0, policy_version 23720 (0.0008) [2023-10-08 00:50:23,094][52060] Updated weights for policy 0, policy_version 23730 (0.0007) [2023-10-08 00:50:23,458][52060] Updated weights for policy 0, policy_version 23740 (0.0007) [2023-10-08 00:50:24,782][52059] Updated weights for policy 1, policy_version 24042 (0.0009) [2023-10-08 00:50:25,143][52059] Updated weights for policy 1, policy_version 24052 (0.0009) [2023-10-08 00:50:25,503][52059] Updated weights for policy 1, policy_version 24062 (0.0008) [2023-10-08 00:50:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 48955392. Throughput: 0: 1704.0, 1: 1753.6. Samples: 12244718. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:50:26,211][50642] Avg episode reward: [(0, '17.080'), (1, '17.060')] [2023-10-08 00:50:27,356][52060] Updated weights for policy 0, policy_version 23750 (0.0008) [2023-10-08 00:50:27,736][52060] Updated weights for policy 0, policy_version 23760 (0.0009) [2023-10-08 00:50:28,100][52060] Updated weights for policy 0, policy_version 23770 (0.0008) [2023-10-08 00:50:29,390][52059] Updated weights for policy 1, policy_version 24072 (0.0008) [2023-10-08 00:50:29,748][52059] Updated weights for policy 1, policy_version 24082 (0.0009) [2023-10-08 00:50:30,113][52059] Updated weights for policy 1, policy_version 24092 (0.0008) [2023-10-08 00:50:31,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 49020928. Throughput: 0: 1715.4, 1: 1738.0. Samples: 12265522. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:50:31,211][50642] Avg episode reward: [(0, '15.170'), (1, '17.620')] [2023-10-08 00:50:32,192][52060] Updated weights for policy 0, policy_version 23780 (0.0010) [2023-10-08 00:50:32,562][52060] Updated weights for policy 0, policy_version 23790 (0.0008) [2023-10-08 00:50:32,927][52060] Updated weights for policy 0, policy_version 23800 (0.0008) [2023-10-08 00:50:33,994][52059] Updated weights for policy 1, policy_version 24102 (0.0008) [2023-10-08 00:50:34,352][52059] Updated weights for policy 1, policy_version 24112 (0.0009) [2023-10-08 00:50:34,714][52059] Updated weights for policy 1, policy_version 24122 (0.0009) [2023-10-08 00:50:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 49086464. Throughput: 0: 1691.4, 1: 1762.6. Samples: 12276118. Policy #0 lag: (min: 10.0, avg: 17.9, max: 42.0) [2023-10-08 00:50:36,211][50642] Avg episode reward: [(0, '16.520'), (1, '18.500')] [2023-10-08 00:50:36,681][52060] Updated weights for policy 0, policy_version 23810 (0.0008) [2023-10-08 00:50:37,058][52060] Updated weights for policy 0, policy_version 23820 (0.0010) [2023-10-08 00:50:37,429][52060] Updated weights for policy 0, policy_version 23830 (0.0008) [2023-10-08 00:50:37,807][52060] Updated weights for policy 0, policy_version 23840 (0.0009) [2023-10-08 00:50:38,616][52059] Updated weights for policy 1, policy_version 24132 (0.0009) [2023-10-08 00:50:38,985][52059] Updated weights for policy 1, policy_version 24142 (0.0008) [2023-10-08 00:50:39,352][52059] Updated weights for policy 1, policy_version 24152 (0.0008) [2023-10-08 00:50:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 49152000. Throughput: 0: 1722.1, 1: 1734.8. Samples: 12296526. Policy #0 lag: (min: 10.0, avg: 17.9, max: 42.0) [2023-10-08 00:50:41,211][50642] Avg episode reward: [(0, '16.530'), (1, '18.540')] [2023-10-08 00:50:41,769][52060] Updated weights for policy 0, policy_version 23850 (0.0007) [2023-10-08 00:50:42,141][52060] Updated weights for policy 0, policy_version 23860 (0.0008) [2023-10-08 00:50:42,498][52060] Updated weights for policy 0, policy_version 23870 (0.0008) [2023-10-08 00:50:43,195][52059] Updated weights for policy 1, policy_version 24162 (0.0007) [2023-10-08 00:50:43,559][52059] Updated weights for policy 1, policy_version 24172 (0.0007) [2023-10-08 00:50:43,920][52059] Updated weights for policy 1, policy_version 24182 (0.0008) [2023-10-08 00:50:44,291][52059] Updated weights for policy 1, policy_version 24192 (0.0009) [2023-10-08 00:50:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 49217536. Throughput: 0: 1718.7, 1: 1736.2. Samples: 12317664. Policy #0 lag: (min: 10.0, avg: 17.9, max: 42.0) [2023-10-08 00:50:46,211][50642] Avg episode reward: [(0, '15.540'), (1, '20.410')] [2023-10-08 00:50:46,220][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000024192_24772608.pth... [2023-10-08 00:50:46,258][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000022560_23101440.pth [2023-10-08 00:50:46,263][51710] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p1/milestones/checkpoint_000024192_24772608.pth [2023-10-08 00:50:46,542][52060] Updated weights for policy 0, policy_version 23880 (0.0007) [2023-10-08 00:50:46,905][52060] Updated weights for policy 0, policy_version 23890 (0.0007) [2023-10-08 00:50:47,281][52060] Updated weights for policy 0, policy_version 23900 (0.0007) [2023-10-08 00:50:47,420][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000023904_24477696.pth... [2023-10-08 00:50:47,451][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000022304_22839296.pth [2023-10-08 00:50:47,455][51605] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p0/milestones/checkpoint_000023904_24477696.pth [2023-10-08 00:50:48,376][52059] Updated weights for policy 1, policy_version 24202 (0.0010) [2023-10-08 00:50:48,739][52059] Updated weights for policy 1, policy_version 24212 (0.0010) [2023-10-08 00:50:49,109][52059] Updated weights for policy 1, policy_version 24222 (0.0009) [2023-10-08 00:50:51,203][52060] Updated weights for policy 0, policy_version 23910 (0.0008) [2023-10-08 00:50:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 49283072. Throughput: 0: 1706.9, 1: 1739.8. Samples: 12327320. Policy #0 lag: (min: 10.0, avg: 17.9, max: 42.0) [2023-10-08 00:50:51,211][50642] Avg episode reward: [(0, '16.760'), (1, '18.350')] [2023-10-08 00:50:51,577][52060] Updated weights for policy 0, policy_version 23920 (0.0008) [2023-10-08 00:50:51,947][52060] Updated weights for policy 0, policy_version 23930 (0.0008) [2023-10-08 00:50:53,116][52059] Updated weights for policy 1, policy_version 24232 (0.0008) [2023-10-08 00:50:53,484][52059] Updated weights for policy 1, policy_version 24242 (0.0009) [2023-10-08 00:50:53,843][52059] Updated weights for policy 1, policy_version 24252 (0.0008) [2023-10-08 00:50:55,939][52060] Updated weights for policy 0, policy_version 23940 (0.0008) [2023-10-08 00:50:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 49348608. Throughput: 0: 1717.1, 1: 1725.4. Samples: 12348168. Policy #0 lag: (min: 15.0, avg: 22.9, max: 47.0) [2023-10-08 00:50:56,211][50642] Avg episode reward: [(0, '15.370'), (1, '17.070')] [2023-10-08 00:50:56,311][52060] Updated weights for policy 0, policy_version 23950 (0.0009) [2023-10-08 00:50:56,683][52060] Updated weights for policy 0, policy_version 23960 (0.0010) [2023-10-08 00:50:57,864][52059] Updated weights for policy 1, policy_version 24262 (0.0008) [2023-10-08 00:50:58,246][52059] Updated weights for policy 1, policy_version 24272 (0.0010) [2023-10-08 00:50:58,609][52059] Updated weights for policy 1, policy_version 24282 (0.0011) [2023-10-08 00:51:00,518][52060] Updated weights for policy 0, policy_version 23970 (0.0009) [2023-10-08 00:51:00,890][52060] Updated weights for policy 0, policy_version 23980 (0.0007) [2023-10-08 00:51:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 49414144. Throughput: 0: 1717.8, 1: 1741.5. Samples: 12369062. Policy #0 lag: (min: 15.0, avg: 22.9, max: 47.0) [2023-10-08 00:51:01,211][50642] Avg episode reward: [(0, '15.760'), (1, '21.220')] [2023-10-08 00:51:01,252][52060] Updated weights for policy 0, policy_version 23990 (0.0007) [2023-10-08 00:51:01,625][52060] Updated weights for policy 0, policy_version 24000 (0.0007) [2023-10-08 00:51:02,544][52059] Updated weights for policy 1, policy_version 24292 (0.0010) [2023-10-08 00:51:02,915][52059] Updated weights for policy 1, policy_version 24302 (0.0007) [2023-10-08 00:51:03,282][52059] Updated weights for policy 1, policy_version 24312 (0.0007) [2023-10-08 00:51:05,611][52060] Updated weights for policy 0, policy_version 24010 (0.0010) [2023-10-08 00:51:05,983][52060] Updated weights for policy 0, policy_version 24020 (0.0010) [2023-10-08 00:51:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 49479680. Throughput: 0: 1727.3, 1: 1717.4. Samples: 12378884. Policy #0 lag: (min: 15.0, avg: 22.9, max: 47.0) [2023-10-08 00:51:06,211][50642] Avg episode reward: [(0, '17.120'), (1, '19.860')] [2023-10-08 00:51:06,349][52060] Updated weights for policy 0, policy_version 24030 (0.0009) [2023-10-08 00:51:07,121][52059] Updated weights for policy 1, policy_version 24322 (0.0008) [2023-10-08 00:51:07,492][52059] Updated weights for policy 1, policy_version 24332 (0.0009) [2023-10-08 00:51:07,855][52059] Updated weights for policy 1, policy_version 24342 (0.0009) [2023-10-08 00:51:08,223][52059] Updated weights for policy 1, policy_version 24352 (0.0008) [2023-10-08 00:51:10,378][52060] Updated weights for policy 0, policy_version 24040 (0.0010) [2023-10-08 00:51:10,743][52060] Updated weights for policy 0, policy_version 24050 (0.0008) [2023-10-08 00:51:11,106][52060] Updated weights for policy 0, policy_version 24060 (0.0009) [2023-10-08 00:51:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 49545216. Throughput: 0: 1724.6, 1: 1731.2. Samples: 12400230. Policy #0 lag: (min: 15.0, avg: 22.9, max: 47.0) [2023-10-08 00:51:11,211][50642] Avg episode reward: [(0, '15.490'), (1, '17.780')] [2023-10-08 00:51:12,123][52059] Updated weights for policy 1, policy_version 24362 (0.0007) [2023-10-08 00:51:12,493][52059] Updated weights for policy 1, policy_version 24372 (0.0008) [2023-10-08 00:51:12,858][52059] Updated weights for policy 1, policy_version 24382 (0.0009) [2023-10-08 00:51:15,074][52060] Updated weights for policy 0, policy_version 24070 (0.0011) [2023-10-08 00:51:15,444][52060] Updated weights for policy 0, policy_version 24080 (0.0008) [2023-10-08 00:51:15,817][52060] Updated weights for policy 0, policy_version 24090 (0.0010) [2023-10-08 00:51:16,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 49643520. Throughput: 0: 1700.4, 1: 1745.4. Samples: 12420582. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:51:16,211][50642] Avg episode reward: [(0, '16.700'), (1, '18.520')] [2023-10-08 00:51:16,801][52059] Updated weights for policy 1, policy_version 24392 (0.0007) [2023-10-08 00:51:17,169][52059] Updated weights for policy 1, policy_version 24402 (0.0008) [2023-10-08 00:51:17,523][52059] Updated weights for policy 1, policy_version 24412 (0.0009) [2023-10-08 00:51:19,738][52060] Updated weights for policy 0, policy_version 24100 (0.0010) [2023-10-08 00:51:20,116][52060] Updated weights for policy 0, policy_version 24110 (0.0008) [2023-10-08 00:51:20,484][52060] Updated weights for policy 0, policy_version 24120 (0.0007) [2023-10-08 00:51:21,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 49709056. Throughput: 0: 1727.0, 1: 1716.6. Samples: 12431082. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:51:21,211][50642] Avg episode reward: [(0, '16.190'), (1, '19.860')] [2023-10-08 00:51:21,648][52059] Updated weights for policy 1, policy_version 24422 (0.0009) [2023-10-08 00:51:22,019][52059] Updated weights for policy 1, policy_version 24432 (0.0011) [2023-10-08 00:51:22,383][52059] Updated weights for policy 1, policy_version 24442 (0.0007) [2023-10-08 00:51:24,508][52060] Updated weights for policy 0, policy_version 24130 (0.0010) [2023-10-08 00:51:24,871][52060] Updated weights for policy 0, policy_version 24140 (0.0011) [2023-10-08 00:51:25,235][52060] Updated weights for policy 0, policy_version 24150 (0.0008) [2023-10-08 00:51:25,605][52060] Updated weights for policy 0, policy_version 24160 (0.0010) [2023-10-08 00:51:26,209][52059] Updated weights for policy 1, policy_version 24452 (0.0007) [2023-10-08 00:51:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 49774592. Throughput: 0: 1713.6, 1: 1740.1. Samples: 12451942. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:51:26,211][50642] Avg episode reward: [(0, '15.240'), (1, '17.780')] [2023-10-08 00:51:26,573][52059] Updated weights for policy 1, policy_version 24462 (0.0007) [2023-10-08 00:51:26,934][52059] Updated weights for policy 1, policy_version 24472 (0.0007) [2023-10-08 00:51:29,641][52060] Updated weights for policy 0, policy_version 24170 (0.0009) [2023-10-08 00:51:30,004][52060] Updated weights for policy 0, policy_version 24180 (0.0007) [2023-10-08 00:51:30,369][52060] Updated weights for policy 0, policy_version 24190 (0.0007) [2023-10-08 00:51:30,794][52059] Updated weights for policy 1, policy_version 24482 (0.0009) [2023-10-08 00:51:31,159][52059] Updated weights for policy 1, policy_version 24492 (0.0008) [2023-10-08 00:51:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 49840128. Throughput: 0: 1693.9, 1: 1741.6. Samples: 12472262. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:51:31,211][50642] Avg episode reward: [(0, '16.710'), (1, '18.390')] [2023-10-08 00:51:31,530][52059] Updated weights for policy 1, policy_version 24502 (0.0008) [2023-10-08 00:51:31,889][52059] Updated weights for policy 1, policy_version 24512 (0.0009) [2023-10-08 00:51:34,220][52060] Updated weights for policy 0, policy_version 24200 (0.0008) [2023-10-08 00:51:34,589][52060] Updated weights for policy 0, policy_version 24210 (0.0007) [2023-10-08 00:51:34,968][52060] Updated weights for policy 0, policy_version 24220 (0.0007) [2023-10-08 00:51:35,898][52059] Updated weights for policy 1, policy_version 24522 (0.0009) [2023-10-08 00:51:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 49905664. Throughput: 0: 1730.3, 1: 1736.7. Samples: 12483334. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:51:36,211][50642] Avg episode reward: [(0, '15.880'), (1, '20.580')] [2023-10-08 00:51:36,262][52059] Updated weights for policy 1, policy_version 24532 (0.0008) [2023-10-08 00:51:36,630][52059] Updated weights for policy 1, policy_version 24542 (0.0009) [2023-10-08 00:51:38,929][52060] Updated weights for policy 0, policy_version 24230 (0.0009) [2023-10-08 00:51:39,299][52060] Updated weights for policy 0, policy_version 24240 (0.0007) [2023-10-08 00:51:39,665][52060] Updated weights for policy 0, policy_version 24250 (0.0008) [2023-10-08 00:51:40,312][52059] Updated weights for policy 1, policy_version 24552 (0.0008) [2023-10-08 00:51:40,676][52059] Updated weights for policy 1, policy_version 24562 (0.0010) [2023-10-08 00:51:41,038][52059] Updated weights for policy 1, policy_version 24572 (0.0009) [2023-10-08 00:51:41,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 50003968. Throughput: 0: 1707.9, 1: 1748.8. Samples: 12503722. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:51:41,211][50642] Avg episode reward: [(0, '16.490'), (1, '17.860')] [2023-10-08 00:51:43,636][52060] Updated weights for policy 0, policy_version 24260 (0.0008) [2023-10-08 00:51:44,004][52060] Updated weights for policy 0, policy_version 24270 (0.0011) [2023-10-08 00:51:44,366][52060] Updated weights for policy 0, policy_version 24280 (0.0010) [2023-10-08 00:51:44,887][52059] Updated weights for policy 1, policy_version 24582 (0.0010) [2023-10-08 00:51:45,259][52059] Updated weights for policy 1, policy_version 24592 (0.0008) [2023-10-08 00:51:45,627][52059] Updated weights for policy 1, policy_version 24602 (0.0011) [2023-10-08 00:51:46,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 50069504. Throughput: 0: 1714.3, 1: 1730.5. Samples: 12524076. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:51:46,211][50642] Avg episode reward: [(0, '16.480'), (1, '18.540')] [2023-10-08 00:51:48,311][52060] Updated weights for policy 0, policy_version 24290 (0.0008) [2023-10-08 00:51:48,714][52060] Updated weights for policy 0, policy_version 24300 (0.0010) [2023-10-08 00:51:49,089][52060] Updated weights for policy 0, policy_version 24310 (0.0007) [2023-10-08 00:51:49,445][52059] Updated weights for policy 1, policy_version 24612 (0.0009) [2023-10-08 00:51:49,452][52060] Updated weights for policy 0, policy_version 24320 (0.0007) [2023-10-08 00:51:49,814][52059] Updated weights for policy 1, policy_version 24622 (0.0008) [2023-10-08 00:51:50,186][52059] Updated weights for policy 1, policy_version 24632 (0.0008) [2023-10-08 00:51:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 50135040. Throughput: 0: 1715.5, 1: 1760.2. Samples: 12535288. Policy #0 lag: (min: 9.0, avg: 14.4, max: 41.0) [2023-10-08 00:51:51,211][50642] Avg episode reward: [(0, '16.470'), (1, '21.390')] [2023-10-08 00:51:53,359][52060] Updated weights for policy 0, policy_version 24330 (0.0007) [2023-10-08 00:51:53,737][52060] Updated weights for policy 0, policy_version 24340 (0.0010) [2023-10-08 00:51:54,052][52059] Updated weights for policy 1, policy_version 24642 (0.0008) [2023-10-08 00:51:54,104][52060] Updated weights for policy 0, policy_version 24350 (0.0007) [2023-10-08 00:51:54,406][52059] Updated weights for policy 1, policy_version 24652 (0.0008) [2023-10-08 00:51:54,779][52059] Updated weights for policy 1, policy_version 24662 (0.0009) [2023-10-08 00:51:55,138][52059] Updated weights for policy 1, policy_version 24672 (0.0009) [2023-10-08 00:51:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 50200576. Throughput: 0: 1702.3, 1: 1733.6. Samples: 12554844. Policy #0 lag: (min: 9.0, avg: 14.4, max: 41.0) [2023-10-08 00:51:56,211][50642] Avg episode reward: [(0, '17.040'), (1, '18.010')] [2023-10-08 00:51:58,054][52060] Updated weights for policy 0, policy_version 24360 (0.0008) [2023-10-08 00:51:58,433][52060] Updated weights for policy 0, policy_version 24370 (0.0010) [2023-10-08 00:51:58,795][52060] Updated weights for policy 0, policy_version 24380 (0.0009) [2023-10-08 00:51:59,137][52059] Updated weights for policy 1, policy_version 24682 (0.0010) [2023-10-08 00:51:59,508][52059] Updated weights for policy 1, policy_version 24692 (0.0008) [2023-10-08 00:51:59,862][52059] Updated weights for policy 1, policy_version 24702 (0.0010) [2023-10-08 00:52:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 50266112. Throughput: 0: 1724.0, 1: 1727.7. Samples: 12575908. Policy #0 lag: (min: 9.0, avg: 14.4, max: 41.0) [2023-10-08 00:52:01,211][50642] Avg episode reward: [(0, '16.510'), (1, '18.320')] [2023-10-08 00:52:02,882][52060] Updated weights for policy 0, policy_version 24390 (0.0009) [2023-10-08 00:52:03,255][52060] Updated weights for policy 0, policy_version 24400 (0.0010) [2023-10-08 00:52:03,631][52060] Updated weights for policy 0, policy_version 24410 (0.0009) [2023-10-08 00:52:03,802][52059] Updated weights for policy 1, policy_version 24712 (0.0009) [2023-10-08 00:52:04,170][52059] Updated weights for policy 1, policy_version 24722 (0.0009) [2023-10-08 00:52:04,536][52059] Updated weights for policy 1, policy_version 24732 (0.0008) [2023-10-08 00:52:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 50331648. Throughput: 0: 1696.4, 1: 1753.5. Samples: 12586324. Policy #0 lag: (min: 9.0, avg: 14.4, max: 41.0) [2023-10-08 00:52:06,211][50642] Avg episode reward: [(0, '17.000'), (1, '19.980')] [2023-10-08 00:52:07,461][52060] Updated weights for policy 0, policy_version 24420 (0.0009) [2023-10-08 00:52:07,828][52060] Updated weights for policy 0, policy_version 24430 (0.0007) [2023-10-08 00:52:08,195][52060] Updated weights for policy 0, policy_version 24440 (0.0010) [2023-10-08 00:52:08,460][52059] Updated weights for policy 1, policy_version 24742 (0.0009) [2023-10-08 00:52:08,819][52059] Updated weights for policy 1, policy_version 24752 (0.0008) [2023-10-08 00:52:09,184][52059] Updated weights for policy 1, policy_version 24762 (0.0007) [2023-10-08 00:52:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 50397184. Throughput: 0: 1711.7, 1: 1737.9. Samples: 12607174. Policy #0 lag: (min: 1.0, avg: 9.7, max: 33.0) [2023-10-08 00:52:11,211][50642] Avg episode reward: [(0, '16.530'), (1, '17.060')] [2023-10-08 00:52:12,149][52060] Updated weights for policy 0, policy_version 24450 (0.0007) [2023-10-08 00:52:12,525][52060] Updated weights for policy 0, policy_version 24460 (0.0008) [2023-10-08 00:52:12,889][52060] Updated weights for policy 0, policy_version 24470 (0.0010) [2023-10-08 00:52:13,249][52059] Updated weights for policy 1, policy_version 24772 (0.0009) [2023-10-08 00:52:13,262][52060] Updated weights for policy 0, policy_version 24480 (0.0008) [2023-10-08 00:52:13,611][52059] Updated weights for policy 1, policy_version 24782 (0.0008) [2023-10-08 00:52:13,974][52059] Updated weights for policy 1, policy_version 24792 (0.0007) [2023-10-08 00:52:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 50462720. Throughput: 0: 1734.6, 1: 1733.9. Samples: 12628344. Policy #0 lag: (min: 1.0, avg: 9.7, max: 33.0) [2023-10-08 00:52:16,211][50642] Avg episode reward: [(0, '15.920'), (1, '15.900')] [2023-10-08 00:52:17,277][52060] Updated weights for policy 0, policy_version 24490 (0.0008) [2023-10-08 00:52:17,649][52060] Updated weights for policy 0, policy_version 24500 (0.0009) [2023-10-08 00:52:17,907][52059] Updated weights for policy 1, policy_version 24802 (0.0007) [2023-10-08 00:52:18,008][52060] Updated weights for policy 0, policy_version 24510 (0.0007) [2023-10-08 00:52:18,274][52059] Updated weights for policy 1, policy_version 24812 (0.0010) [2023-10-08 00:52:18,652][52059] Updated weights for policy 1, policy_version 24822 (0.0010) [2023-10-08 00:52:19,014][52059] Updated weights for policy 1, policy_version 24832 (0.0008) [2023-10-08 00:52:21,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 50528256. Throughput: 0: 1697.6, 1: 1741.5. Samples: 12638092. Policy #0 lag: (min: 1.0, avg: 9.7, max: 33.0) [2023-10-08 00:52:21,211][50642] Avg episode reward: [(0, '16.940'), (1, '20.310')] [2023-10-08 00:52:21,931][52060] Updated weights for policy 0, policy_version 24520 (0.0009) [2023-10-08 00:52:22,302][52060] Updated weights for policy 0, policy_version 24530 (0.0007) [2023-10-08 00:52:22,667][52060] Updated weights for policy 0, policy_version 24540 (0.0007) [2023-10-08 00:52:22,869][52059] Updated weights for policy 1, policy_version 24842 (0.0008) [2023-10-08 00:52:23,227][52059] Updated weights for policy 1, policy_version 24852 (0.0011) [2023-10-08 00:52:23,601][52059] Updated weights for policy 1, policy_version 24862 (0.0011) [2023-10-08 00:52:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 50593792. Throughput: 0: 1722.2, 1: 1732.4. Samples: 12659180. Policy #0 lag: (min: 1.0, avg: 9.7, max: 33.0) [2023-10-08 00:52:26,211][50642] Avg episode reward: [(0, '16.490'), (1, '17.520')] [2023-10-08 00:52:26,589][52060] Updated weights for policy 0, policy_version 24550 (0.0007) [2023-10-08 00:52:26,954][52060] Updated weights for policy 0, policy_version 24560 (0.0008) [2023-10-08 00:52:27,325][52060] Updated weights for policy 0, policy_version 24570 (0.0007) [2023-10-08 00:52:27,472][52059] Updated weights for policy 1, policy_version 24872 (0.0011) [2023-10-08 00:52:27,828][52059] Updated weights for policy 1, policy_version 24882 (0.0008) [2023-10-08 00:52:28,195][52059] Updated weights for policy 1, policy_version 24892 (0.0008) [2023-10-08 00:52:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 50659328. Throughput: 0: 1723.3, 1: 1753.4. Samples: 12680528. Policy #0 lag: (min: 15.0, avg: 17.8, max: 47.0) [2023-10-08 00:52:31,211][50642] Avg episode reward: [(0, '16.410'), (1, '16.230')] [2023-10-08 00:52:31,304][52060] Updated weights for policy 0, policy_version 24580 (0.0008) [2023-10-08 00:52:31,671][52060] Updated weights for policy 0, policy_version 24590 (0.0009) [2023-10-08 00:52:32,035][52060] Updated weights for policy 0, policy_version 24600 (0.0008) [2023-10-08 00:52:32,228][52059] Updated weights for policy 1, policy_version 24902 (0.0007) [2023-10-08 00:52:32,600][52059] Updated weights for policy 1, policy_version 24912 (0.0010) [2023-10-08 00:52:32,966][52059] Updated weights for policy 1, policy_version 24922 (0.0010) [2023-10-08 00:52:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 50724864. Throughput: 0: 1709.5, 1: 1724.4. Samples: 12689816. Policy #0 lag: (min: 15.0, avg: 17.8, max: 47.0) [2023-10-08 00:52:36,212][50642] Avg episode reward: [(0, '16.600'), (1, '17.740')] [2023-10-08 00:52:36,306][52060] Updated weights for policy 0, policy_version 24610 (0.0009) [2023-10-08 00:52:36,718][52060] Updated weights for policy 0, policy_version 24620 (0.0007) [2023-10-08 00:52:36,862][52059] Updated weights for policy 1, policy_version 24932 (0.0009) [2023-10-08 00:52:37,087][52060] Updated weights for policy 0, policy_version 24630 (0.0008) [2023-10-08 00:52:37,217][52059] Updated weights for policy 1, policy_version 24942 (0.0008) [2023-10-08 00:52:37,459][52060] Updated weights for policy 0, policy_version 24640 (0.0008) [2023-10-08 00:52:37,581][52059] Updated weights for policy 1, policy_version 24952 (0.0008) [2023-10-08 00:52:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 50790400. Throughput: 0: 1724.0, 1: 1751.7. Samples: 12711252. Policy #0 lag: (min: 15.0, avg: 17.8, max: 47.0) [2023-10-08 00:52:41,211][50642] Avg episode reward: [(0, '16.040'), (1, '20.440')] [2023-10-08 00:52:41,382][52060] Updated weights for policy 0, policy_version 24650 (0.0009) [2023-10-08 00:52:41,500][52059] Updated weights for policy 1, policy_version 24962 (0.0008) [2023-10-08 00:52:41,738][52060] Updated weights for policy 0, policy_version 24660 (0.0008) [2023-10-08 00:52:41,858][52059] Updated weights for policy 1, policy_version 24972 (0.0009) [2023-10-08 00:52:42,105][52060] Updated weights for policy 0, policy_version 24670 (0.0007) [2023-10-08 00:52:42,225][52059] Updated weights for policy 1, policy_version 24982 (0.0009) [2023-10-08 00:52:42,588][52059] Updated weights for policy 1, policy_version 24992 (0.0010) [2023-10-08 00:52:46,121][52060] Updated weights for policy 0, policy_version 24680 (0.0009) [2023-10-08 00:52:46,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 50855936. Throughput: 0: 1720.2, 1: 1756.3. Samples: 12732350. Policy #0 lag: (min: 15.0, avg: 17.8, max: 47.0) [2023-10-08 00:52:46,211][50642] Avg episode reward: [(0, '16.490'), (1, '16.450')] [2023-10-08 00:52:46,500][52060] Updated weights for policy 0, policy_version 24690 (0.0009) [2023-10-08 00:52:46,505][52059] Updated weights for policy 1, policy_version 25002 (0.0009) [2023-10-08 00:52:46,862][52059] Updated weights for policy 1, policy_version 25012 (0.0010) [2023-10-08 00:52:46,871][52060] Updated weights for policy 0, policy_version 24700 (0.0010) [2023-10-08 00:52:47,018][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000024704_25296896.pth... [2023-10-08 00:52:47,057][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000023104_23658496.pth [2023-10-08 00:52:47,232][52059] Updated weights for policy 1, policy_version 25022 (0.0010) [2023-10-08 00:52:47,295][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000025024_25624576.pth... [2023-10-08 00:52:47,324][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000023392_23953408.pth [2023-10-08 00:52:50,845][52060] Updated weights for policy 0, policy_version 24710 (0.0010) [2023-10-08 00:52:51,034][52059] Updated weights for policy 1, policy_version 25032 (0.0009) [2023-10-08 00:52:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 50921472. Throughput: 0: 1719.3, 1: 1731.6. Samples: 12741614. Policy #0 lag: (min: 11.0, avg: 11.6, max: 28.0) [2023-10-08 00:52:51,211][50642] Avg episode reward: [(0, '16.760'), (1, '18.500')] [2023-10-08 00:52:51,215][52060] Updated weights for policy 0, policy_version 24720 (0.0009) [2023-10-08 00:52:51,400][52059] Updated weights for policy 1, policy_version 25042 (0.0008) [2023-10-08 00:52:51,583][52060] Updated weights for policy 0, policy_version 24730 (0.0007) [2023-10-08 00:52:51,758][52059] Updated weights for policy 1, policy_version 25052 (0.0007) [2023-10-08 00:52:55,404][52060] Updated weights for policy 0, policy_version 24740 (0.0007) [2023-10-08 00:52:55,635][52059] Updated weights for policy 1, policy_version 25062 (0.0007) [2023-10-08 00:52:55,771][52060] Updated weights for policy 0, policy_version 24750 (0.0009) [2023-10-08 00:52:56,003][52059] Updated weights for policy 1, policy_version 25072 (0.0007) [2023-10-08 00:52:56,146][52060] Updated weights for policy 0, policy_version 24760 (0.0008) [2023-10-08 00:52:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 50987008. Throughput: 0: 1711.5, 1: 1754.2. Samples: 12763134. Policy #0 lag: (min: 11.0, avg: 11.6, max: 28.0) [2023-10-08 00:52:56,211][50642] Avg episode reward: [(0, '15.780'), (1, '20.690')] [2023-10-08 00:52:56,372][52059] Updated weights for policy 1, policy_version 25082 (0.0008) [2023-10-08 00:53:00,141][52060] Updated weights for policy 0, policy_version 24770 (0.0009) [2023-10-08 00:53:00,240][52059] Updated weights for policy 1, policy_version 25092 (0.0008) [2023-10-08 00:53:00,514][52060] Updated weights for policy 0, policy_version 24780 (0.0008) [2023-10-08 00:53:00,606][52059] Updated weights for policy 1, policy_version 25102 (0.0008) [2023-10-08 00:53:00,876][52060] Updated weights for policy 0, policy_version 24790 (0.0007) [2023-10-08 00:53:00,965][52059] Updated weights for policy 1, policy_version 25112 (0.0008) [2023-10-08 00:53:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 51052544. Throughput: 0: 1693.0, 1: 1740.4. Samples: 12782846. Policy #0 lag: (min: 11.0, avg: 11.6, max: 28.0) [2023-10-08 00:53:01,211][50642] Avg episode reward: [(0, '16.840'), (1, '18.480')] [2023-10-08 00:53:01,249][52060] Updated weights for policy 0, policy_version 24800 (0.0008) [2023-10-08 00:53:04,960][52059] Updated weights for policy 1, policy_version 25122 (0.0009) [2023-10-08 00:53:05,327][52059] Updated weights for policy 1, policy_version 25132 (0.0010) [2023-10-08 00:53:05,486][52060] Updated weights for policy 0, policy_version 24810 (0.0008) [2023-10-08 00:53:05,678][52059] Updated weights for policy 1, policy_version 25142 (0.0009) [2023-10-08 00:53:05,860][52060] Updated weights for policy 0, policy_version 24820 (0.0007) [2023-10-08 00:53:06,046][52059] Updated weights for policy 1, policy_version 25152 (0.0009) [2023-10-08 00:53:06,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 51150848. Throughput: 0: 1710.4, 1: 1749.7. Samples: 12793796. Policy #0 lag: (min: 11.0, avg: 11.6, max: 28.0) [2023-10-08 00:53:06,211][50642] Avg episode reward: [(0, '16.230'), (1, '17.010')] [2023-10-08 00:53:06,224][52060] Updated weights for policy 0, policy_version 24830 (0.0007) [2023-10-08 00:53:10,001][52059] Updated weights for policy 1, policy_version 25162 (0.0007) [2023-10-08 00:53:10,118][52060] Updated weights for policy 0, policy_version 24840 (0.0007) [2023-10-08 00:53:10,369][52059] Updated weights for policy 1, policy_version 25172 (0.0008) [2023-10-08 00:53:10,486][52060] Updated weights for policy 0, policy_version 24850 (0.0008) [2023-10-08 00:53:10,732][52059] Updated weights for policy 1, policy_version 25182 (0.0008) [2023-10-08 00:53:10,859][52060] Updated weights for policy 0, policy_version 24860 (0.0010) [2023-10-08 00:53:11,210][50642] Fps is (10 sec: 19660.5, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 51249152. Throughput: 0: 1713.7, 1: 1748.7. Samples: 12814990. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:53:11,211][50642] Avg episode reward: [(0, '15.440'), (1, '19.170')] [2023-10-08 00:53:14,677][52059] Updated weights for policy 1, policy_version 25192 (0.0008) [2023-10-08 00:53:14,725][52060] Updated weights for policy 0, policy_version 24870 (0.0007) [2023-10-08 00:53:15,038][52059] Updated weights for policy 1, policy_version 25202 (0.0008) [2023-10-08 00:53:15,091][52060] Updated weights for policy 0, policy_version 24880 (0.0008) [2023-10-08 00:53:15,396][52059] Updated weights for policy 1, policy_version 25212 (0.0010) [2023-10-08 00:53:15,459][52060] Updated weights for policy 0, policy_version 24890 (0.0007) [2023-10-08 00:53:16,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 51314688. Throughput: 0: 1682.7, 1: 1720.4. Samples: 12833668. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:53:16,211][50642] Avg episode reward: [(0, '16.720'), (1, '19.690')] [2023-10-08 00:53:19,252][52059] Updated weights for policy 1, policy_version 25222 (0.0009) [2023-10-08 00:53:19,528][52060] Updated weights for policy 0, policy_version 24900 (0.0009) [2023-10-08 00:53:19,622][52059] Updated weights for policy 1, policy_version 25232 (0.0009) [2023-10-08 00:53:19,895][52060] Updated weights for policy 0, policy_version 24910 (0.0009) [2023-10-08 00:53:20,000][52059] Updated weights for policy 1, policy_version 25242 (0.0009) [2023-10-08 00:53:20,259][52060] Updated weights for policy 0, policy_version 24920 (0.0010) [2023-10-08 00:53:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 51380224. Throughput: 0: 1716.5, 1: 1752.7. Samples: 12845928. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:53:21,211][50642] Avg episode reward: [(0, '15.870'), (1, '19.150')] [2023-10-08 00:53:24,009][52059] Updated weights for policy 1, policy_version 25252 (0.0009) [2023-10-08 00:53:24,371][52059] Updated weights for policy 1, policy_version 25262 (0.0008) [2023-10-08 00:53:24,411][52060] Updated weights for policy 0, policy_version 24930 (0.0010) [2023-10-08 00:53:24,734][52059] Updated weights for policy 1, policy_version 25272 (0.0008) [2023-10-08 00:53:24,823][52060] Updated weights for policy 0, policy_version 24940 (0.0007) [2023-10-08 00:53:25,184][52060] Updated weights for policy 0, policy_version 24950 (0.0008) [2023-10-08 00:53:25,558][52060] Updated weights for policy 0, policy_version 24960 (0.0008) [2023-10-08 00:53:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 51445760. Throughput: 0: 1697.3, 1: 1723.5. Samples: 12865184. Policy #0 lag: (min: 29.0, avg: 29.8, max: 47.0) [2023-10-08 00:53:26,211][50642] Avg episode reward: [(0, '15.280'), (1, '19.150')] [2023-10-08 00:53:28,627][52059] Updated weights for policy 1, policy_version 25282 (0.0008) [2023-10-08 00:53:28,995][52059] Updated weights for policy 1, policy_version 25292 (0.0010) [2023-10-08 00:53:29,368][52059] Updated weights for policy 1, policy_version 25302 (0.0007) [2023-10-08 00:53:29,410][52060] Updated weights for policy 0, policy_version 24970 (0.0010) [2023-10-08 00:53:29,737][52059] Updated weights for policy 1, policy_version 25312 (0.0007) [2023-10-08 00:53:29,787][52060] Updated weights for policy 0, policy_version 24980 (0.0007) [2023-10-08 00:53:30,151][52060] Updated weights for policy 0, policy_version 24990 (0.0009) [2023-10-08 00:53:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 51511296. Throughput: 0: 1684.5, 1: 1724.4. Samples: 12885748. Policy #0 lag: (min: 29.0, avg: 29.8, max: 47.0) [2023-10-08 00:53:31,211][50642] Avg episode reward: [(0, '17.940'), (1, '19.260')] [2023-10-08 00:53:33,714][52059] Updated weights for policy 1, policy_version 25322 (0.0009) [2023-10-08 00:53:34,092][52059] Updated weights for policy 1, policy_version 25332 (0.0010) [2023-10-08 00:53:34,200][52060] Updated weights for policy 0, policy_version 25000 (0.0008) [2023-10-08 00:53:34,448][52059] Updated weights for policy 1, policy_version 25342 (0.0009) [2023-10-08 00:53:34,561][52060] Updated weights for policy 0, policy_version 25010 (0.0008) [2023-10-08 00:53:34,933][52060] Updated weights for policy 0, policy_version 25020 (0.0009) [2023-10-08 00:53:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.6, 300 sec: 13884.7). Total num frames: 51576832. Throughput: 0: 1714.5, 1: 1737.7. Samples: 12896964. Policy #0 lag: (min: 29.0, avg: 29.8, max: 47.0) [2023-10-08 00:53:36,211][50642] Avg episode reward: [(0, '14.960'), (1, '19.340')] [2023-10-08 00:53:38,288][52059] Updated weights for policy 1, policy_version 25352 (0.0007) [2023-10-08 00:53:38,662][52059] Updated weights for policy 1, policy_version 25362 (0.0008) [2023-10-08 00:53:38,773][52060] Updated weights for policy 0, policy_version 25030 (0.0008) [2023-10-08 00:53:39,024][52059] Updated weights for policy 1, policy_version 25372 (0.0008) [2023-10-08 00:53:39,139][52060] Updated weights for policy 0, policy_version 25040 (0.0007) [2023-10-08 00:53:39,499][52060] Updated weights for policy 0, policy_version 25050 (0.0007) [2023-10-08 00:53:41,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 51642368. Throughput: 0: 1690.1, 1: 1713.4. Samples: 12916292. Policy #0 lag: (min: 29.0, avg: 29.8, max: 47.0) [2023-10-08 00:53:41,211][50642] Avg episode reward: [(0, '16.700'), (1, '19.250')] [2023-10-08 00:53:43,077][52059] Updated weights for policy 1, policy_version 25382 (0.0008) [2023-10-08 00:53:43,426][52060] Updated weights for policy 0, policy_version 25060 (0.0007) [2023-10-08 00:53:43,444][52059] Updated weights for policy 1, policy_version 25392 (0.0008) [2023-10-08 00:53:43,794][52060] Updated weights for policy 0, policy_version 25070 (0.0008) [2023-10-08 00:53:43,818][52059] Updated weights for policy 1, policy_version 25402 (0.0009) [2023-10-08 00:53:44,164][52060] Updated weights for policy 0, policy_version 25080 (0.0008) [2023-10-08 00:53:46,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 51707904. Throughput: 0: 1708.1, 1: 1732.6. Samples: 12937676. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) [2023-10-08 00:53:46,211][50642] Avg episode reward: [(0, '15.750'), (1, '19.430')] [2023-10-08 00:53:47,680][52059] Updated weights for policy 1, policy_version 25412 (0.0007) [2023-10-08 00:53:48,049][52059] Updated weights for policy 1, policy_version 25422 (0.0008) [2023-10-08 00:53:48,173][52060] Updated weights for policy 0, policy_version 25090 (0.0010) [2023-10-08 00:53:48,408][52059] Updated weights for policy 1, policy_version 25432 (0.0008) [2023-10-08 00:53:48,537][52060] Updated weights for policy 0, policy_version 25100 (0.0008) [2023-10-08 00:53:48,912][52060] Updated weights for policy 0, policy_version 25110 (0.0009) [2023-10-08 00:53:49,276][52060] Updated weights for policy 0, policy_version 25120 (0.0009) [2023-10-08 00:53:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 51773440. Throughput: 0: 1702.7, 1: 1713.4. Samples: 12947524. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) [2023-10-08 00:53:51,211][50642] Avg episode reward: [(0, '16.600'), (1, '19.190')] [2023-10-08 00:53:52,310][52059] Updated weights for policy 1, policy_version 25442 (0.0008) [2023-10-08 00:53:52,683][52059] Updated weights for policy 1, policy_version 25452 (0.0010) [2023-10-08 00:53:53,036][52059] Updated weights for policy 1, policy_version 25462 (0.0009) [2023-10-08 00:53:53,195][52060] Updated weights for policy 0, policy_version 25130 (0.0008) [2023-10-08 00:53:53,399][52059] Updated weights for policy 1, policy_version 25472 (0.0009) [2023-10-08 00:53:53,564][52060] Updated weights for policy 0, policy_version 25140 (0.0008) [2023-10-08 00:53:53,922][52060] Updated weights for policy 0, policy_version 25150 (0.0007) [2023-10-08 00:53:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 51838976. Throughput: 0: 1688.0, 1: 1722.4. Samples: 12968460. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) [2023-10-08 00:53:56,211][50642] Avg episode reward: [(0, '18.230'), (1, '19.540')] [2023-10-08 00:53:57,348][52059] Updated weights for policy 1, policy_version 25482 (0.0008) [2023-10-08 00:53:57,708][52059] Updated weights for policy 1, policy_version 25492 (0.0007) [2023-10-08 00:53:57,863][52060] Updated weights for policy 0, policy_version 25160 (0.0008) [2023-10-08 00:53:58,083][52059] Updated weights for policy 1, policy_version 25502 (0.0009) [2023-10-08 00:53:58,230][52060] Updated weights for policy 0, policy_version 25170 (0.0008) [2023-10-08 00:53:58,605][52060] Updated weights for policy 0, policy_version 25180 (0.0008) [2023-10-08 00:54:01,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 51904512. Throughput: 0: 1721.6, 1: 1746.9. Samples: 12989748. Policy #0 lag: (min: 31.0, avg: 38.5, max: 63.0) [2023-10-08 00:54:01,211][50642] Avg episode reward: [(0, '15.230'), (1, '18.240')] [2023-10-08 00:54:01,970][52059] Updated weights for policy 1, policy_version 25512 (0.0009) [2023-10-08 00:54:02,331][52059] Updated weights for policy 1, policy_version 25522 (0.0008) [2023-10-08 00:54:02,548][52060] Updated weights for policy 0, policy_version 25190 (0.0010) [2023-10-08 00:54:02,703][52059] Updated weights for policy 1, policy_version 25532 (0.0008) [2023-10-08 00:54:02,913][52060] Updated weights for policy 0, policy_version 25200 (0.0008) [2023-10-08 00:54:03,285][52060] Updated weights for policy 0, policy_version 25210 (0.0008) [2023-10-08 00:54:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 51970048. Throughput: 0: 1690.5, 1: 1717.3. Samples: 12999280. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:54:06,211][50642] Avg episode reward: [(0, '16.070'), (1, '18.890')] [2023-10-08 00:54:06,553][52059] Updated weights for policy 1, policy_version 25542 (0.0007) [2023-10-08 00:54:06,921][52059] Updated weights for policy 1, policy_version 25552 (0.0007) [2023-10-08 00:54:07,164][52060] Updated weights for policy 0, policy_version 25220 (0.0008) [2023-10-08 00:54:07,290][52059] Updated weights for policy 1, policy_version 25562 (0.0008) [2023-10-08 00:54:07,535][52060] Updated weights for policy 0, policy_version 25230 (0.0008) [2023-10-08 00:54:07,909][52060] Updated weights for policy 0, policy_version 25240 (0.0008) [2023-10-08 00:54:11,150][52059] Updated weights for policy 1, policy_version 25572 (0.0009) [2023-10-08 00:54:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 52035584. Throughput: 0: 1709.9, 1: 1747.1. Samples: 13020748. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:54:11,211][50642] Avg episode reward: [(0, '15.930'), (1, '19.170')] [2023-10-08 00:54:11,520][52059] Updated weights for policy 1, policy_version 25582 (0.0009) [2023-10-08 00:54:11,879][52059] Updated weights for policy 1, policy_version 25592 (0.0009) [2023-10-08 00:54:12,040][52060] Updated weights for policy 0, policy_version 25250 (0.0009) [2023-10-08 00:54:12,432][52060] Updated weights for policy 0, policy_version 25260 (0.0007) [2023-10-08 00:54:12,799][52060] Updated weights for policy 0, policy_version 25270 (0.0008) [2023-10-08 00:54:13,174][52060] Updated weights for policy 0, policy_version 25280 (0.0011) [2023-10-08 00:54:15,948][52059] Updated weights for policy 1, policy_version 25602 (0.0007) [2023-10-08 00:54:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 52101120. Throughput: 0: 1725.8, 1: 1743.0. Samples: 13041842. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:54:16,211][50642] Avg episode reward: [(0, '15.740'), (1, '17.840')] [2023-10-08 00:54:16,319][52059] Updated weights for policy 1, policy_version 25612 (0.0010) [2023-10-08 00:54:16,693][52059] Updated weights for policy 1, policy_version 25622 (0.0009) [2023-10-08 00:54:17,051][52059] Updated weights for policy 1, policy_version 25632 (0.0008) [2023-10-08 00:54:17,128][52060] Updated weights for policy 0, policy_version 25290 (0.0007) [2023-10-08 00:54:17,501][52060] Updated weights for policy 0, policy_version 25300 (0.0007) [2023-10-08 00:54:17,866][52060] Updated weights for policy 0, policy_version 25310 (0.0007) [2023-10-08 00:54:20,896][52059] Updated weights for policy 1, policy_version 25642 (0.0009) [2023-10-08 00:54:21,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 52166656. Throughput: 0: 1694.4, 1: 1729.5. Samples: 13051040. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:54:21,211][50642] Avg episode reward: [(0, '17.860'), (1, '17.790')] [2023-10-08 00:54:21,264][52059] Updated weights for policy 1, policy_version 25652 (0.0008) [2023-10-08 00:54:21,625][52059] Updated weights for policy 1, policy_version 25662 (0.0009) [2023-10-08 00:54:21,953][52060] Updated weights for policy 0, policy_version 25320 (0.0008) [2023-10-08 00:54:22,324][52060] Updated weights for policy 0, policy_version 25330 (0.0007) [2023-10-08 00:54:22,699][52060] Updated weights for policy 0, policy_version 25340 (0.0010) [2023-10-08 00:54:25,572][52059] Updated weights for policy 1, policy_version 25672 (0.0008) [2023-10-08 00:54:25,925][52059] Updated weights for policy 1, policy_version 25682 (0.0010) [2023-10-08 00:54:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 52232192. Throughput: 0: 1719.2, 1: 1747.3. Samples: 13072286. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:54:26,211][50642] Avg episode reward: [(0, '14.820'), (1, '19.310')] [2023-10-08 00:54:26,284][52059] Updated weights for policy 1, policy_version 25692 (0.0009) [2023-10-08 00:54:26,598][52060] Updated weights for policy 0, policy_version 25350 (0.0008) [2023-10-08 00:54:26,959][52060] Updated weights for policy 0, policy_version 25360 (0.0007) [2023-10-08 00:54:27,341][52060] Updated weights for policy 0, policy_version 25370 (0.0008) [2023-10-08 00:54:30,134][52059] Updated weights for policy 1, policy_version 25702 (0.0007) [2023-10-08 00:54:30,495][52059] Updated weights for policy 1, policy_version 25712 (0.0008) [2023-10-08 00:54:30,861][52059] Updated weights for policy 1, policy_version 25722 (0.0008) [2023-10-08 00:54:31,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 52330496. Throughput: 0: 1726.4, 1: 1724.7. Samples: 13092976. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:54:31,211][50642] Avg episode reward: [(0, '14.870'), (1, '18.770')] [2023-10-08 00:54:31,415][52060] Updated weights for policy 0, policy_version 25380 (0.0008) [2023-10-08 00:54:31,781][52060] Updated weights for policy 0, policy_version 25390 (0.0009) [2023-10-08 00:54:32,163][52060] Updated weights for policy 0, policy_version 25400 (0.0008) [2023-10-08 00:54:34,809][52059] Updated weights for policy 1, policy_version 25732 (0.0009) [2023-10-08 00:54:35,174][52059] Updated weights for policy 1, policy_version 25742 (0.0008) [2023-10-08 00:54:35,541][52059] Updated weights for policy 1, policy_version 25752 (0.0009) [2023-10-08 00:54:35,895][52060] Updated weights for policy 0, policy_version 25410 (0.0007) [2023-10-08 00:54:36,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 52396032. Throughput: 0: 1714.6, 1: 1750.1. Samples: 13103436. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:54:36,211][50642] Avg episode reward: [(0, '17.010'), (1, '20.090')] [2023-10-08 00:54:36,270][52060] Updated weights for policy 0, policy_version 25420 (0.0008) [2023-10-08 00:54:36,643][52060] Updated weights for policy 0, policy_version 25430 (0.0010) [2023-10-08 00:54:37,017][52060] Updated weights for policy 0, policy_version 25440 (0.0008) [2023-10-08 00:54:39,531][52059] Updated weights for policy 1, policy_version 25762 (0.0009) [2023-10-08 00:54:39,896][52059] Updated weights for policy 1, policy_version 25772 (0.0009) [2023-10-08 00:54:40,261][52059] Updated weights for policy 1, policy_version 25782 (0.0009) [2023-10-08 00:54:40,625][52059] Updated weights for policy 1, policy_version 25792 (0.0008) [2023-10-08 00:54:40,952][52060] Updated weights for policy 0, policy_version 25450 (0.0009) [2023-10-08 00:54:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 52461568. Throughput: 0: 1729.2, 1: 1734.8. Samples: 13124340. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:54:41,211][50642] Avg episode reward: [(0, '14.780'), (1, '20.070')] [2023-10-08 00:54:41,315][52060] Updated weights for policy 0, policy_version 25460 (0.0008) [2023-10-08 00:54:41,693][52060] Updated weights for policy 0, policy_version 25470 (0.0008) [2023-10-08 00:54:44,652][52059] Updated weights for policy 1, policy_version 25802 (0.0009) [2023-10-08 00:54:45,031][52059] Updated weights for policy 1, policy_version 25812 (0.0008) [2023-10-08 00:54:45,388][52059] Updated weights for policy 1, policy_version 25822 (0.0009) [2023-10-08 00:54:45,681][52060] Updated weights for policy 0, policy_version 25480 (0.0008) [2023-10-08 00:54:46,055][52060] Updated weights for policy 0, policy_version 25490 (0.0011) [2023-10-08 00:54:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 52527104. Throughput: 0: 1716.4, 1: 1719.6. Samples: 13144366. Policy #0 lag: (min: 28.0, avg: 39.4, max: 60.0) [2023-10-08 00:54:46,211][50642] Avg episode reward: [(0, '17.320'), (1, '19.570')] [2023-10-08 00:54:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000025824_26443776.pth... [2023-10-08 00:54:46,251][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000024192_24772608.pth [2023-10-08 00:54:46,409][52060] Updated weights for policy 0, policy_version 25500 (0.0010) [2023-10-08 00:54:46,555][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000025504_26116096.pth... [2023-10-08 00:54:46,584][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000023904_24477696.pth [2023-10-08 00:54:49,361][52059] Updated weights for policy 1, policy_version 25832 (0.0009) [2023-10-08 00:54:49,733][52059] Updated weights for policy 1, policy_version 25842 (0.0008) [2023-10-08 00:54:50,097][52059] Updated weights for policy 1, policy_version 25852 (0.0008) [2023-10-08 00:54:50,405][52060] Updated weights for policy 0, policy_version 25510 (0.0010) [2023-10-08 00:54:50,776][52060] Updated weights for policy 0, policy_version 25520 (0.0009) [2023-10-08 00:54:51,137][52060] Updated weights for policy 0, policy_version 25530 (0.0007) [2023-10-08 00:54:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 52592640. Throughput: 0: 1726.9, 1: 1746.2. Samples: 13155570. Policy #0 lag: (min: 28.0, avg: 39.4, max: 60.0) [2023-10-08 00:54:51,212][50642] Avg episode reward: [(0, '14.450'), (1, '19.470')] [2023-10-08 00:54:53,907][52059] Updated weights for policy 1, policy_version 25862 (0.0010) [2023-10-08 00:54:54,278][52059] Updated weights for policy 1, policy_version 25872 (0.0009) [2023-10-08 00:54:54,647][52059] Updated weights for policy 1, policy_version 25882 (0.0009) [2023-10-08 00:54:54,904][52060] Updated weights for policy 0, policy_version 25540 (0.0008) [2023-10-08 00:54:55,268][52060] Updated weights for policy 0, policy_version 25550 (0.0008) [2023-10-08 00:54:55,632][52060] Updated weights for policy 0, policy_version 25560 (0.0009) [2023-10-08 00:54:56,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 52690944. Throughput: 0: 1735.0, 1: 1716.6. Samples: 13176070. Policy #0 lag: (min: 28.0, avg: 39.4, max: 60.0) [2023-10-08 00:54:56,211][50642] Avg episode reward: [(0, '14.360'), (1, '17.420')] [2023-10-08 00:54:58,598][52059] Updated weights for policy 1, policy_version 25892 (0.0008) [2023-10-08 00:54:58,959][52059] Updated weights for policy 1, policy_version 25902 (0.0008) [2023-10-08 00:54:59,321][52059] Updated weights for policy 1, policy_version 25912 (0.0010) [2023-10-08 00:54:59,671][52060] Updated weights for policy 0, policy_version 25570 (0.0009) [2023-10-08 00:55:00,064][52060] Updated weights for policy 0, policy_version 25580 (0.0009) [2023-10-08 00:55:00,437][52060] Updated weights for policy 0, policy_version 25590 (0.0010) [2023-10-08 00:55:00,813][52060] Updated weights for policy 0, policy_version 25600 (0.0009) [2023-10-08 00:55:01,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 52756480. Throughput: 0: 1707.1, 1: 1719.8. Samples: 13196050. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:55:01,211][50642] Avg episode reward: [(0, '18.030'), (1, '20.680')] [2023-10-08 00:55:03,269][52059] Updated weights for policy 1, policy_version 25922 (0.0008) [2023-10-08 00:55:03,638][52059] Updated weights for policy 1, policy_version 25932 (0.0008) [2023-10-08 00:55:04,000][52059] Updated weights for policy 1, policy_version 25942 (0.0007) [2023-10-08 00:55:04,368][52059] Updated weights for policy 1, policy_version 25952 (0.0009) [2023-10-08 00:55:04,715][52060] Updated weights for policy 0, policy_version 25610 (0.0010) [2023-10-08 00:55:05,077][52060] Updated weights for policy 0, policy_version 25620 (0.0009) [2023-10-08 00:55:05,447][52060] Updated weights for policy 0, policy_version 25630 (0.0010) [2023-10-08 00:55:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 52822016. Throughput: 0: 1741.0, 1: 1733.3. Samples: 13207384. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:55:06,211][50642] Avg episode reward: [(0, '14.850'), (1, '18.160')] [2023-10-08 00:55:08,209][52059] Updated weights for policy 1, policy_version 25962 (0.0008) [2023-10-08 00:55:08,572][52059] Updated weights for policy 1, policy_version 25972 (0.0009) [2023-10-08 00:55:08,938][52059] Updated weights for policy 1, policy_version 25982 (0.0008) [2023-10-08 00:55:09,255][52060] Updated weights for policy 0, policy_version 25640 (0.0010) [2023-10-08 00:55:09,621][52060] Updated weights for policy 0, policy_version 25650 (0.0008) [2023-10-08 00:55:09,989][52060] Updated weights for policy 0, policy_version 25660 (0.0007) [2023-10-08 00:55:11,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 52887552. Throughput: 0: 1726.8, 1: 1718.5. Samples: 13227324. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:55:11,211][50642] Avg episode reward: [(0, '16.780'), (1, '18.340')] [2023-10-08 00:55:12,946][52059] Updated weights for policy 1, policy_version 25992 (0.0008) [2023-10-08 00:55:13,316][52059] Updated weights for policy 1, policy_version 26002 (0.0007) [2023-10-08 00:55:13,673][52059] Updated weights for policy 1, policy_version 26012 (0.0009) [2023-10-08 00:55:13,942][52060] Updated weights for policy 0, policy_version 25670 (0.0009) [2023-10-08 00:55:14,302][52060] Updated weights for policy 0, policy_version 25680 (0.0008) [2023-10-08 00:55:14,671][52060] Updated weights for policy 0, policy_version 25690 (0.0008) [2023-10-08 00:55:16,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 52953088. Throughput: 0: 1713.3, 1: 1739.5. Samples: 13248350. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:55:16,211][50642] Avg episode reward: [(0, '15.750'), (1, '20.440')] [2023-10-08 00:55:17,676][52059] Updated weights for policy 1, policy_version 26022 (0.0008) [2023-10-08 00:55:18,050][52059] Updated weights for policy 1, policy_version 26032 (0.0009) [2023-10-08 00:55:18,424][52059] Updated weights for policy 1, policy_version 26042 (0.0008) [2023-10-08 00:55:18,623][52060] Updated weights for policy 0, policy_version 25700 (0.0008) [2023-10-08 00:55:19,004][52060] Updated weights for policy 0, policy_version 25710 (0.0009) [2023-10-08 00:55:19,365][52060] Updated weights for policy 0, policy_version 25720 (0.0007) [2023-10-08 00:55:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 53018624. Throughput: 0: 1736.0, 1: 1714.5. Samples: 13258712. Policy #0 lag: (min: 28.0, avg: 30.4, max: 60.0) [2023-10-08 00:55:21,211][50642] Avg episode reward: [(0, '15.280'), (1, '20.180')] [2023-10-08 00:55:22,300][52059] Updated weights for policy 1, policy_version 26052 (0.0010) [2023-10-08 00:55:22,661][52059] Updated weights for policy 1, policy_version 26062 (0.0011) [2023-10-08 00:55:23,027][52059] Updated weights for policy 1, policy_version 26072 (0.0010) [2023-10-08 00:55:23,341][52060] Updated weights for policy 0, policy_version 25730 (0.0007) [2023-10-08 00:55:23,715][52060] Updated weights for policy 0, policy_version 25740 (0.0009) [2023-10-08 00:55:24,077][52060] Updated weights for policy 0, policy_version 25750 (0.0009) [2023-10-08 00:55:24,443][52060] Updated weights for policy 0, policy_version 25760 (0.0010) [2023-10-08 00:55:26,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 53084160. Throughput: 0: 1711.0, 1: 1728.9. Samples: 13279132. Policy #0 lag: (min: 28.0, avg: 30.4, max: 60.0) [2023-10-08 00:55:26,211][50642] Avg episode reward: [(0, '17.470'), (1, '16.880')] [2023-10-08 00:55:26,875][52059] Updated weights for policy 1, policy_version 26082 (0.0008) [2023-10-08 00:55:27,248][52059] Updated weights for policy 1, policy_version 26092 (0.0009) [2023-10-08 00:55:27,609][52059] Updated weights for policy 1, policy_version 26102 (0.0008) [2023-10-08 00:55:27,972][52059] Updated weights for policy 1, policy_version 26112 (0.0009) [2023-10-08 00:55:28,453][52060] Updated weights for policy 0, policy_version 25770 (0.0009) [2023-10-08 00:55:28,819][52060] Updated weights for policy 0, policy_version 25780 (0.0008) [2023-10-08 00:55:29,190][52060] Updated weights for policy 0, policy_version 25790 (0.0011) [2023-10-08 00:55:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 53149696. Throughput: 0: 1725.5, 1: 1745.0. Samples: 13300536. Policy #0 lag: (min: 28.0, avg: 30.4, max: 60.0) [2023-10-08 00:55:31,211][50642] Avg episode reward: [(0, '14.750'), (1, '17.740')] [2023-10-08 00:55:32,076][52059] Updated weights for policy 1, policy_version 26122 (0.0007) [2023-10-08 00:55:32,444][52059] Updated weights for policy 1, policy_version 26132 (0.0007) [2023-10-08 00:55:32,805][52059] Updated weights for policy 1, policy_version 26142 (0.0008) [2023-10-08 00:55:33,036][52060] Updated weights for policy 0, policy_version 25800 (0.0009) [2023-10-08 00:55:33,406][52060] Updated weights for policy 0, policy_version 25810 (0.0010) [2023-10-08 00:55:33,783][52060] Updated weights for policy 0, policy_version 25820 (0.0008) [2023-10-08 00:55:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 53215232. Throughput: 0: 1717.3, 1: 1714.0. Samples: 13309978. Policy #0 lag: (min: 28.0, avg: 30.4, max: 60.0) [2023-10-08 00:55:36,211][50642] Avg episode reward: [(0, '15.830'), (1, '22.660')] [2023-10-08 00:55:36,212][51710] Saving new best policy, reward=22.660! [2023-10-08 00:55:36,683][52059] Updated weights for policy 1, policy_version 26152 (0.0010) [2023-10-08 00:55:37,050][52059] Updated weights for policy 1, policy_version 26162 (0.0010) [2023-10-08 00:55:37,420][52059] Updated weights for policy 1, policy_version 26172 (0.0009) [2023-10-08 00:55:37,791][52060] Updated weights for policy 0, policy_version 25830 (0.0008) [2023-10-08 00:55:38,171][52060] Updated weights for policy 0, policy_version 25840 (0.0011) [2023-10-08 00:55:38,539][52060] Updated weights for policy 0, policy_version 25850 (0.0009) [2023-10-08 00:55:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 53280768. Throughput: 0: 1704.4, 1: 1740.4. Samples: 13331088. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 00:55:41,211][50642] Avg episode reward: [(0, '16.440'), (1, '15.920')] [2023-10-08 00:55:41,287][52059] Updated weights for policy 1, policy_version 26182 (0.0010) [2023-10-08 00:55:41,660][52059] Updated weights for policy 1, policy_version 26192 (0.0009) [2023-10-08 00:55:42,025][52059] Updated weights for policy 1, policy_version 26202 (0.0009) [2023-10-08 00:55:42,456][52060] Updated weights for policy 0, policy_version 25860 (0.0010) [2023-10-08 00:55:42,827][52060] Updated weights for policy 0, policy_version 25870 (0.0008) [2023-10-08 00:55:43,193][52060] Updated weights for policy 0, policy_version 25880 (0.0008) [2023-10-08 00:55:45,968][52059] Updated weights for policy 1, policy_version 26212 (0.0011) [2023-10-08 00:55:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 53346304. Throughput: 0: 1738.2, 1: 1741.4. Samples: 13352632. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 00:55:46,211][50642] Avg episode reward: [(0, '14.620'), (1, '16.490')] [2023-10-08 00:55:46,331][52059] Updated weights for policy 1, policy_version 26222 (0.0009) [2023-10-08 00:55:46,704][52059] Updated weights for policy 1, policy_version 26232 (0.0009) [2023-10-08 00:55:47,099][52060] Updated weights for policy 0, policy_version 25890 (0.0009) [2023-10-08 00:55:47,472][52060] Updated weights for policy 0, policy_version 25900 (0.0010) [2023-10-08 00:55:47,847][52060] Updated weights for policy 0, policy_version 25910 (0.0008) [2023-10-08 00:55:48,207][52060] Updated weights for policy 0, policy_version 25920 (0.0008) [2023-10-08 00:55:50,696][52059] Updated weights for policy 1, policy_version 26242 (0.0011) [2023-10-08 00:55:51,061][52059] Updated weights for policy 1, policy_version 26252 (0.0010) [2023-10-08 00:55:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 53411840. Throughput: 0: 1709.9, 1: 1725.0. Samples: 13361954. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 00:55:51,211][50642] Avg episode reward: [(0, '15.970'), (1, '19.280')] [2023-10-08 00:55:51,424][52059] Updated weights for policy 1, policy_version 26262 (0.0010) [2023-10-08 00:55:51,793][52059] Updated weights for policy 1, policy_version 26272 (0.0011) [2023-10-08 00:55:52,282][52060] Updated weights for policy 0, policy_version 25930 (0.0009) [2023-10-08 00:55:52,661][52060] Updated weights for policy 0, policy_version 25940 (0.0009) [2023-10-08 00:55:53,034][52060] Updated weights for policy 0, policy_version 25950 (0.0011) [2023-10-08 00:55:55,786][52059] Updated weights for policy 1, policy_version 26282 (0.0007) [2023-10-08 00:55:56,139][52059] Updated weights for policy 1, policy_version 26292 (0.0007) [2023-10-08 00:55:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 53477376. Throughput: 0: 1722.8, 1: 1737.5. Samples: 13383036. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 00:55:56,211][50642] Avg episode reward: [(0, '14.410'), (1, '22.010')] [2023-10-08 00:55:56,501][52059] Updated weights for policy 1, policy_version 26302 (0.0009) [2023-10-08 00:55:57,092][52060] Updated weights for policy 0, policy_version 25960 (0.0008) [2023-10-08 00:55:57,463][52060] Updated weights for policy 0, policy_version 25970 (0.0009) [2023-10-08 00:55:57,833][52060] Updated weights for policy 0, policy_version 25980 (0.0009) [2023-10-08 00:56:00,372][52059] Updated weights for policy 1, policy_version 26312 (0.0010) [2023-10-08 00:56:00,744][52059] Updated weights for policy 1, policy_version 26322 (0.0010) [2023-10-08 00:56:01,105][52059] Updated weights for policy 1, policy_version 26332 (0.0008) [2023-10-08 00:56:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 53542912. Throughput: 0: 1733.2, 1: 1718.9. Samples: 13403692. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:56:01,211][50642] Avg episode reward: [(0, '15.060'), (1, '15.570')] [2023-10-08 00:56:01,779][52060] Updated weights for policy 0, policy_version 25990 (0.0009) [2023-10-08 00:56:02,149][52060] Updated weights for policy 0, policy_version 26000 (0.0008) [2023-10-08 00:56:02,526][52060] Updated weights for policy 0, policy_version 26010 (0.0007) [2023-10-08 00:56:04,910][52059] Updated weights for policy 1, policy_version 26342 (0.0008) [2023-10-08 00:56:05,274][52059] Updated weights for policy 1, policy_version 26352 (0.0008) [2023-10-08 00:56:05,645][52059] Updated weights for policy 1, policy_version 26362 (0.0008) [2023-10-08 00:56:06,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 53641216. Throughput: 0: 1710.7, 1: 1743.6. Samples: 13414156. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:56:06,211][50642] Avg episode reward: [(0, '16.830'), (1, '17.730')] [2023-10-08 00:56:06,518][52060] Updated weights for policy 0, policy_version 26020 (0.0008) [2023-10-08 00:56:06,890][52060] Updated weights for policy 0, policy_version 26030 (0.0007) [2023-10-08 00:56:07,254][52060] Updated weights for policy 0, policy_version 26040 (0.0011) [2023-10-08 00:56:09,517][52059] Updated weights for policy 1, policy_version 26372 (0.0008) [2023-10-08 00:56:09,889][52059] Updated weights for policy 1, policy_version 26382 (0.0011) [2023-10-08 00:56:10,251][52059] Updated weights for policy 1, policy_version 26392 (0.0010) [2023-10-08 00:56:11,199][52060] Updated weights for policy 0, policy_version 26050 (0.0008) [2023-10-08 00:56:11,210][50642] Fps is (10 sec: 16384.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 53706752. Throughput: 0: 1731.2, 1: 1731.1. Samples: 13434936. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:56:11,211][50642] Avg episode reward: [(0, '16.070'), (1, '19.800')] [2023-10-08 00:56:11,561][52060] Updated weights for policy 0, policy_version 26060 (0.0009) [2023-10-08 00:56:11,937][52060] Updated weights for policy 0, policy_version 26070 (0.0010) [2023-10-08 00:56:12,299][52060] Updated weights for policy 0, policy_version 26080 (0.0009) [2023-10-08 00:56:14,038][52059] Updated weights for policy 1, policy_version 26402 (0.0010) [2023-10-08 00:56:14,414][52059] Updated weights for policy 1, policy_version 26412 (0.0008) [2023-10-08 00:56:14,773][52059] Updated weights for policy 1, policy_version 26422 (0.0010) [2023-10-08 00:56:15,137][52059] Updated weights for policy 1, policy_version 26432 (0.0009) [2023-10-08 00:56:16,192][52060] Updated weights for policy 0, policy_version 26090 (0.0007) [2023-10-08 00:56:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 53772288. Throughput: 0: 1728.5, 1: 1714.5. Samples: 13455470. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:56:16,211][50642] Avg episode reward: [(0, '17.120'), (1, '18.920')] [2023-10-08 00:56:16,564][52060] Updated weights for policy 0, policy_version 26100 (0.0009) [2023-10-08 00:56:16,943][52060] Updated weights for policy 0, policy_version 26110 (0.0009) [2023-10-08 00:56:19,126][52059] Updated weights for policy 1, policy_version 26442 (0.0009) [2023-10-08 00:56:19,492][52059] Updated weights for policy 1, policy_version 26452 (0.0009) [2023-10-08 00:56:19,852][52059] Updated weights for policy 1, policy_version 26462 (0.0010) [2023-10-08 00:56:20,987][52060] Updated weights for policy 0, policy_version 26120 (0.0008) [2023-10-08 00:56:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 53837824. Throughput: 0: 1724.0, 1: 1742.2. Samples: 13465956. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 00:56:21,211][50642] Avg episode reward: [(0, '16.880'), (1, '17.600')] [2023-10-08 00:56:21,358][52060] Updated weights for policy 0, policy_version 26130 (0.0010) [2023-10-08 00:56:21,732][52060] Updated weights for policy 0, policy_version 26140 (0.0010) [2023-10-08 00:56:23,686][52059] Updated weights for policy 1, policy_version 26472 (0.0008) [2023-10-08 00:56:24,059][52059] Updated weights for policy 1, policy_version 26482 (0.0010) [2023-10-08 00:56:24,428][52059] Updated weights for policy 1, policy_version 26492 (0.0008) [2023-10-08 00:56:25,684][52060] Updated weights for policy 0, policy_version 26150 (0.0007) [2023-10-08 00:56:26,051][52060] Updated weights for policy 0, policy_version 26160 (0.0007) [2023-10-08 00:56:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 53903360. Throughput: 0: 1734.0, 1: 1718.0. Samples: 13486426. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 00:56:26,211][50642] Avg episode reward: [(0, '15.260'), (1, '17.870')] [2023-10-08 00:56:26,420][52060] Updated weights for policy 0, policy_version 26170 (0.0008) [2023-10-08 00:56:28,197][52059] Updated weights for policy 1, policy_version 26502 (0.0007) [2023-10-08 00:56:28,553][52059] Updated weights for policy 1, policy_version 26512 (0.0007) [2023-10-08 00:56:28,926][52059] Updated weights for policy 1, policy_version 26522 (0.0009) [2023-10-08 00:56:30,263][52060] Updated weights for policy 0, policy_version 26180 (0.0009) [2023-10-08 00:56:30,632][52060] Updated weights for policy 0, policy_version 26190 (0.0009) [2023-10-08 00:56:30,999][52060] Updated weights for policy 0, policy_version 26200 (0.0008) [2023-10-08 00:56:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 53968896. Throughput: 0: 1714.6, 1: 1725.0. Samples: 13507412. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 00:56:31,211][50642] Avg episode reward: [(0, '15.840'), (1, '20.740')] [2023-10-08 00:56:32,991][52059] Updated weights for policy 1, policy_version 26532 (0.0008) [2023-10-08 00:56:33,361][52059] Updated weights for policy 1, policy_version 26542 (0.0009) [2023-10-08 00:56:33,712][52059] Updated weights for policy 1, policy_version 26552 (0.0010) [2023-10-08 00:56:34,892][52060] Updated weights for policy 0, policy_version 26210 (0.0009) [2023-10-08 00:56:35,287][52060] Updated weights for policy 0, policy_version 26220 (0.0007) [2023-10-08 00:56:35,658][52060] Updated weights for policy 0, policy_version 26230 (0.0008) [2023-10-08 00:56:36,018][52060] Updated weights for policy 0, policy_version 26240 (0.0008) [2023-10-08 00:56:36,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 54067200. Throughput: 0: 1731.4, 1: 1728.9. Samples: 13517668. Policy #0 lag: (min: 11.0, avg: 14.3, max: 43.0) [2023-10-08 00:56:36,211][50642] Avg episode reward: [(0, '15.800'), (1, '19.890')] [2023-10-08 00:56:37,723][52059] Updated weights for policy 1, policy_version 26562 (0.0008) [2023-10-08 00:56:38,086][52059] Updated weights for policy 1, policy_version 26572 (0.0010) [2023-10-08 00:56:38,448][52059] Updated weights for policy 1, policy_version 26582 (0.0010) [2023-10-08 00:56:38,812][52059] Updated weights for policy 1, policy_version 26592 (0.0008) [2023-10-08 00:56:39,901][52060] Updated weights for policy 0, policy_version 26250 (0.0010) [2023-10-08 00:56:40,268][52060] Updated weights for policy 0, policy_version 26260 (0.0010) [2023-10-08 00:56:40,640][52060] Updated weights for policy 0, policy_version 26270 (0.0009) [2023-10-08 00:56:41,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 54132736. Throughput: 0: 1730.8, 1: 1724.9. Samples: 13538542. Policy #0 lag: (min: 11.0, avg: 14.3, max: 43.0) [2023-10-08 00:56:41,211][50642] Avg episode reward: [(0, '16.740'), (1, '18.180')] [2023-10-08 00:56:42,687][52059] Updated weights for policy 1, policy_version 26602 (0.0010) [2023-10-08 00:56:43,051][52059] Updated weights for policy 1, policy_version 26612 (0.0010) [2023-10-08 00:56:43,422][52059] Updated weights for policy 1, policy_version 26622 (0.0009) [2023-10-08 00:56:44,616][52060] Updated weights for policy 0, policy_version 26280 (0.0007) [2023-10-08 00:56:44,989][52060] Updated weights for policy 0, policy_version 26290 (0.0007) [2023-10-08 00:56:45,358][52060] Updated weights for policy 0, policy_version 26300 (0.0007) [2023-10-08 00:56:46,211][50642] Fps is (10 sec: 13106.6, 60 sec: 14199.4, 300 sec: 13773.6). Total num frames: 54198272. Throughput: 0: 1703.5, 1: 1745.1. Samples: 13558882. Policy #0 lag: (min: 11.0, avg: 14.3, max: 43.0) [2023-10-08 00:56:46,212][50642] Avg episode reward: [(0, '17.180'), (1, '18.300')] [2023-10-08 00:56:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000026304_26935296.pth... [2023-10-08 00:56:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000026624_27262976.pth... [2023-10-08 00:56:46,258][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000024704_25296896.pth [2023-10-08 00:56:46,260][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000025024_25624576.pth [2023-10-08 00:56:47,273][52059] Updated weights for policy 1, policy_version 26632 (0.0007) [2023-10-08 00:56:47,640][52059] Updated weights for policy 1, policy_version 26642 (0.0013) [2023-10-08 00:56:47,996][52059] Updated weights for policy 1, policy_version 26652 (0.0007) [2023-10-08 00:56:49,394][52060] Updated weights for policy 0, policy_version 26310 (0.0008) [2023-10-08 00:56:49,764][52060] Updated weights for policy 0, policy_version 26320 (0.0007) [2023-10-08 00:56:50,132][52060] Updated weights for policy 0, policy_version 26330 (0.0009) [2023-10-08 00:56:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 54263808. Throughput: 0: 1731.0, 1: 1723.1. Samples: 13569590. Policy #0 lag: (min: 11.0, avg: 14.3, max: 43.0) [2023-10-08 00:56:51,211][50642] Avg episode reward: [(0, '16.210'), (1, '21.410')] [2023-10-08 00:56:51,999][52059] Updated weights for policy 1, policy_version 26662 (0.0009) [2023-10-08 00:56:52,366][52059] Updated weights for policy 1, policy_version 26672 (0.0008) [2023-10-08 00:56:52,729][52059] Updated weights for policy 1, policy_version 26682 (0.0008) [2023-10-08 00:56:54,068][52060] Updated weights for policy 0, policy_version 26340 (0.0010) [2023-10-08 00:56:54,441][52060] Updated weights for policy 0, policy_version 26350 (0.0009) [2023-10-08 00:56:54,804][52060] Updated weights for policy 0, policy_version 26360 (0.0010) [2023-10-08 00:56:56,210][50642] Fps is (10 sec: 13107.8, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 54329344. Throughput: 0: 1709.4, 1: 1739.6. Samples: 13590144. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:56:56,211][50642] Avg episode reward: [(0, '15.960'), (1, '19.270')] [2023-10-08 00:56:56,669][52059] Updated weights for policy 1, policy_version 26692 (0.0008) [2023-10-08 00:56:57,037][52059] Updated weights for policy 1, policy_version 26702 (0.0007) [2023-10-08 00:56:57,406][52059] Updated weights for policy 1, policy_version 26712 (0.0007) [2023-10-08 00:56:58,714][52060] Updated weights for policy 0, policy_version 26370 (0.0008) [2023-10-08 00:56:59,085][52060] Updated weights for policy 0, policy_version 26380 (0.0008) [2023-10-08 00:56:59,447][52060] Updated weights for policy 0, policy_version 26390 (0.0008) [2023-10-08 00:56:59,823][52060] Updated weights for policy 0, policy_version 26400 (0.0007) [2023-10-08 00:57:01,210][50642] Fps is (10 sec: 13106.7, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 54394880. Throughput: 0: 1706.1, 1: 1759.2. Samples: 13611408. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:57:01,211][50642] Avg episode reward: [(0, '16.430'), (1, '18.180')] [2023-10-08 00:57:01,246][52059] Updated weights for policy 1, policy_version 26722 (0.0010) [2023-10-08 00:57:01,607][52059] Updated weights for policy 1, policy_version 26732 (0.0008) [2023-10-08 00:57:01,979][52059] Updated weights for policy 1, policy_version 26742 (0.0008) [2023-10-08 00:57:02,346][52059] Updated weights for policy 1, policy_version 26752 (0.0008) [2023-10-08 00:57:03,670][52060] Updated weights for policy 0, policy_version 26410 (0.0008) [2023-10-08 00:57:04,044][52060] Updated weights for policy 0, policy_version 26420 (0.0007) [2023-10-08 00:57:04,415][52060] Updated weights for policy 0, policy_version 26430 (0.0008) [2023-10-08 00:57:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 54460416. Throughput: 0: 1723.6, 1: 1732.6. Samples: 13621482. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:57:06,211][50642] Avg episode reward: [(0, '13.970'), (1, '19.960')] [2023-10-08 00:57:06,389][52059] Updated weights for policy 1, policy_version 26762 (0.0009) [2023-10-08 00:57:06,753][52059] Updated weights for policy 1, policy_version 26772 (0.0010) [2023-10-08 00:57:07,124][52059] Updated weights for policy 1, policy_version 26782 (0.0010) [2023-10-08 00:57:08,557][52060] Updated weights for policy 0, policy_version 26440 (0.0009) [2023-10-08 00:57:08,918][52060] Updated weights for policy 0, policy_version 26450 (0.0008) [2023-10-08 00:57:09,296][52060] Updated weights for policy 0, policy_version 26460 (0.0008) [2023-10-08 00:57:11,078][52059] Updated weights for policy 1, policy_version 26792 (0.0010) [2023-10-08 00:57:11,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 54525952. Throughput: 0: 1703.9, 1: 1757.8. Samples: 13642200. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:57:11,211][50642] Avg episode reward: [(0, '16.810'), (1, '19.890')] [2023-10-08 00:57:11,437][52059] Updated weights for policy 1, policy_version 26802 (0.0009) [2023-10-08 00:57:11,814][52059] Updated weights for policy 1, policy_version 26812 (0.0009) [2023-10-08 00:57:13,241][52060] Updated weights for policy 0, policy_version 26470 (0.0007) [2023-10-08 00:57:13,602][52060] Updated weights for policy 0, policy_version 26480 (0.0009) [2023-10-08 00:57:13,968][52060] Updated weights for policy 0, policy_version 26490 (0.0010) [2023-10-08 00:57:15,491][52059] Updated weights for policy 1, policy_version 26822 (0.0010) [2023-10-08 00:57:15,855][52059] Updated weights for policy 1, policy_version 26832 (0.0007) [2023-10-08 00:57:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 54591488. Throughput: 0: 1717.3, 1: 1742.4. Samples: 13663100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 00:57:16,211][50642] Avg episode reward: [(0, '14.630'), (1, '18.410')] [2023-10-08 00:57:16,221][52059] Updated weights for policy 1, policy_version 26842 (0.0008) [2023-10-08 00:57:17,948][52060] Updated weights for policy 0, policy_version 26500 (0.0010) [2023-10-08 00:57:18,313][52060] Updated weights for policy 0, policy_version 26510 (0.0008) [2023-10-08 00:57:18,687][52060] Updated weights for policy 0, policy_version 26520 (0.0008) [2023-10-08 00:57:20,131][52059] Updated weights for policy 1, policy_version 26852 (0.0008) [2023-10-08 00:57:20,494][52059] Updated weights for policy 1, policy_version 26862 (0.0008) [2023-10-08 00:57:20,866][52059] Updated weights for policy 1, policy_version 26872 (0.0007) [2023-10-08 00:57:21,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 54689792. Throughput: 0: 1703.5, 1: 1755.4. Samples: 13673322. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 00:57:21,211][50642] Avg episode reward: [(0, '14.030'), (1, '19.820')] [2023-10-08 00:57:22,617][52060] Updated weights for policy 0, policy_version 26530 (0.0007) [2023-10-08 00:57:22,986][52060] Updated weights for policy 0, policy_version 26540 (0.0007) [2023-10-08 00:57:23,355][52060] Updated weights for policy 0, policy_version 26550 (0.0007) [2023-10-08 00:57:23,726][52060] Updated weights for policy 0, policy_version 26560 (0.0009) [2023-10-08 00:57:24,786][52059] Updated weights for policy 1, policy_version 26882 (0.0008) [2023-10-08 00:57:25,150][52059] Updated weights for policy 1, policy_version 26892 (0.0008) [2023-10-08 00:57:25,524][52059] Updated weights for policy 1, policy_version 26902 (0.0007) [2023-10-08 00:57:25,892][52059] Updated weights for policy 1, policy_version 26912 (0.0009) [2023-10-08 00:57:26,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 54755328. Throughput: 0: 1704.9, 1: 1763.0. Samples: 13694596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 00:57:26,211][50642] Avg episode reward: [(0, '17.170'), (1, '19.900')] [2023-10-08 00:57:27,834][52060] Updated weights for policy 0, policy_version 26570 (0.0008) [2023-10-08 00:57:28,204][52060] Updated weights for policy 0, policy_version 26580 (0.0008) [2023-10-08 00:57:28,570][52060] Updated weights for policy 0, policy_version 26590 (0.0007) [2023-10-08 00:57:29,645][52059] Updated weights for policy 1, policy_version 26922 (0.0007) [2023-10-08 00:57:30,012][52059] Updated weights for policy 1, policy_version 26932 (0.0009) [2023-10-08 00:57:30,369][52059] Updated weights for policy 1, policy_version 26942 (0.0010) [2023-10-08 00:57:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 54820864. Throughput: 0: 1723.2, 1: 1738.5. Samples: 13714658. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 00:57:31,211][50642] Avg episode reward: [(0, '14.290'), (1, '18.550')] [2023-10-08 00:57:32,477][52060] Updated weights for policy 0, policy_version 26600 (0.0009) [2023-10-08 00:57:32,842][52060] Updated weights for policy 0, policy_version 26610 (0.0010) [2023-10-08 00:57:33,210][52060] Updated weights for policy 0, policy_version 26620 (0.0009) [2023-10-08 00:57:34,081][52059] Updated weights for policy 1, policy_version 26952 (0.0010) [2023-10-08 00:57:34,441][52059] Updated weights for policy 1, policy_version 26962 (0.0010) [2023-10-08 00:57:34,800][52059] Updated weights for policy 1, policy_version 26972 (0.0008) [2023-10-08 00:57:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 54886400. Throughput: 0: 1692.8, 1: 1769.7. Samples: 13725402. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:57:36,211][50642] Avg episode reward: [(0, '15.990'), (1, '19.900')] [2023-10-08 00:57:37,287][52060] Updated weights for policy 0, policy_version 26630 (0.0007) [2023-10-08 00:57:37,662][52060] Updated weights for policy 0, policy_version 26640 (0.0008) [2023-10-08 00:57:38,037][52060] Updated weights for policy 0, policy_version 26650 (0.0008) [2023-10-08 00:57:38,961][52059] Updated weights for policy 1, policy_version 26982 (0.0008) [2023-10-08 00:57:39,323][52059] Updated weights for policy 1, policy_version 26992 (0.0007) [2023-10-08 00:57:39,681][52059] Updated weights for policy 1, policy_version 27002 (0.0009) [2023-10-08 00:57:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 54951936. Throughput: 0: 1715.5, 1: 1739.0. Samples: 13745598. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:57:41,211][50642] Avg episode reward: [(0, '16.180'), (1, '19.860')] [2023-10-08 00:57:42,036][52060] Updated weights for policy 0, policy_version 26660 (0.0009) [2023-10-08 00:57:42,406][52060] Updated weights for policy 0, policy_version 26670 (0.0007) [2023-10-08 00:57:42,773][52060] Updated weights for policy 0, policy_version 26680 (0.0008) [2023-10-08 00:57:43,592][52059] Updated weights for policy 1, policy_version 27012 (0.0008) [2023-10-08 00:57:43,959][52059] Updated weights for policy 1, policy_version 27022 (0.0008) [2023-10-08 00:57:44,322][52059] Updated weights for policy 1, policy_version 27032 (0.0009) [2023-10-08 00:57:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13884.7). Total num frames: 55017472. Throughput: 0: 1716.9, 1: 1733.6. Samples: 13766678. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:57:46,211][50642] Avg episode reward: [(0, '14.510'), (1, '18.920')] [2023-10-08 00:57:46,811][52060] Updated weights for policy 0, policy_version 26690 (0.0008) [2023-10-08 00:57:47,177][52060] Updated weights for policy 0, policy_version 26700 (0.0010) [2023-10-08 00:57:47,544][52060] Updated weights for policy 0, policy_version 26710 (0.0007) [2023-10-08 00:57:47,920][52060] Updated weights for policy 0, policy_version 26720 (0.0008) [2023-10-08 00:57:48,113][52059] Updated weights for policy 1, policy_version 27042 (0.0009) [2023-10-08 00:57:48,484][52059] Updated weights for policy 1, policy_version 27052 (0.0009) [2023-10-08 00:57:48,854][52059] Updated weights for policy 1, policy_version 27062 (0.0007) [2023-10-08 00:57:49,207][52059] Updated weights for policy 1, policy_version 27072 (0.0011) [2023-10-08 00:57:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13884.8). Total num frames: 55083008. Throughput: 0: 1699.2, 1: 1746.9. Samples: 13776560. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:57:51,211][50642] Avg episode reward: [(0, '17.070'), (1, '18.580')] [2023-10-08 00:57:51,945][52060] Updated weights for policy 0, policy_version 26730 (0.0008) [2023-10-08 00:57:52,320][52060] Updated weights for policy 0, policy_version 26740 (0.0007) [2023-10-08 00:57:52,699][52060] Updated weights for policy 0, policy_version 26750 (0.0008) [2023-10-08 00:57:53,205][52059] Updated weights for policy 1, policy_version 27082 (0.0007) [2023-10-08 00:57:53,573][52059] Updated weights for policy 1, policy_version 27092 (0.0008) [2023-10-08 00:57:53,933][52059] Updated weights for policy 1, policy_version 27102 (0.0011) [2023-10-08 00:57:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 55148544. Throughput: 0: 1712.8, 1: 1730.4. Samples: 13797146. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) [2023-10-08 00:57:56,211][50642] Avg episode reward: [(0, '15.150'), (1, '19.840')] [2023-10-08 00:57:56,662][52060] Updated weights for policy 0, policy_version 26760 (0.0008) [2023-10-08 00:57:57,038][52060] Updated weights for policy 0, policy_version 26770 (0.0007) [2023-10-08 00:57:57,415][52060] Updated weights for policy 0, policy_version 26780 (0.0007) [2023-10-08 00:57:57,841][52059] Updated weights for policy 1, policy_version 27112 (0.0009) [2023-10-08 00:57:58,207][52059] Updated weights for policy 1, policy_version 27122 (0.0010) [2023-10-08 00:57:58,571][52059] Updated weights for policy 1, policy_version 27132 (0.0009) [2023-10-08 00:58:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 55214080. Throughput: 0: 1714.1, 1: 1738.5. Samples: 13818466. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) [2023-10-08 00:58:01,211][50642] Avg episode reward: [(0, '14.730'), (1, '19.080')] [2023-10-08 00:58:01,361][52060] Updated weights for policy 0, policy_version 26790 (0.0009) [2023-10-08 00:58:01,730][52060] Updated weights for policy 0, policy_version 26800 (0.0009) [2023-10-08 00:58:02,097][52060] Updated weights for policy 0, policy_version 26810 (0.0009) [2023-10-08 00:58:02,424][52059] Updated weights for policy 1, policy_version 27142 (0.0008) [2023-10-08 00:58:02,793][52059] Updated weights for policy 1, policy_version 27152 (0.0009) [2023-10-08 00:58:03,157][52059] Updated weights for policy 1, policy_version 27162 (0.0007) [2023-10-08 00:58:06,010][52060] Updated weights for policy 0, policy_version 26820 (0.0008) [2023-10-08 00:58:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 55279616. Throughput: 0: 1708.5, 1: 1725.7. Samples: 13827864. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) [2023-10-08 00:58:06,211][50642] Avg episode reward: [(0, '16.920'), (1, '18.360')] [2023-10-08 00:58:06,378][52060] Updated weights for policy 0, policy_version 26830 (0.0008) [2023-10-08 00:58:06,756][52060] Updated weights for policy 0, policy_version 26840 (0.0007) [2023-10-08 00:58:07,116][52059] Updated weights for policy 1, policy_version 27172 (0.0008) [2023-10-08 00:58:07,479][52059] Updated weights for policy 1, policy_version 27182 (0.0008) [2023-10-08 00:58:07,851][52059] Updated weights for policy 1, policy_version 27192 (0.0009) [2023-10-08 00:58:10,668][52060] Updated weights for policy 0, policy_version 26850 (0.0008) [2023-10-08 00:58:11,034][52060] Updated weights for policy 0, policy_version 26860 (0.0009) [2023-10-08 00:58:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 55345152. Throughput: 0: 1713.3, 1: 1726.2. Samples: 13849374. Policy #0 lag: (min: 22.0, avg: 22.0, max: 22.0) [2023-10-08 00:58:11,211][50642] Avg episode reward: [(0, '17.390'), (1, '18.280')] [2023-10-08 00:58:11,395][52060] Updated weights for policy 0, policy_version 26870 (0.0008) [2023-10-08 00:58:11,765][52060] Updated weights for policy 0, policy_version 26880 (0.0009) [2023-10-08 00:58:11,779][52059] Updated weights for policy 1, policy_version 27202 (0.0009) [2023-10-08 00:58:12,139][52059] Updated weights for policy 1, policy_version 27212 (0.0008) [2023-10-08 00:58:12,501][52059] Updated weights for policy 1, policy_version 27222 (0.0008) [2023-10-08 00:58:12,865][52059] Updated weights for policy 1, policy_version 27232 (0.0008) [2023-10-08 00:58:15,894][52060] Updated weights for policy 0, policy_version 26890 (0.0011) [2023-10-08 00:58:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 55410688. Throughput: 0: 1709.9, 1: 1754.6. Samples: 13870558. Policy #0 lag: (min: 31.0, avg: 35.1, max: 63.0) [2023-10-08 00:58:16,211][50642] Avg episode reward: [(0, '15.250'), (1, '18.950')] [2023-10-08 00:58:16,263][52060] Updated weights for policy 0, policy_version 26900 (0.0011) [2023-10-08 00:58:16,627][52060] Updated weights for policy 0, policy_version 26910 (0.0009) [2023-10-08 00:58:16,762][52059] Updated weights for policy 1, policy_version 27242 (0.0008) [2023-10-08 00:58:17,121][52059] Updated weights for policy 1, policy_version 27252 (0.0008) [2023-10-08 00:58:17,502][52059] Updated weights for policy 1, policy_version 27262 (0.0010) [2023-10-08 00:58:20,575][52060] Updated weights for policy 0, policy_version 26920 (0.0009) [2023-10-08 00:58:20,942][52060] Updated weights for policy 0, policy_version 26930 (0.0008) [2023-10-08 00:58:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 55476224. Throughput: 0: 1719.0, 1: 1719.8. Samples: 13880148. Policy #0 lag: (min: 31.0, avg: 35.1, max: 63.0) [2023-10-08 00:58:21,211][50642] Avg episode reward: [(0, '16.800'), (1, '18.960')] [2023-10-08 00:58:21,312][52060] Updated weights for policy 0, policy_version 26940 (0.0007) [2023-10-08 00:58:21,388][52059] Updated weights for policy 1, policy_version 27272 (0.0009) [2023-10-08 00:58:21,756][52059] Updated weights for policy 1, policy_version 27282 (0.0008) [2023-10-08 00:58:22,125][52059] Updated weights for policy 1, policy_version 27292 (0.0010) [2023-10-08 00:58:25,409][52060] Updated weights for policy 0, policy_version 26950 (0.0008) [2023-10-08 00:58:25,771][52060] Updated weights for policy 0, policy_version 26960 (0.0008) [2023-10-08 00:58:26,138][52060] Updated weights for policy 0, policy_version 26970 (0.0009) [2023-10-08 00:58:26,180][52059] Updated weights for policy 1, policy_version 27302 (0.0008) [2023-10-08 00:58:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 55541760. Throughput: 0: 1717.1, 1: 1743.6. Samples: 13901330. Policy #0 lag: (min: 31.0, avg: 35.1, max: 63.0) [2023-10-08 00:58:26,211][50642] Avg episode reward: [(0, '16.890'), (1, '19.630')] [2023-10-08 00:58:26,543][52059] Updated weights for policy 1, policy_version 27312 (0.0007) [2023-10-08 00:58:26,904][52059] Updated weights for policy 1, policy_version 27322 (0.0009) [2023-10-08 00:58:29,858][52060] Updated weights for policy 0, policy_version 26980 (0.0007) [2023-10-08 00:58:30,234][52060] Updated weights for policy 0, policy_version 26990 (0.0008) [2023-10-08 00:58:30,607][52060] Updated weights for policy 0, policy_version 27000 (0.0008) [2023-10-08 00:58:30,889][52059] Updated weights for policy 1, policy_version 27332 (0.0008) [2023-10-08 00:58:31,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 55640064. Throughput: 0: 1693.2, 1: 1746.2. Samples: 13921448. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:58:31,211][50642] Avg episode reward: [(0, '17.340'), (1, '19.100')] [2023-10-08 00:58:31,247][52059] Updated weights for policy 1, policy_version 27342 (0.0007) [2023-10-08 00:58:31,618][52059] Updated weights for policy 1, policy_version 27352 (0.0007) [2023-10-08 00:58:34,555][52060] Updated weights for policy 0, policy_version 27010 (0.0008) [2023-10-08 00:58:34,918][52060] Updated weights for policy 0, policy_version 27020 (0.0010) [2023-10-08 00:58:35,292][52060] Updated weights for policy 0, policy_version 27030 (0.0009) [2023-10-08 00:58:35,378][52059] Updated weights for policy 1, policy_version 27362 (0.0009) [2023-10-08 00:58:35,655][52060] Updated weights for policy 0, policy_version 27040 (0.0008) [2023-10-08 00:58:35,748][52059] Updated weights for policy 1, policy_version 27372 (0.0008) [2023-10-08 00:58:36,107][52059] Updated weights for policy 1, policy_version 27382 (0.0009) [2023-10-08 00:58:36,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 55705600. Throughput: 0: 1721.4, 1: 1735.6. Samples: 13932126. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:58:36,211][50642] Avg episode reward: [(0, '16.270'), (1, '18.380')] [2023-10-08 00:58:36,470][52059] Updated weights for policy 1, policy_version 27392 (0.0007) [2023-10-08 00:58:39,598][52060] Updated weights for policy 0, policy_version 27050 (0.0007) [2023-10-08 00:58:39,953][52060] Updated weights for policy 0, policy_version 27060 (0.0009) [2023-10-08 00:58:40,322][52060] Updated weights for policy 0, policy_version 27070 (0.0007) [2023-10-08 00:58:40,561][52059] Updated weights for policy 1, policy_version 27402 (0.0010) [2023-10-08 00:58:40,924][52059] Updated weights for policy 1, policy_version 27412 (0.0009) [2023-10-08 00:58:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 55771136. Throughput: 0: 1706.9, 1: 1753.4. Samples: 13952862. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:58:41,211][50642] Avg episode reward: [(0, '17.770'), (1, '18.620')] [2023-10-08 00:58:41,294][52059] Updated weights for policy 1, policy_version 27422 (0.0010) [2023-10-08 00:58:44,327][52060] Updated weights for policy 0, policy_version 27080 (0.0008) [2023-10-08 00:58:44,699][52060] Updated weights for policy 0, policy_version 27090 (0.0009) [2023-10-08 00:58:45,069][52060] Updated weights for policy 0, policy_version 27100 (0.0007) [2023-10-08 00:58:45,227][52059] Updated weights for policy 1, policy_version 27432 (0.0007) [2023-10-08 00:58:45,589][52059] Updated weights for policy 1, policy_version 27442 (0.0007) [2023-10-08 00:58:45,957][52059] Updated weights for policy 1, policy_version 27452 (0.0007) [2023-10-08 00:58:46,210][50642] Fps is (10 sec: 16383.4, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 55869440. Throughput: 0: 1690.6, 1: 1731.3. Samples: 13972452. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 00:58:46,211][50642] Avg episode reward: [(0, '15.690'), (1, '19.570')] [2023-10-08 00:58:46,222][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000027104_27754496.pth... [2023-10-08 00:58:46,222][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000027456_28114944.pth... [2023-10-08 00:58:46,256][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000025824_26443776.pth [2023-10-08 00:58:46,262][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000025504_26116096.pth [2023-10-08 00:58:49,051][52060] Updated weights for policy 0, policy_version 27110 (0.0008) [2023-10-08 00:58:49,429][52060] Updated weights for policy 0, policy_version 27120 (0.0009) [2023-10-08 00:58:49,775][52059] Updated weights for policy 1, policy_version 27462 (0.0010) [2023-10-08 00:58:49,782][52060] Updated weights for policy 0, policy_version 27130 (0.0007) [2023-10-08 00:58:50,147][52059] Updated weights for policy 1, policy_version 27472 (0.0009) [2023-10-08 00:58:50,507][52059] Updated weights for policy 1, policy_version 27482 (0.0007) [2023-10-08 00:58:51,210][50642] Fps is (10 sec: 16383.6, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 55934976. Throughput: 0: 1718.4, 1: 1752.8. Samples: 13984072. Policy #0 lag: (min: 18.0, avg: 18.0, max: 22.0) [2023-10-08 00:58:51,211][50642] Avg episode reward: [(0, '16.950'), (1, '19.940')] [2023-10-08 00:58:53,847][52060] Updated weights for policy 0, policy_version 27140 (0.0008) [2023-10-08 00:58:54,215][52060] Updated weights for policy 0, policy_version 27150 (0.0008) [2023-10-08 00:58:54,445][52059] Updated weights for policy 1, policy_version 27492 (0.0007) [2023-10-08 00:58:54,584][52060] Updated weights for policy 0, policy_version 27160 (0.0010) [2023-10-08 00:58:54,799][52059] Updated weights for policy 1, policy_version 27502 (0.0007) [2023-10-08 00:58:55,175][52059] Updated weights for policy 1, policy_version 27512 (0.0007) [2023-10-08 00:58:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 56000512. Throughput: 0: 1687.4, 1: 1737.8. Samples: 14003506. Policy #0 lag: (min: 18.0, avg: 18.0, max: 22.0) [2023-10-08 00:58:56,211][50642] Avg episode reward: [(0, '16.680'), (1, '18.440')] [2023-10-08 00:58:58,679][52060] Updated weights for policy 0, policy_version 27170 (0.0008) [2023-10-08 00:58:58,943][52059] Updated weights for policy 1, policy_version 27522 (0.0008) [2023-10-08 00:58:59,039][52060] Updated weights for policy 0, policy_version 27180 (0.0008) [2023-10-08 00:58:59,317][52059] Updated weights for policy 1, policy_version 27532 (0.0009) [2023-10-08 00:58:59,406][52060] Updated weights for policy 0, policy_version 27190 (0.0008) [2023-10-08 00:58:59,677][52059] Updated weights for policy 1, policy_version 27542 (0.0008) [2023-10-08 00:58:59,780][52060] Updated weights for policy 0, policy_version 27200 (0.0007) [2023-10-08 00:59:00,036][52059] Updated weights for policy 1, policy_version 27552 (0.0009) [2023-10-08 00:59:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 56066048. Throughput: 0: 1691.1, 1: 1718.6. Samples: 14023994. Policy #0 lag: (min: 18.0, avg: 18.0, max: 22.0) [2023-10-08 00:59:01,211][50642] Avg episode reward: [(0, '15.960'), (1, '19.240')] [2023-10-08 00:59:03,959][52060] Updated weights for policy 0, policy_version 27210 (0.0008) [2023-10-08 00:59:04,070][52059] Updated weights for policy 1, policy_version 27562 (0.0008) [2023-10-08 00:59:04,328][52060] Updated weights for policy 0, policy_version 27220 (0.0009) [2023-10-08 00:59:04,433][52059] Updated weights for policy 1, policy_version 27572 (0.0007) [2023-10-08 00:59:04,693][52060] Updated weights for policy 0, policy_version 27230 (0.0009) [2023-10-08 00:59:04,786][52059] Updated weights for policy 1, policy_version 27582 (0.0010) [2023-10-08 00:59:06,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 56131584. Throughput: 0: 1705.0, 1: 1744.9. Samples: 14035394. Policy #0 lag: (min: 18.0, avg: 18.0, max: 22.0) [2023-10-08 00:59:06,211][50642] Avg episode reward: [(0, '17.520'), (1, '20.000')] [2023-10-08 00:59:08,670][52059] Updated weights for policy 1, policy_version 27592 (0.0008) [2023-10-08 00:59:08,692][52060] Updated weights for policy 0, policy_version 27240 (0.0007) [2023-10-08 00:59:09,031][52059] Updated weights for policy 1, policy_version 27602 (0.0008) [2023-10-08 00:59:09,066][52060] Updated weights for policy 0, policy_version 27250 (0.0008) [2023-10-08 00:59:09,395][52059] Updated weights for policy 1, policy_version 27612 (0.0008) [2023-10-08 00:59:09,430][52060] Updated weights for policy 0, policy_version 27260 (0.0010) [2023-10-08 00:59:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 56197120. Throughput: 0: 1686.2, 1: 1722.4. Samples: 14054718. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 00:59:11,211][50642] Avg episode reward: [(0, '16.920'), (1, '17.860')] [2023-10-08 00:59:13,497][52060] Updated weights for policy 0, policy_version 27270 (0.0008) [2023-10-08 00:59:13,509][52059] Updated weights for policy 1, policy_version 27622 (0.0008) [2023-10-08 00:59:13,859][52060] Updated weights for policy 0, policy_version 27280 (0.0008) [2023-10-08 00:59:13,870][52059] Updated weights for policy 1, policy_version 27632 (0.0008) [2023-10-08 00:59:14,232][52060] Updated weights for policy 0, policy_version 27290 (0.0008) [2023-10-08 00:59:14,234][52059] Updated weights for policy 1, policy_version 27642 (0.0007) [2023-10-08 00:59:16,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.4, 300 sec: 13884.8). Total num frames: 56262656. Throughput: 0: 1707.2, 1: 1720.3. Samples: 14075682. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 00:59:16,211][50642] Avg episode reward: [(0, '16.600'), (1, '19.490')] [2023-10-08 00:59:18,182][52060] Updated weights for policy 0, policy_version 27300 (0.0011) [2023-10-08 00:59:18,205][52059] Updated weights for policy 1, policy_version 27652 (0.0007) [2023-10-08 00:59:18,546][52060] Updated weights for policy 0, policy_version 27310 (0.0007) [2023-10-08 00:59:18,574][52059] Updated weights for policy 1, policy_version 27662 (0.0010) [2023-10-08 00:59:18,906][52060] Updated weights for policy 0, policy_version 27320 (0.0008) [2023-10-08 00:59:18,940][52059] Updated weights for policy 1, policy_version 27672 (0.0009) [2023-10-08 00:59:21,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 56328192. Throughput: 0: 1693.5, 1: 1725.5. Samples: 14085982. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 00:59:21,211][50642] Avg episode reward: [(0, '17.460'), (1, '20.900')] [2023-10-08 00:59:22,969][52059] Updated weights for policy 1, policy_version 27682 (0.0007) [2023-10-08 00:59:23,017][52060] Updated weights for policy 0, policy_version 27330 (0.0009) [2023-10-08 00:59:23,332][52059] Updated weights for policy 1, policy_version 27692 (0.0007) [2023-10-08 00:59:23,388][52060] Updated weights for policy 0, policy_version 27340 (0.0008) [2023-10-08 00:59:23,702][52059] Updated weights for policy 1, policy_version 27702 (0.0007) [2023-10-08 00:59:23,753][52060] Updated weights for policy 0, policy_version 27350 (0.0009) [2023-10-08 00:59:24,063][52059] Updated weights for policy 1, policy_version 27712 (0.0008) [2023-10-08 00:59:24,128][52060] Updated weights for policy 0, policy_version 27360 (0.0008) [2023-10-08 00:59:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 56393728. Throughput: 0: 1693.4, 1: 1711.0. Samples: 14106060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 00:59:26,211][50642] Avg episode reward: [(0, '16.540'), (1, '20.100')] [2023-10-08 00:59:28,043][52060] Updated weights for policy 0, policy_version 27370 (0.0007) [2023-10-08 00:59:28,091][52059] Updated weights for policy 1, policy_version 27722 (0.0009) [2023-10-08 00:59:28,409][52060] Updated weights for policy 0, policy_version 27380 (0.0009) [2023-10-08 00:59:28,458][52059] Updated weights for policy 1, policy_version 27732 (0.0007) [2023-10-08 00:59:28,781][52060] Updated weights for policy 0, policy_version 27390 (0.0008) [2023-10-08 00:59:28,818][52059] Updated weights for policy 1, policy_version 27742 (0.0008) [2023-10-08 00:59:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 56459264. Throughput: 0: 1711.8, 1: 1729.0. Samples: 14127290. Policy #0 lag: (min: 1.0, avg: 14.5, max: 33.0) [2023-10-08 00:59:31,211][50642] Avg episode reward: [(0, '16.470'), (1, '18.700')] [2023-10-08 00:59:32,618][52059] Updated weights for policy 1, policy_version 27752 (0.0007) [2023-10-08 00:59:32,715][52060] Updated weights for policy 0, policy_version 27400 (0.0007) [2023-10-08 00:59:32,985][52059] Updated weights for policy 1, policy_version 27762 (0.0008) [2023-10-08 00:59:33,085][52060] Updated weights for policy 0, policy_version 27410 (0.0008) [2023-10-08 00:59:33,358][52059] Updated weights for policy 1, policy_version 27772 (0.0008) [2023-10-08 00:59:33,452][52060] Updated weights for policy 0, policy_version 27420 (0.0008) [2023-10-08 00:59:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 56524800. Throughput: 0: 1683.6, 1: 1706.3. Samples: 14136618. Policy #0 lag: (min: 1.0, avg: 14.5, max: 33.0) [2023-10-08 00:59:36,211][50642] Avg episode reward: [(0, '18.140'), (1, '20.280')] [2023-10-08 00:59:37,110][52059] Updated weights for policy 1, policy_version 27782 (0.0009) [2023-10-08 00:59:37,436][52060] Updated weights for policy 0, policy_version 27430 (0.0007) [2023-10-08 00:59:37,468][52059] Updated weights for policy 1, policy_version 27792 (0.0009) [2023-10-08 00:59:37,795][52060] Updated weights for policy 0, policy_version 27440 (0.0008) [2023-10-08 00:59:37,834][52059] Updated weights for policy 1, policy_version 27802 (0.0010) [2023-10-08 00:59:38,163][52060] Updated weights for policy 0, policy_version 27450 (0.0009) [2023-10-08 00:59:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 56590336. Throughput: 0: 1711.9, 1: 1718.8. Samples: 14157886. Policy #0 lag: (min: 1.0, avg: 14.5, max: 33.0) [2023-10-08 00:59:41,211][50642] Avg episode reward: [(0, '16.570'), (1, '19.720')] [2023-10-08 00:59:41,994][52059] Updated weights for policy 1, policy_version 27812 (0.0009) [2023-10-08 00:59:42,245][52060] Updated weights for policy 0, policy_version 27460 (0.0010) [2023-10-08 00:59:42,359][52059] Updated weights for policy 1, policy_version 27822 (0.0009) [2023-10-08 00:59:42,616][52060] Updated weights for policy 0, policy_version 27470 (0.0007) [2023-10-08 00:59:42,727][52059] Updated weights for policy 1, policy_version 27832 (0.0008) [2023-10-08 00:59:42,987][52060] Updated weights for policy 0, policy_version 27480 (0.0007) [2023-10-08 00:59:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 56655872. Throughput: 0: 1714.5, 1: 1727.8. Samples: 14178896. Policy #0 lag: (min: 1.0, avg: 14.5, max: 33.0) [2023-10-08 00:59:46,211][50642] Avg episode reward: [(0, '16.290'), (1, '18.110')] [2023-10-08 00:59:46,770][52059] Updated weights for policy 1, policy_version 27842 (0.0008) [2023-10-08 00:59:46,953][52060] Updated weights for policy 0, policy_version 27490 (0.0008) [2023-10-08 00:59:47,132][52059] Updated weights for policy 1, policy_version 27852 (0.0008) [2023-10-08 00:59:47,324][52060] Updated weights for policy 0, policy_version 27500 (0.0009) [2023-10-08 00:59:47,490][52059] Updated weights for policy 1, policy_version 27862 (0.0007) [2023-10-08 00:59:47,685][52060] Updated weights for policy 0, policy_version 27510 (0.0009) [2023-10-08 00:59:47,858][52059] Updated weights for policy 1, policy_version 27872 (0.0008) [2023-10-08 00:59:48,054][52060] Updated weights for policy 0, policy_version 27520 (0.0008) [2023-10-08 00:59:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 56721408. Throughput: 0: 1691.9, 1: 1700.9. Samples: 14188070. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) [2023-10-08 00:59:51,212][50642] Avg episode reward: [(0, '18.000'), (1, '18.050')] [2023-10-08 00:59:51,879][52059] Updated weights for policy 1, policy_version 27882 (0.0008) [2023-10-08 00:59:52,243][52059] Updated weights for policy 1, policy_version 27892 (0.0009) [2023-10-08 00:59:52,254][52060] Updated weights for policy 0, policy_version 27530 (0.0009) [2023-10-08 00:59:52,610][52059] Updated weights for policy 1, policy_version 27902 (0.0008) [2023-10-08 00:59:52,623][52060] Updated weights for policy 0, policy_version 27540 (0.0008) [2023-10-08 00:59:52,989][52060] Updated weights for policy 0, policy_version 27550 (0.0009) [2023-10-08 00:59:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 56786944. Throughput: 0: 1709.7, 1: 1721.0. Samples: 14209100. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) [2023-10-08 00:59:56,211][50642] Avg episode reward: [(0, '16.940'), (1, '19.910')] [2023-10-08 00:59:56,676][52059] Updated weights for policy 1, policy_version 27912 (0.0008) [2023-10-08 00:59:56,871][52060] Updated weights for policy 0, policy_version 27560 (0.0007) [2023-10-08 00:59:57,046][52059] Updated weights for policy 1, policy_version 27922 (0.0008) [2023-10-08 00:59:57,242][52060] Updated weights for policy 0, policy_version 27570 (0.0008) [2023-10-08 00:59:57,400][52059] Updated weights for policy 1, policy_version 27932 (0.0008) [2023-10-08 00:59:57,603][52060] Updated weights for policy 0, policy_version 27580 (0.0010) [2023-10-08 01:00:01,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 56852480. Throughput: 0: 1713.6, 1: 1726.0. Samples: 14230466. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) [2023-10-08 01:00:01,211][50642] Avg episode reward: [(0, '17.500'), (1, '15.780')] [2023-10-08 01:00:01,443][52059] Updated weights for policy 1, policy_version 27942 (0.0008) [2023-10-08 01:00:01,584][52060] Updated weights for policy 0, policy_version 27590 (0.0009) [2023-10-08 01:00:01,801][52059] Updated weights for policy 1, policy_version 27952 (0.0008) [2023-10-08 01:00:01,954][52060] Updated weights for policy 0, policy_version 27600 (0.0009) [2023-10-08 01:00:02,165][52059] Updated weights for policy 1, policy_version 27962 (0.0010) [2023-10-08 01:00:02,332][52060] Updated weights for policy 0, policy_version 27610 (0.0007) [2023-10-08 01:00:06,142][52059] Updated weights for policy 1, policy_version 27972 (0.0007) [2023-10-08 01:00:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 56918016. Throughput: 0: 1698.1, 1: 1717.3. Samples: 14239676. Policy #0 lag: (min: 31.0, avg: 38.0, max: 63.0) [2023-10-08 01:00:06,211][50642] Avg episode reward: [(0, '19.810'), (1, '18.830')] [2023-10-08 01:00:06,230][52060] Updated weights for policy 0, policy_version 27620 (0.0008) [2023-10-08 01:00:06,494][52059] Updated weights for policy 1, policy_version 27982 (0.0007) [2023-10-08 01:00:06,596][52060] Updated weights for policy 0, policy_version 27630 (0.0007) [2023-10-08 01:00:06,857][52059] Updated weights for policy 1, policy_version 27992 (0.0008) [2023-10-08 01:00:06,956][52060] Updated weights for policy 0, policy_version 27640 (0.0009) [2023-10-08 01:00:07,249][51605] Saving new best policy, reward=19.810! [2023-10-08 01:00:10,666][52059] Updated weights for policy 1, policy_version 28002 (0.0008) [2023-10-08 01:00:10,951][52060] Updated weights for policy 0, policy_version 27650 (0.0007) [2023-10-08 01:00:11,031][52059] Updated weights for policy 1, policy_version 28012 (0.0009) [2023-10-08 01:00:11,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 56983552. Throughput: 0: 1716.5, 1: 1728.2. Samples: 14261070. Policy #0 lag: (min: 18.0, avg: 18.2, max: 28.0) [2023-10-08 01:00:11,211][50642] Avg episode reward: [(0, '17.500'), (1, '19.490')] [2023-10-08 01:00:11,306][52060] Updated weights for policy 0, policy_version 27660 (0.0007) [2023-10-08 01:00:11,395][52059] Updated weights for policy 1, policy_version 28022 (0.0008) [2023-10-08 01:00:11,673][52060] Updated weights for policy 0, policy_version 27670 (0.0007) [2023-10-08 01:00:11,752][52059] Updated weights for policy 1, policy_version 28032 (0.0009) [2023-10-08 01:00:12,045][52060] Updated weights for policy 0, policy_version 27680 (0.0009) [2023-10-08 01:00:15,708][52059] Updated weights for policy 1, policy_version 28042 (0.0008) [2023-10-08 01:00:15,974][52060] Updated weights for policy 0, policy_version 27690 (0.0008) [2023-10-08 01:00:16,069][52059] Updated weights for policy 1, policy_version 28052 (0.0007) [2023-10-08 01:00:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 57049088. Throughput: 0: 1710.5, 1: 1719.9. Samples: 14281656. Policy #0 lag: (min: 18.0, avg: 18.2, max: 28.0) [2023-10-08 01:00:16,211][50642] Avg episode reward: [(0, '17.730'), (1, '17.500')] [2023-10-08 01:00:16,343][52060] Updated weights for policy 0, policy_version 27700 (0.0007) [2023-10-08 01:00:16,440][52059] Updated weights for policy 1, policy_version 28062 (0.0009) [2023-10-08 01:00:16,718][52060] Updated weights for policy 0, policy_version 27710 (0.0007) [2023-10-08 01:00:20,234][52059] Updated weights for policy 1, policy_version 28072 (0.0009) [2023-10-08 01:00:20,579][52060] Updated weights for policy 0, policy_version 27720 (0.0008) [2023-10-08 01:00:20,587][52059] Updated weights for policy 1, policy_version 28082 (0.0008) [2023-10-08 01:00:20,946][52060] Updated weights for policy 0, policy_version 27730 (0.0007) [2023-10-08 01:00:20,949][52059] Updated weights for policy 1, policy_version 28092 (0.0009) [2023-10-08 01:00:21,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 57147392. Throughput: 0: 1718.6, 1: 1732.4. Samples: 14291914. Policy #0 lag: (min: 18.0, avg: 18.2, max: 28.0) [2023-10-08 01:00:21,211][50642] Avg episode reward: [(0, '19.830'), (1, '18.580')] [2023-10-08 01:00:21,311][52060] Updated weights for policy 0, policy_version 27740 (0.0007) [2023-10-08 01:00:21,459][51605] Saving new best policy, reward=19.830! [2023-10-08 01:00:24,906][52059] Updated weights for policy 1, policy_version 28102 (0.0008) [2023-10-08 01:00:25,258][52060] Updated weights for policy 0, policy_version 27750 (0.0008) [2023-10-08 01:00:25,266][52059] Updated weights for policy 1, policy_version 28112 (0.0009) [2023-10-08 01:00:25,617][52060] Updated weights for policy 0, policy_version 27760 (0.0007) [2023-10-08 01:00:25,633][52059] Updated weights for policy 1, policy_version 28122 (0.0008) [2023-10-08 01:00:25,980][52060] Updated weights for policy 0, policy_version 27770 (0.0007) [2023-10-08 01:00:26,210][50642] Fps is (10 sec: 19660.9, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 57245696. Throughput: 0: 1721.9, 1: 1729.2. Samples: 14313188. Policy #0 lag: (min: 15.0, avg: 17.1, max: 46.0) [2023-10-08 01:00:26,211][50642] Avg episode reward: [(0, '17.940'), (1, '19.370')] [2023-10-08 01:00:29,620][52059] Updated weights for policy 1, policy_version 28132 (0.0010) [2023-10-08 01:00:29,979][52060] Updated weights for policy 0, policy_version 27780 (0.0007) [2023-10-08 01:00:29,984][52059] Updated weights for policy 1, policy_version 28142 (0.0009) [2023-10-08 01:00:30,344][52059] Updated weights for policy 1, policy_version 28152 (0.0007) [2023-10-08 01:00:30,350][52060] Updated weights for policy 0, policy_version 27790 (0.0007) [2023-10-08 01:00:30,716][52060] Updated weights for policy 0, policy_version 27800 (0.0008) [2023-10-08 01:00:31,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 57311232. Throughput: 0: 1701.6, 1: 1710.0. Samples: 14332416. Policy #0 lag: (min: 15.0, avg: 17.1, max: 46.0) [2023-10-08 01:00:31,211][50642] Avg episode reward: [(0, '17.980'), (1, '19.300')] [2023-10-08 01:00:34,168][52059] Updated weights for policy 1, policy_version 28162 (0.0008) [2023-10-08 01:00:34,534][52059] Updated weights for policy 1, policy_version 28172 (0.0007) [2023-10-08 01:00:34,741][52060] Updated weights for policy 0, policy_version 27810 (0.0008) [2023-10-08 01:00:34,890][52059] Updated weights for policy 1, policy_version 28182 (0.0008) [2023-10-08 01:00:35,109][52060] Updated weights for policy 0, policy_version 27820 (0.0009) [2023-10-08 01:00:35,258][52059] Updated weights for policy 1, policy_version 28192 (0.0008) [2023-10-08 01:00:35,477][52060] Updated weights for policy 0, policy_version 27830 (0.0010) [2023-10-08 01:00:35,836][52060] Updated weights for policy 0, policy_version 27840 (0.0009) [2023-10-08 01:00:36,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 57376768. Throughput: 0: 1726.9, 1: 1744.5. Samples: 14344286. Policy #0 lag: (min: 15.0, avg: 17.1, max: 46.0) [2023-10-08 01:00:36,211][50642] Avg episode reward: [(0, '19.090'), (1, '17.040')] [2023-10-08 01:00:39,238][52059] Updated weights for policy 1, policy_version 28202 (0.0009) [2023-10-08 01:00:39,600][52059] Updated weights for policy 1, policy_version 28212 (0.0008) [2023-10-08 01:00:39,962][52059] Updated weights for policy 1, policy_version 28222 (0.0007) [2023-10-08 01:00:40,025][52060] Updated weights for policy 0, policy_version 27850 (0.0010) [2023-10-08 01:00:40,391][52060] Updated weights for policy 0, policy_version 27860 (0.0008) [2023-10-08 01:00:40,758][52060] Updated weights for policy 0, policy_version 27870 (0.0008) [2023-10-08 01:00:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 57442304. Throughput: 0: 1719.5, 1: 1722.8. Samples: 14364006. Policy #0 lag: (min: 15.0, avg: 17.1, max: 46.0) [2023-10-08 01:00:41,211][50642] Avg episode reward: [(0, '16.320'), (1, '16.710')] [2023-10-08 01:00:43,897][52059] Updated weights for policy 1, policy_version 28232 (0.0010) [2023-10-08 01:00:44,263][52059] Updated weights for policy 1, policy_version 28242 (0.0009) [2023-10-08 01:00:44,629][52059] Updated weights for policy 1, policy_version 28252 (0.0007) [2023-10-08 01:00:44,720][52060] Updated weights for policy 0, policy_version 27880 (0.0009) [2023-10-08 01:00:45,079][52060] Updated weights for policy 0, policy_version 27890 (0.0010) [2023-10-08 01:00:45,451][52060] Updated weights for policy 0, policy_version 27900 (0.0011) [2023-10-08 01:00:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 57507840. Throughput: 0: 1688.3, 1: 1715.3. Samples: 14383628. Policy #0 lag: (min: 25.0, avg: 42.8, max: 57.0) [2023-10-08 01:00:46,211][50642] Avg episode reward: [(0, '17.330'), (1, '20.160')] [2023-10-08 01:00:46,221][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000027904_28573696.pth... [2023-10-08 01:00:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000028256_28934144.pth... [2023-10-08 01:00:46,251][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000026304_26935296.pth [2023-10-08 01:00:46,255][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000026624_27262976.pth [2023-10-08 01:00:48,534][52059] Updated weights for policy 1, policy_version 28262 (0.0007) [2023-10-08 01:00:48,900][52059] Updated weights for policy 1, policy_version 28272 (0.0008) [2023-10-08 01:00:49,266][52059] Updated weights for policy 1, policy_version 28282 (0.0011) [2023-10-08 01:00:49,449][52060] Updated weights for policy 0, policy_version 27910 (0.0009) [2023-10-08 01:00:49,815][52060] Updated weights for policy 0, policy_version 27920 (0.0007) [2023-10-08 01:00:50,187][52060] Updated weights for policy 0, policy_version 27930 (0.0009) [2023-10-08 01:00:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 57573376. Throughput: 0: 1718.3, 1: 1731.6. Samples: 14394922. Policy #0 lag: (min: 25.0, avg: 42.8, max: 57.0) [2023-10-08 01:00:51,211][50642] Avg episode reward: [(0, '20.080'), (1, '17.830')] [2023-10-08 01:00:51,212][51605] Saving new best policy, reward=20.080! [2023-10-08 01:00:53,241][52059] Updated weights for policy 1, policy_version 28292 (0.0007) [2023-10-08 01:00:53,613][52059] Updated weights for policy 1, policy_version 28302 (0.0007) [2023-10-08 01:00:53,976][52059] Updated weights for policy 1, policy_version 28312 (0.0009) [2023-10-08 01:00:54,168][52060] Updated weights for policy 0, policy_version 27940 (0.0009) [2023-10-08 01:00:54,548][52060] Updated weights for policy 0, policy_version 27950 (0.0008) [2023-10-08 01:00:54,918][52060] Updated weights for policy 0, policy_version 27960 (0.0009) [2023-10-08 01:00:56,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 57638912. Throughput: 0: 1698.1, 1: 1715.3. Samples: 14414674. Policy #0 lag: (min: 25.0, avg: 42.8, max: 57.0) [2023-10-08 01:00:56,211][50642] Avg episode reward: [(0, '17.150'), (1, '18.570')] [2023-10-08 01:00:57,876][52059] Updated weights for policy 1, policy_version 28322 (0.0009) [2023-10-08 01:00:58,244][52059] Updated weights for policy 1, policy_version 28332 (0.0009) [2023-10-08 01:00:58,606][52059] Updated weights for policy 1, policy_version 28342 (0.0008) [2023-10-08 01:00:58,973][52059] Updated weights for policy 1, policy_version 28352 (0.0008) [2023-10-08 01:00:58,974][52060] Updated weights for policy 0, policy_version 27970 (0.0008) [2023-10-08 01:00:59,346][52060] Updated weights for policy 0, policy_version 27980 (0.0007) [2023-10-08 01:00:59,720][52060] Updated weights for policy 0, policy_version 27990 (0.0007) [2023-10-08 01:01:00,090][52060] Updated weights for policy 0, policy_version 28000 (0.0008) [2023-10-08 01:01:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 57704448. Throughput: 0: 1689.6, 1: 1727.9. Samples: 14435446. Policy #0 lag: (min: 25.0, avg: 42.8, max: 57.0) [2023-10-08 01:01:01,211][50642] Avg episode reward: [(0, '17.810'), (1, '20.380')] [2023-10-08 01:01:03,014][52059] Updated weights for policy 1, policy_version 28362 (0.0009) [2023-10-08 01:01:03,379][52059] Updated weights for policy 1, policy_version 28372 (0.0008) [2023-10-08 01:01:03,749][52059] Updated weights for policy 1, policy_version 28382 (0.0010) [2023-10-08 01:01:04,248][52060] Updated weights for policy 0, policy_version 28010 (0.0008) [2023-10-08 01:01:04,627][52060] Updated weights for policy 0, policy_version 28020 (0.0007) [2023-10-08 01:01:04,985][52060] Updated weights for policy 0, policy_version 28030 (0.0009) [2023-10-08 01:01:06,211][50642] Fps is (10 sec: 13106.5, 60 sec: 14199.4, 300 sec: 13773.6). Total num frames: 57769984. Throughput: 0: 1707.5, 1: 1713.9. Samples: 14445874. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 01:01:06,212][50642] Avg episode reward: [(0, '19.820'), (1, '19.570')] [2023-10-08 01:01:07,460][52059] Updated weights for policy 1, policy_version 28392 (0.0009) [2023-10-08 01:01:07,824][52059] Updated weights for policy 1, policy_version 28402 (0.0007) [2023-10-08 01:01:08,192][52059] Updated weights for policy 1, policy_version 28412 (0.0009) [2023-10-08 01:01:08,890][52060] Updated weights for policy 0, policy_version 28040 (0.0010) [2023-10-08 01:01:09,257][52060] Updated weights for policy 0, policy_version 28050 (0.0009) [2023-10-08 01:01:09,620][52060] Updated weights for policy 0, policy_version 28060 (0.0010) [2023-10-08 01:01:11,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 57835520. Throughput: 0: 1670.9, 1: 1728.2. Samples: 14466150. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 01:01:11,211][50642] Avg episode reward: [(0, '16.650'), (1, '20.190')] [2023-10-08 01:01:12,096][52059] Updated weights for policy 1, policy_version 28422 (0.0007) [2023-10-08 01:01:12,462][52059] Updated weights for policy 1, policy_version 28432 (0.0007) [2023-10-08 01:01:12,827][52059] Updated weights for policy 1, policy_version 28442 (0.0009) [2023-10-08 01:01:13,788][52060] Updated weights for policy 0, policy_version 28070 (0.0008) [2023-10-08 01:01:14,153][52060] Updated weights for policy 0, policy_version 28080 (0.0011) [2023-10-08 01:01:14,524][52060] Updated weights for policy 0, policy_version 28090 (0.0010) [2023-10-08 01:01:16,210][50642] Fps is (10 sec: 13107.8, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 57901056. Throughput: 0: 1692.2, 1: 1752.8. Samples: 14487442. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 01:01:16,211][50642] Avg episode reward: [(0, '17.980'), (1, '18.410')] [2023-10-08 01:01:16,671][52059] Updated weights for policy 1, policy_version 28452 (0.0010) [2023-10-08 01:01:17,040][52059] Updated weights for policy 1, policy_version 28462 (0.0011) [2023-10-08 01:01:17,409][52059] Updated weights for policy 1, policy_version 28472 (0.0010) [2023-10-08 01:01:18,482][52060] Updated weights for policy 0, policy_version 28100 (0.0009) [2023-10-08 01:01:18,848][52060] Updated weights for policy 0, policy_version 28110 (0.0007) [2023-10-08 01:01:19,217][52060] Updated weights for policy 0, policy_version 28120 (0.0007) [2023-10-08 01:01:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 57966592. Throughput: 0: 1687.8, 1: 1721.1. Samples: 14497688. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 01:01:21,211][50642] Avg episode reward: [(0, '19.380'), (1, '19.090')] [2023-10-08 01:01:21,300][52059] Updated weights for policy 1, policy_version 28482 (0.0007) [2023-10-08 01:01:21,672][52059] Updated weights for policy 1, policy_version 28492 (0.0008) [2023-10-08 01:01:22,038][52059] Updated weights for policy 1, policy_version 28502 (0.0011) [2023-10-08 01:01:22,405][52059] Updated weights for policy 1, policy_version 28512 (0.0009) [2023-10-08 01:01:23,139][52060] Updated weights for policy 0, policy_version 28130 (0.0007) [2023-10-08 01:01:23,508][52060] Updated weights for policy 0, policy_version 28140 (0.0008) [2023-10-08 01:01:23,888][52060] Updated weights for policy 0, policy_version 28150 (0.0008) [2023-10-08 01:01:24,259][52060] Updated weights for policy 0, policy_version 28160 (0.0007) [2023-10-08 01:01:26,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13773.7). Total num frames: 58032128. Throughput: 0: 1683.2, 1: 1750.0. Samples: 14518504. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 01:01:26,211][50642] Avg episode reward: [(0, '17.570'), (1, '20.190')] [2023-10-08 01:01:26,349][52059] Updated weights for policy 1, policy_version 28522 (0.0008) [2023-10-08 01:01:26,707][52059] Updated weights for policy 1, policy_version 28532 (0.0009) [2023-10-08 01:01:27,081][52059] Updated weights for policy 1, policy_version 28542 (0.0008) [2023-10-08 01:01:28,104][52060] Updated weights for policy 0, policy_version 28170 (0.0008) [2023-10-08 01:01:28,475][52060] Updated weights for policy 0, policy_version 28180 (0.0009) [2023-10-08 01:01:28,846][52060] Updated weights for policy 0, policy_version 28190 (0.0010) [2023-10-08 01:01:30,978][52059] Updated weights for policy 1, policy_version 28552 (0.0008) [2023-10-08 01:01:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 58097664. Throughput: 0: 1712.8, 1: 1753.6. Samples: 14539612. Policy #0 lag: (min: 2.0, avg: 4.7, max: 34.0) [2023-10-08 01:01:31,211][50642] Avg episode reward: [(0, '18.590'), (1, '19.620')] [2023-10-08 01:01:31,339][52059] Updated weights for policy 1, policy_version 28562 (0.0008) [2023-10-08 01:01:31,711][52059] Updated weights for policy 1, policy_version 28572 (0.0008) [2023-10-08 01:01:32,757][52060] Updated weights for policy 0, policy_version 28200 (0.0008) [2023-10-08 01:01:33,121][52060] Updated weights for policy 0, policy_version 28210 (0.0007) [2023-10-08 01:01:33,482][52060] Updated weights for policy 0, policy_version 28220 (0.0009) [2023-10-08 01:01:35,442][52059] Updated weights for policy 1, policy_version 28582 (0.0009) [2023-10-08 01:01:35,806][52059] Updated weights for policy 1, policy_version 28592 (0.0009) [2023-10-08 01:01:36,161][52059] Updated weights for policy 1, policy_version 28602 (0.0007) [2023-10-08 01:01:36,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 58163200. Throughput: 0: 1686.1, 1: 1747.4. Samples: 14549430. Policy #0 lag: (min: 2.0, avg: 4.7, max: 34.0) [2023-10-08 01:01:36,211][50642] Avg episode reward: [(0, '19.880'), (1, '17.380')] [2023-10-08 01:01:37,302][52060] Updated weights for policy 0, policy_version 28230 (0.0009) [2023-10-08 01:01:37,670][52060] Updated weights for policy 0, policy_version 28240 (0.0010) [2023-10-08 01:01:38,035][52060] Updated weights for policy 0, policy_version 28250 (0.0009) [2023-10-08 01:01:39,967][52059] Updated weights for policy 1, policy_version 28612 (0.0009) [2023-10-08 01:01:40,331][52059] Updated weights for policy 1, policy_version 28622 (0.0009) [2023-10-08 01:01:40,704][52059] Updated weights for policy 1, policy_version 28632 (0.0008) [2023-10-08 01:01:41,210][50642] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 58261504. Throughput: 0: 1706.7, 1: 1765.5. Samples: 14570922. Policy #0 lag: (min: 2.0, avg: 4.7, max: 34.0) [2023-10-08 01:01:41,211][50642] Avg episode reward: [(0, '18.050'), (1, '18.170')] [2023-10-08 01:01:42,034][52060] Updated weights for policy 0, policy_version 28260 (0.0009) [2023-10-08 01:01:42,405][52060] Updated weights for policy 0, policy_version 28270 (0.0007) [2023-10-08 01:01:42,777][52060] Updated weights for policy 0, policy_version 28280 (0.0008) [2023-10-08 01:01:44,592][52059] Updated weights for policy 1, policy_version 28642 (0.0009) [2023-10-08 01:01:44,958][52059] Updated weights for policy 1, policy_version 28652 (0.0009) [2023-10-08 01:01:45,326][52059] Updated weights for policy 1, policy_version 28662 (0.0011) [2023-10-08 01:01:45,692][52059] Updated weights for policy 1, policy_version 28672 (0.0011) [2023-10-08 01:01:46,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 58327040. Throughput: 0: 1719.2, 1: 1739.2. Samples: 14591074. Policy #0 lag: (min: 2.0, avg: 4.7, max: 34.0) [2023-10-08 01:01:46,211][50642] Avg episode reward: [(0, '19.910'), (1, '21.150')] [2023-10-08 01:01:46,821][52060] Updated weights for policy 0, policy_version 28290 (0.0009) [2023-10-08 01:01:47,186][52060] Updated weights for policy 0, policy_version 28300 (0.0009) [2023-10-08 01:01:47,556][52060] Updated weights for policy 0, policy_version 28310 (0.0009) [2023-10-08 01:01:47,925][52060] Updated weights for policy 0, policy_version 28320 (0.0008) [2023-10-08 01:01:49,825][52059] Updated weights for policy 1, policy_version 28682 (0.0009) [2023-10-08 01:01:50,197][52059] Updated weights for policy 1, policy_version 28692 (0.0010) [2023-10-08 01:01:50,557][52059] Updated weights for policy 1, policy_version 28702 (0.0008) [2023-10-08 01:01:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 58392576. Throughput: 0: 1694.1, 1: 1770.2. Samples: 14601766. Policy #0 lag: (min: 7.0, avg: 14.6, max: 39.0) [2023-10-08 01:01:51,211][50642] Avg episode reward: [(0, '19.640'), (1, '18.960')] [2023-10-08 01:01:51,969][52060] Updated weights for policy 0, policy_version 28330 (0.0007) [2023-10-08 01:01:52,338][52060] Updated weights for policy 0, policy_version 28340 (0.0008) [2023-10-08 01:01:52,706][52060] Updated weights for policy 0, policy_version 28350 (0.0008) [2023-10-08 01:01:54,252][52059] Updated weights for policy 1, policy_version 28712 (0.0007) [2023-10-08 01:01:54,616][52059] Updated weights for policy 1, policy_version 28722 (0.0007) [2023-10-08 01:01:54,987][52059] Updated weights for policy 1, policy_version 28732 (0.0008) [2023-10-08 01:01:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 58458112. Throughput: 0: 1730.9, 1: 1745.8. Samples: 14622602. Policy #0 lag: (min: 7.0, avg: 14.6, max: 39.0) [2023-10-08 01:01:56,211][50642] Avg episode reward: [(0, '17.770'), (1, '16.460')] [2023-10-08 01:01:56,573][52060] Updated weights for policy 0, policy_version 28360 (0.0008) [2023-10-08 01:01:56,940][52060] Updated weights for policy 0, policy_version 28370 (0.0008) [2023-10-08 01:01:57,315][52060] Updated weights for policy 0, policy_version 28380 (0.0007) [2023-10-08 01:01:59,001][52059] Updated weights for policy 1, policy_version 28742 (0.0008) [2023-10-08 01:01:59,367][52059] Updated weights for policy 1, policy_version 28752 (0.0008) [2023-10-08 01:01:59,729][52059] Updated weights for policy 1, policy_version 28762 (0.0007) [2023-10-08 01:02:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 58523648. Throughput: 0: 1730.3, 1: 1733.2. Samples: 14643298. Policy #0 lag: (min: 7.0, avg: 14.6, max: 39.0) [2023-10-08 01:02:01,211][50642] Avg episode reward: [(0, '19.660'), (1, '20.870')] [2023-10-08 01:02:01,303][52060] Updated weights for policy 0, policy_version 28390 (0.0007) [2023-10-08 01:02:01,669][52060] Updated weights for policy 0, policy_version 28400 (0.0007) [2023-10-08 01:02:02,032][52060] Updated weights for policy 0, policy_version 28410 (0.0009) [2023-10-08 01:02:03,558][52059] Updated weights for policy 1, policy_version 28772 (0.0007) [2023-10-08 01:02:03,932][52059] Updated weights for policy 1, policy_version 28782 (0.0007) [2023-10-08 01:02:04,296][52059] Updated weights for policy 1, policy_version 28792 (0.0007) [2023-10-08 01:02:06,089][52060] Updated weights for policy 0, policy_version 28420 (0.0010) [2023-10-08 01:02:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 58589184. Throughput: 0: 1711.7, 1: 1754.9. Samples: 14653684. Policy #0 lag: (min: 7.0, avg: 14.6, max: 39.0) [2023-10-08 01:02:06,211][50642] Avg episode reward: [(0, '18.730'), (1, '19.220')] [2023-10-08 01:02:06,448][52060] Updated weights for policy 0, policy_version 28430 (0.0007) [2023-10-08 01:02:06,822][52060] Updated weights for policy 0, policy_version 28440 (0.0007) [2023-10-08 01:02:08,254][52059] Updated weights for policy 1, policy_version 28802 (0.0008) [2023-10-08 01:02:08,618][52059] Updated weights for policy 1, policy_version 28812 (0.0009) [2023-10-08 01:02:08,976][52059] Updated weights for policy 1, policy_version 28822 (0.0011) [2023-10-08 01:02:09,339][52059] Updated weights for policy 1, policy_version 28832 (0.0009) [2023-10-08 01:02:10,590][52060] Updated weights for policy 0, policy_version 28450 (0.0008) [2023-10-08 01:02:10,959][52060] Updated weights for policy 0, policy_version 28460 (0.0007) [2023-10-08 01:02:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 58654720. Throughput: 0: 1732.4, 1: 1732.5. Samples: 14674424. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:02:11,211][50642] Avg episode reward: [(0, '17.390'), (1, '17.620')] [2023-10-08 01:02:11,328][52060] Updated weights for policy 0, policy_version 28470 (0.0008) [2023-10-08 01:02:11,701][52060] Updated weights for policy 0, policy_version 28480 (0.0010) [2023-10-08 01:02:13,315][52059] Updated weights for policy 1, policy_version 28842 (0.0010) [2023-10-08 01:02:13,679][52059] Updated weights for policy 1, policy_version 28852 (0.0010) [2023-10-08 01:02:14,040][52059] Updated weights for policy 1, policy_version 28862 (0.0007) [2023-10-08 01:02:15,863][52060] Updated weights for policy 0, policy_version 28490 (0.0009) [2023-10-08 01:02:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 58720256. Throughput: 0: 1722.9, 1: 1740.0. Samples: 14695444. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:02:16,211][50642] Avg episode reward: [(0, '19.990'), (1, '19.970')] [2023-10-08 01:02:16,231][52060] Updated weights for policy 0, policy_version 28500 (0.0008) [2023-10-08 01:02:16,602][52060] Updated weights for policy 0, policy_version 28510 (0.0009) [2023-10-08 01:02:17,786][52059] Updated weights for policy 1, policy_version 28872 (0.0007) [2023-10-08 01:02:18,147][52059] Updated weights for policy 1, policy_version 28882 (0.0009) [2023-10-08 01:02:18,522][52059] Updated weights for policy 1, policy_version 28892 (0.0008) [2023-10-08 01:02:20,614][52060] Updated weights for policy 0, policy_version 28520 (0.0008) [2023-10-08 01:02:20,985][52060] Updated weights for policy 0, policy_version 28530 (0.0008) [2023-10-08 01:02:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 58785792. Throughput: 0: 1729.7, 1: 1732.1. Samples: 14705212. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:02:21,211][50642] Avg episode reward: [(0, '16.700'), (1, '21.110')] [2023-10-08 01:02:21,357][52060] Updated weights for policy 0, policy_version 28540 (0.0010) [2023-10-08 01:02:22,428][52059] Updated weights for policy 1, policy_version 28902 (0.0008) [2023-10-08 01:02:22,793][52059] Updated weights for policy 1, policy_version 28912 (0.0008) [2023-10-08 01:02:23,150][52059] Updated weights for policy 1, policy_version 28922 (0.0009) [2023-10-08 01:02:25,356][52060] Updated weights for policy 0, policy_version 28550 (0.0010) [2023-10-08 01:02:25,731][52060] Updated weights for policy 0, policy_version 28560 (0.0009) [2023-10-08 01:02:26,108][52060] Updated weights for policy 0, policy_version 28570 (0.0009) [2023-10-08 01:02:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 58851328. Throughput: 0: 1728.7, 1: 1734.0. Samples: 14726742. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:02:26,211][50642] Avg episode reward: [(0, '17.860'), (1, '17.180')] [2023-10-08 01:02:27,048][52059] Updated weights for policy 1, policy_version 28932 (0.0009) [2023-10-08 01:02:27,425][52059] Updated weights for policy 1, policy_version 28942 (0.0007) [2023-10-08 01:02:27,780][52059] Updated weights for policy 1, policy_version 28952 (0.0009) [2023-10-08 01:02:30,127][52060] Updated weights for policy 0, policy_version 28580 (0.0008) [2023-10-08 01:02:30,500][52060] Updated weights for policy 0, policy_version 28590 (0.0007) [2023-10-08 01:02:30,865][52060] Updated weights for policy 0, policy_version 28600 (0.0009) [2023-10-08 01:02:31,210][50642] Fps is (10 sec: 16383.6, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 58949632. Throughput: 0: 1706.7, 1: 1762.8. Samples: 14747200. Policy #0 lag: (min: 17.0, avg: 28.5, max: 49.0) [2023-10-08 01:02:31,211][50642] Avg episode reward: [(0, '19.740'), (1, '17.330')] [2023-10-08 01:02:31,810][52059] Updated weights for policy 1, policy_version 28962 (0.0010) [2023-10-08 01:02:32,183][52059] Updated weights for policy 1, policy_version 28972 (0.0010) [2023-10-08 01:02:32,553][52059] Updated weights for policy 1, policy_version 28982 (0.0007) [2023-10-08 01:02:32,922][52059] Updated weights for policy 1, policy_version 28992 (0.0009) [2023-10-08 01:02:34,724][52060] Updated weights for policy 0, policy_version 28610 (0.0010) [2023-10-08 01:02:35,099][52060] Updated weights for policy 0, policy_version 28620 (0.0009) [2023-10-08 01:02:35,467][52060] Updated weights for policy 0, policy_version 28630 (0.0008) [2023-10-08 01:02:35,833][52060] Updated weights for policy 0, policy_version 28640 (0.0008) [2023-10-08 01:02:36,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 59015168. Throughput: 0: 1728.4, 1: 1731.9. Samples: 14757478. Policy #0 lag: (min: 17.0, avg: 28.5, max: 49.0) [2023-10-08 01:02:36,211][50642] Avg episode reward: [(0, '18.260'), (1, '21.830')] [2023-10-08 01:02:36,832][52059] Updated weights for policy 1, policy_version 29002 (0.0009) [2023-10-08 01:02:37,200][52059] Updated weights for policy 1, policy_version 29012 (0.0009) [2023-10-08 01:02:37,561][52059] Updated weights for policy 1, policy_version 29022 (0.0010) [2023-10-08 01:02:39,781][52060] Updated weights for policy 0, policy_version 28650 (0.0009) [2023-10-08 01:02:40,149][52060] Updated weights for policy 0, policy_version 28660 (0.0010) [2023-10-08 01:02:40,524][52060] Updated weights for policy 0, policy_version 28670 (0.0008) [2023-10-08 01:02:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 59080704. Throughput: 0: 1712.6, 1: 1742.5. Samples: 14778080. Policy #0 lag: (min: 17.0, avg: 28.5, max: 49.0) [2023-10-08 01:02:41,211][50642] Avg episode reward: [(0, '19.080'), (1, '16.790')] [2023-10-08 01:02:41,477][52059] Updated weights for policy 1, policy_version 29032 (0.0008) [2023-10-08 01:02:41,850][52059] Updated weights for policy 1, policy_version 29042 (0.0007) [2023-10-08 01:02:42,209][52059] Updated weights for policy 1, policy_version 29052 (0.0008) [2023-10-08 01:02:44,564][52060] Updated weights for policy 0, policy_version 28680 (0.0010) [2023-10-08 01:02:44,935][52060] Updated weights for policy 0, policy_version 28690 (0.0009) [2023-10-08 01:02:45,303][52060] Updated weights for policy 0, policy_version 28700 (0.0007) [2023-10-08 01:02:46,012][52059] Updated weights for policy 1, policy_version 29062 (0.0008) [2023-10-08 01:02:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 59146240. Throughput: 0: 1697.1, 1: 1762.0. Samples: 14798956. Policy #0 lag: (min: 17.0, avg: 28.5, max: 49.0) [2023-10-08 01:02:46,211][50642] Avg episode reward: [(0, '18.980'), (1, '16.830')] [2023-10-08 01:02:46,217][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000028704_29392896.pth... [2023-10-08 01:02:46,252][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000027104_27754496.pth [2023-10-08 01:02:46,372][52059] Updated weights for policy 1, policy_version 29072 (0.0010) [2023-10-08 01:02:46,743][52059] Updated weights for policy 1, policy_version 29082 (0.0010) [2023-10-08 01:02:46,958][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000029088_29786112.pth... [2023-10-08 01:02:46,995][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000027456_28114944.pth [2023-10-08 01:02:49,163][52060] Updated weights for policy 0, policy_version 28710 (0.0008) [2023-10-08 01:02:49,533][52060] Updated weights for policy 0, policy_version 28720 (0.0008) [2023-10-08 01:02:49,899][52060] Updated weights for policy 0, policy_version 28730 (0.0009) [2023-10-08 01:02:50,556][52059] Updated weights for policy 1, policy_version 29092 (0.0007) [2023-10-08 01:02:50,935][52059] Updated weights for policy 1, policy_version 29102 (0.0009) [2023-10-08 01:02:51,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 59211776. Throughput: 0: 1723.5, 1: 1740.0. Samples: 14809542. Policy #0 lag: (min: 17.0, avg: 28.5, max: 49.0) [2023-10-08 01:02:51,211][50642] Avg episode reward: [(0, '17.690'), (1, '20.480')] [2023-10-08 01:02:51,296][52059] Updated weights for policy 1, policy_version 29112 (0.0008) [2023-10-08 01:02:53,846][52060] Updated weights for policy 0, policy_version 28740 (0.0008) [2023-10-08 01:02:54,221][52060] Updated weights for policy 0, policy_version 28750 (0.0008) [2023-10-08 01:02:54,591][52060] Updated weights for policy 0, policy_version 28760 (0.0007) [2023-10-08 01:02:55,146][52059] Updated weights for policy 1, policy_version 29122 (0.0008) [2023-10-08 01:02:55,513][52059] Updated weights for policy 1, policy_version 29132 (0.0009) [2023-10-08 01:02:55,883][52059] Updated weights for policy 1, policy_version 29142 (0.0008) [2023-10-08 01:02:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 59277312. Throughput: 0: 1692.2, 1: 1765.2. Samples: 14830008. Policy #0 lag: (min: 3.0, avg: 7.3, max: 35.0) [2023-10-08 01:02:56,211][50642] Avg episode reward: [(0, '18.170'), (1, '20.630')] [2023-10-08 01:02:56,242][52059] Updated weights for policy 1, policy_version 29152 (0.0008) [2023-10-08 01:02:58,561][52060] Updated weights for policy 0, policy_version 28770 (0.0007) [2023-10-08 01:02:58,929][52060] Updated weights for policy 0, policy_version 28780 (0.0008) [2023-10-08 01:02:59,298][52060] Updated weights for policy 0, policy_version 28790 (0.0008) [2023-10-08 01:02:59,675][52060] Updated weights for policy 0, policy_version 28800 (0.0007) [2023-10-08 01:03:00,190][52059] Updated weights for policy 1, policy_version 29162 (0.0010) [2023-10-08 01:03:00,548][52059] Updated weights for policy 1, policy_version 29172 (0.0009) [2023-10-08 01:03:00,912][52059] Updated weights for policy 1, policy_version 29182 (0.0010) [2023-10-08 01:03:01,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 59375616. Throughput: 0: 1702.3, 1: 1736.1. Samples: 14850170. Policy #0 lag: (min: 3.0, avg: 7.3, max: 35.0) [2023-10-08 01:03:01,211][50642] Avg episode reward: [(0, '17.690'), (1, '17.100')] [2023-10-08 01:03:03,904][52060] Updated weights for policy 0, policy_version 28810 (0.0009) [2023-10-08 01:03:04,283][52060] Updated weights for policy 0, policy_version 28820 (0.0009) [2023-10-08 01:03:04,652][52060] Updated weights for policy 0, policy_version 28830 (0.0008) [2023-10-08 01:03:04,844][52059] Updated weights for policy 1, policy_version 29192 (0.0007) [2023-10-08 01:03:05,212][52059] Updated weights for policy 1, policy_version 29202 (0.0010) [2023-10-08 01:03:05,577][52059] Updated weights for policy 1, policy_version 29212 (0.0011) [2023-10-08 01:03:06,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 59441152. Throughput: 0: 1708.6, 1: 1758.4. Samples: 14861226. Policy #0 lag: (min: 3.0, avg: 7.3, max: 35.0) [2023-10-08 01:03:06,211][50642] Avg episode reward: [(0, '17.280'), (1, '18.020')] [2023-10-08 01:03:08,653][52060] Updated weights for policy 0, policy_version 28840 (0.0007) [2023-10-08 01:03:09,020][52060] Updated weights for policy 0, policy_version 28850 (0.0010) [2023-10-08 01:03:09,353][52059] Updated weights for policy 1, policy_version 29222 (0.0009) [2023-10-08 01:03:09,390][52060] Updated weights for policy 0, policy_version 28860 (0.0008) [2023-10-08 01:03:09,716][52059] Updated weights for policy 1, policy_version 29232 (0.0009) [2023-10-08 01:03:10,082][52059] Updated weights for policy 1, policy_version 29242 (0.0008) [2023-10-08 01:03:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 59506688. Throughput: 0: 1686.3, 1: 1746.7. Samples: 14881228. Policy #0 lag: (min: 3.0, avg: 7.3, max: 35.0) [2023-10-08 01:03:11,211][50642] Avg episode reward: [(0, '19.040'), (1, '21.010')] [2023-10-08 01:03:13,567][52060] Updated weights for policy 0, policy_version 28870 (0.0007) [2023-10-08 01:03:13,936][52060] Updated weights for policy 0, policy_version 28880 (0.0009) [2023-10-08 01:03:14,039][52059] Updated weights for policy 1, policy_version 29252 (0.0007) [2023-10-08 01:03:14,300][52060] Updated weights for policy 0, policy_version 28890 (0.0009) [2023-10-08 01:03:14,399][52059] Updated weights for policy 1, policy_version 29262 (0.0009) [2023-10-08 01:03:14,763][52059] Updated weights for policy 1, policy_version 29272 (0.0010) [2023-10-08 01:03:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 59572224. Throughput: 0: 1703.4, 1: 1732.9. Samples: 14901834. Policy #0 lag: (min: 30.0, avg: 33.6, max: 62.0) [2023-10-08 01:03:16,211][50642] Avg episode reward: [(0, '18.550'), (1, '18.320')] [2023-10-08 01:03:18,085][52060] Updated weights for policy 0, policy_version 28900 (0.0009) [2023-10-08 01:03:18,446][52060] Updated weights for policy 0, policy_version 28910 (0.0008) [2023-10-08 01:03:18,640][52059] Updated weights for policy 1, policy_version 29282 (0.0010) [2023-10-08 01:03:18,813][52060] Updated weights for policy 0, policy_version 28920 (0.0007) [2023-10-08 01:03:19,006][52059] Updated weights for policy 1, policy_version 29292 (0.0007) [2023-10-08 01:03:19,372][52059] Updated weights for policy 1, policy_version 29302 (0.0008) [2023-10-08 01:03:19,735][52059] Updated weights for policy 1, policy_version 29312 (0.0009) [2023-10-08 01:03:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 59637760. Throughput: 0: 1693.2, 1: 1758.4. Samples: 14912804. Policy #0 lag: (min: 30.0, avg: 33.6, max: 62.0) [2023-10-08 01:03:21,211][50642] Avg episode reward: [(0, '19.420'), (1, '17.320')] [2023-10-08 01:03:22,795][52060] Updated weights for policy 0, policy_version 28930 (0.0009) [2023-10-08 01:03:23,173][52060] Updated weights for policy 0, policy_version 28940 (0.0008) [2023-10-08 01:03:23,539][52060] Updated weights for policy 0, policy_version 28950 (0.0008) [2023-10-08 01:03:23,646][52059] Updated weights for policy 1, policy_version 29322 (0.0007) [2023-10-08 01:03:23,905][52060] Updated weights for policy 0, policy_version 28960 (0.0008) [2023-10-08 01:03:24,010][52059] Updated weights for policy 1, policy_version 29332 (0.0008) [2023-10-08 01:03:24,374][52059] Updated weights for policy 1, policy_version 29342 (0.0007) [2023-10-08 01:03:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 59703296. Throughput: 0: 1693.8, 1: 1739.4. Samples: 14932576. Policy #0 lag: (min: 30.0, avg: 33.6, max: 62.0) [2023-10-08 01:03:26,211][50642] Avg episode reward: [(0, '20.240'), (1, '18.260')] [2023-10-08 01:03:26,212][51605] Saving new best policy, reward=20.240! [2023-10-08 01:03:27,801][52060] Updated weights for policy 0, policy_version 28970 (0.0010) [2023-10-08 01:03:28,175][52060] Updated weights for policy 0, policy_version 28980 (0.0010) [2023-10-08 01:03:28,378][52059] Updated weights for policy 1, policy_version 29352 (0.0009) [2023-10-08 01:03:28,539][52060] Updated weights for policy 0, policy_version 28990 (0.0009) [2023-10-08 01:03:28,745][52059] Updated weights for policy 1, policy_version 29362 (0.0007) [2023-10-08 01:03:29,111][52059] Updated weights for policy 1, policy_version 29372 (0.0008) [2023-10-08 01:03:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 59768832. Throughput: 0: 1709.4, 1: 1729.3. Samples: 14953698. Policy #0 lag: (min: 30.0, avg: 33.6, max: 62.0) [2023-10-08 01:03:31,211][50642] Avg episode reward: [(0, '19.130'), (1, '18.550')] [2023-10-08 01:03:32,721][52060] Updated weights for policy 0, policy_version 29000 (0.0008) [2023-10-08 01:03:33,085][52060] Updated weights for policy 0, policy_version 29010 (0.0007) [2023-10-08 01:03:33,124][52059] Updated weights for policy 1, policy_version 29382 (0.0007) [2023-10-08 01:03:33,455][52060] Updated weights for policy 0, policy_version 29020 (0.0009) [2023-10-08 01:03:33,482][52059] Updated weights for policy 1, policy_version 29392 (0.0008) [2023-10-08 01:03:33,847][52059] Updated weights for policy 1, policy_version 29402 (0.0008) [2023-10-08 01:03:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 59834368. Throughput: 0: 1679.8, 1: 1733.9. Samples: 14963156. Policy #0 lag: (min: 30.0, avg: 33.6, max: 62.0) [2023-10-08 01:03:36,211][50642] Avg episode reward: [(0, '19.780'), (1, '16.750')] [2023-10-08 01:03:37,464][52060] Updated weights for policy 0, policy_version 29030 (0.0007) [2023-10-08 01:03:37,763][52059] Updated weights for policy 1, policy_version 29412 (0.0009) [2023-10-08 01:03:37,833][52060] Updated weights for policy 0, policy_version 29040 (0.0008) [2023-10-08 01:03:38,122][52059] Updated weights for policy 1, policy_version 29422 (0.0009) [2023-10-08 01:03:38,202][52060] Updated weights for policy 0, policy_version 29050 (0.0007) [2023-10-08 01:03:38,492][52059] Updated weights for policy 1, policy_version 29432 (0.0008) [2023-10-08 01:03:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 59899904. Throughput: 0: 1706.5, 1: 1721.7. Samples: 14984280. Policy #0 lag: (min: 21.0, avg: 26.9, max: 53.0) [2023-10-08 01:03:41,211][50642] Avg episode reward: [(0, '20.020'), (1, '18.360')] [2023-10-08 01:03:42,129][52060] Updated weights for policy 0, policy_version 29060 (0.0009) [2023-10-08 01:03:42,504][52060] Updated weights for policy 0, policy_version 29070 (0.0008) [2023-10-08 01:03:42,584][52059] Updated weights for policy 1, policy_version 29442 (0.0007) [2023-10-08 01:03:42,861][52060] Updated weights for policy 0, policy_version 29080 (0.0009) [2023-10-08 01:03:42,955][52059] Updated weights for policy 1, policy_version 29452 (0.0008) [2023-10-08 01:03:43,314][52059] Updated weights for policy 1, policy_version 29462 (0.0008) [2023-10-08 01:03:43,677][52059] Updated weights for policy 1, policy_version 29472 (0.0008) [2023-10-08 01:03:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 59965440. Throughput: 0: 1712.4, 1: 1745.7. Samples: 15005784. Policy #0 lag: (min: 21.0, avg: 26.9, max: 53.0) [2023-10-08 01:03:46,211][50642] Avg episode reward: [(0, '18.320'), (1, '18.850')] [2023-10-08 01:03:46,553][52060] Updated weights for policy 0, policy_version 29090 (0.0009) [2023-10-08 01:03:46,918][52060] Updated weights for policy 0, policy_version 29100 (0.0010) [2023-10-08 01:03:47,289][52060] Updated weights for policy 0, policy_version 29110 (0.0007) [2023-10-08 01:03:47,516][52059] Updated weights for policy 1, policy_version 29482 (0.0010) [2023-10-08 01:03:47,657][52060] Updated weights for policy 0, policy_version 29120 (0.0010) [2023-10-08 01:03:47,877][52059] Updated weights for policy 1, policy_version 29492 (0.0009) [2023-10-08 01:03:48,229][52059] Updated weights for policy 1, policy_version 29502 (0.0007) [2023-10-08 01:03:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 60030976. Throughput: 0: 1695.2, 1: 1721.9. Samples: 15015000. Policy #0 lag: (min: 21.0, avg: 26.9, max: 53.0) [2023-10-08 01:03:51,212][50642] Avg episode reward: [(0, '19.720'), (1, '18.440')] [2023-10-08 01:03:51,744][52060] Updated weights for policy 0, policy_version 29130 (0.0011) [2023-10-08 01:03:52,110][52060] Updated weights for policy 0, policy_version 29140 (0.0008) [2023-10-08 01:03:52,204][52059] Updated weights for policy 1, policy_version 29512 (0.0007) [2023-10-08 01:03:52,476][52060] Updated weights for policy 0, policy_version 29150 (0.0007) [2023-10-08 01:03:52,567][52059] Updated weights for policy 1, policy_version 29522 (0.0007) [2023-10-08 01:03:52,928][52059] Updated weights for policy 1, policy_version 29532 (0.0007) [2023-10-08 01:03:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 60096512. Throughput: 0: 1714.5, 1: 1736.4. Samples: 15036518. Policy #0 lag: (min: 21.0, avg: 26.9, max: 53.0) [2023-10-08 01:03:56,211][50642] Avg episode reward: [(0, '20.020'), (1, '19.250')] [2023-10-08 01:03:56,477][52060] Updated weights for policy 0, policy_version 29160 (0.0007) [2023-10-08 01:03:56,732][52059] Updated weights for policy 1, policy_version 29542 (0.0009) [2023-10-08 01:03:56,854][52060] Updated weights for policy 0, policy_version 29170 (0.0009) [2023-10-08 01:03:57,107][52059] Updated weights for policy 1, policy_version 29552 (0.0007) [2023-10-08 01:03:57,215][52060] Updated weights for policy 0, policy_version 29180 (0.0009) [2023-10-08 01:03:57,477][52059] Updated weights for policy 1, policy_version 29562 (0.0007) [2023-10-08 01:04:01,207][52060] Updated weights for policy 0, policy_version 29190 (0.0008) [2023-10-08 01:04:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 60162048. Throughput: 0: 1717.8, 1: 1751.2. Samples: 15057940. Policy #0 lag: (min: 21.0, avg: 26.9, max: 53.0) [2023-10-08 01:04:01,211][50642] Avg episode reward: [(0, '18.540'), (1, '19.820')] [2023-10-08 01:04:01,342][52059] Updated weights for policy 1, policy_version 29572 (0.0008) [2023-10-08 01:04:01,572][52060] Updated weights for policy 0, policy_version 29200 (0.0007) [2023-10-08 01:04:01,711][52059] Updated weights for policy 1, policy_version 29582 (0.0007) [2023-10-08 01:04:01,949][52060] Updated weights for policy 0, policy_version 29210 (0.0008) [2023-10-08 01:04:02,065][52059] Updated weights for policy 1, policy_version 29592 (0.0008) [2023-10-08 01:04:05,887][52060] Updated weights for policy 0, policy_version 29220 (0.0007) [2023-10-08 01:04:05,964][52059] Updated weights for policy 1, policy_version 29602 (0.0010) [2023-10-08 01:04:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 60227584. Throughput: 0: 1706.8, 1: 1725.5. Samples: 15067254. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 01:04:06,211][50642] Avg episode reward: [(0, '21.360'), (1, '18.760')] [2023-10-08 01:04:06,264][52060] Updated weights for policy 0, policy_version 29230 (0.0008) [2023-10-08 01:04:06,323][52059] Updated weights for policy 1, policy_version 29612 (0.0008) [2023-10-08 01:04:06,640][52060] Updated weights for policy 0, policy_version 29240 (0.0009) [2023-10-08 01:04:06,690][52059] Updated weights for policy 1, policy_version 29622 (0.0007) [2023-10-08 01:04:06,925][51605] Saving new best policy, reward=21.360! [2023-10-08 01:04:07,055][52059] Updated weights for policy 1, policy_version 29632 (0.0007) [2023-10-08 01:04:10,574][52060] Updated weights for policy 0, policy_version 29250 (0.0008) [2023-10-08 01:04:10,934][52060] Updated weights for policy 0, policy_version 29260 (0.0010) [2023-10-08 01:04:11,123][52059] Updated weights for policy 1, policy_version 29642 (0.0008) [2023-10-08 01:04:11,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 60293120. Throughput: 0: 1719.8, 1: 1748.0. Samples: 15088626. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 01:04:11,211][50642] Avg episode reward: [(0, '18.850'), (1, '19.910')] [2023-10-08 01:04:11,297][52060] Updated weights for policy 0, policy_version 29270 (0.0007) [2023-10-08 01:04:11,476][52059] Updated weights for policy 1, policy_version 29652 (0.0007) [2023-10-08 01:04:11,657][52060] Updated weights for policy 0, policy_version 29280 (0.0008) [2023-10-08 01:04:11,842][52059] Updated weights for policy 1, policy_version 29662 (0.0008) [2023-10-08 01:04:15,632][52060] Updated weights for policy 0, policy_version 29290 (0.0008) [2023-10-08 01:04:15,759][52059] Updated weights for policy 1, policy_version 29672 (0.0009) [2023-10-08 01:04:16,008][52060] Updated weights for policy 0, policy_version 29300 (0.0009) [2023-10-08 01:04:16,118][52059] Updated weights for policy 1, policy_version 29682 (0.0009) [2023-10-08 01:04:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 60358656. Throughput: 0: 1706.1, 1: 1741.0. Samples: 15108818. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 01:04:16,211][50642] Avg episode reward: [(0, '18.480'), (1, '19.130')] [2023-10-08 01:04:16,367][52060] Updated weights for policy 0, policy_version 29310 (0.0008) [2023-10-08 01:04:16,492][52059] Updated weights for policy 1, policy_version 29692 (0.0009) [2023-10-08 01:04:20,267][52060] Updated weights for policy 0, policy_version 29320 (0.0010) [2023-10-08 01:04:20,382][52059] Updated weights for policy 1, policy_version 29702 (0.0008) [2023-10-08 01:04:20,633][52060] Updated weights for policy 0, policy_version 29330 (0.0009) [2023-10-08 01:04:20,753][52059] Updated weights for policy 1, policy_version 29712 (0.0008) [2023-10-08 01:04:21,002][52060] Updated weights for policy 0, policy_version 29340 (0.0009) [2023-10-08 01:04:21,109][52059] Updated weights for policy 1, policy_version 29722 (0.0008) [2023-10-08 01:04:21,210][50642] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 60456960. Throughput: 0: 1726.0, 1: 1743.3. Samples: 15119272. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 01:04:21,211][50642] Avg episode reward: [(0, '21.270'), (1, '18.770')] [2023-10-08 01:04:25,110][52059] Updated weights for policy 1, policy_version 29732 (0.0007) [2023-10-08 01:04:25,136][52060] Updated weights for policy 0, policy_version 29350 (0.0009) [2023-10-08 01:04:25,478][52059] Updated weights for policy 1, policy_version 29742 (0.0008) [2023-10-08 01:04:25,498][52060] Updated weights for policy 0, policy_version 29360 (0.0009) [2023-10-08 01:04:25,834][52059] Updated weights for policy 1, policy_version 29752 (0.0009) [2023-10-08 01:04:25,873][52060] Updated weights for policy 0, policy_version 29370 (0.0009) [2023-10-08 01:04:26,210][50642] Fps is (10 sec: 19661.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 60555264. Throughput: 0: 1720.5, 1: 1748.8. Samples: 15140396. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:04:26,211][50642] Avg episode reward: [(0, '18.530'), (1, '18.550')] [2023-10-08 01:04:29,760][52060] Updated weights for policy 0, policy_version 29380 (0.0009) [2023-10-08 01:04:29,795][52059] Updated weights for policy 1, policy_version 29762 (0.0009) [2023-10-08 01:04:30,127][52060] Updated weights for policy 0, policy_version 29390 (0.0007) [2023-10-08 01:04:30,164][52059] Updated weights for policy 1, policy_version 29772 (0.0008) [2023-10-08 01:04:30,493][52060] Updated weights for policy 0, policy_version 29400 (0.0008) [2023-10-08 01:04:30,516][52059] Updated weights for policy 1, policy_version 29782 (0.0007) [2023-10-08 01:04:30,888][52059] Updated weights for policy 1, policy_version 29792 (0.0008) [2023-10-08 01:04:31,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 60620800. Throughput: 0: 1689.1, 1: 1719.3. Samples: 15159162. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:04:31,211][50642] Avg episode reward: [(0, '19.030'), (1, '18.730')] [2023-10-08 01:04:34,493][52060] Updated weights for policy 0, policy_version 29410 (0.0008) [2023-10-08 01:04:34,774][52059] Updated weights for policy 1, policy_version 29802 (0.0010) [2023-10-08 01:04:34,860][52060] Updated weights for policy 0, policy_version 29420 (0.0007) [2023-10-08 01:04:35,142][52059] Updated weights for policy 1, policy_version 29812 (0.0007) [2023-10-08 01:04:35,229][52060] Updated weights for policy 0, policy_version 29430 (0.0010) [2023-10-08 01:04:35,506][52059] Updated weights for policy 1, policy_version 29822 (0.0008) [2023-10-08 01:04:35,600][52060] Updated weights for policy 0, policy_version 29440 (0.0008) [2023-10-08 01:04:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 60686336. Throughput: 0: 1719.8, 1: 1748.3. Samples: 15171064. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:04:36,211][50642] Avg episode reward: [(0, '21.230'), (1, '17.090')] [2023-10-08 01:04:39,333][52059] Updated weights for policy 1, policy_version 29832 (0.0008) [2023-10-08 01:04:39,698][52059] Updated weights for policy 1, policy_version 29842 (0.0008) [2023-10-08 01:04:39,782][52060] Updated weights for policy 0, policy_version 29450 (0.0009) [2023-10-08 01:04:40,070][52059] Updated weights for policy 1, policy_version 29852 (0.0008) [2023-10-08 01:04:40,157][52060] Updated weights for policy 0, policy_version 29460 (0.0008) [2023-10-08 01:04:40,526][52060] Updated weights for policy 0, policy_version 29470 (0.0009) [2023-10-08 01:04:41,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 60751872. Throughput: 0: 1709.4, 1: 1730.3. Samples: 15191308. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:04:41,211][50642] Avg episode reward: [(0, '18.680'), (1, '18.520')] [2023-10-08 01:04:43,910][52059] Updated weights for policy 1, policy_version 29862 (0.0008) [2023-10-08 01:04:44,271][52059] Updated weights for policy 1, policy_version 29872 (0.0009) [2023-10-08 01:04:44,553][52060] Updated weights for policy 0, policy_version 29480 (0.0007) [2023-10-08 01:04:44,634][52059] Updated weights for policy 1, policy_version 29882 (0.0010) [2023-10-08 01:04:44,929][52060] Updated weights for policy 0, policy_version 29490 (0.0008) [2023-10-08 01:04:45,305][52060] Updated weights for policy 0, policy_version 29500 (0.0008) [2023-10-08 01:04:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 60817408. Throughput: 0: 1686.5, 1: 1721.3. Samples: 15211294. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:04:46,211][50642] Avg episode reward: [(0, '19.190'), (1, '19.710')] [2023-10-08 01:04:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000029504_30212096.pth... [2023-10-08 01:04:46,218][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000029888_30605312.pth... [2023-10-08 01:04:46,255][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000027904_28573696.pth [2023-10-08 01:04:46,258][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000028256_28934144.pth [2023-10-08 01:04:48,408][52059] Updated weights for policy 1, policy_version 29892 (0.0010) [2023-10-08 01:04:48,773][52059] Updated weights for policy 1, policy_version 29902 (0.0007) [2023-10-08 01:04:49,134][52059] Updated weights for policy 1, policy_version 29912 (0.0008) [2023-10-08 01:04:49,380][52060] Updated weights for policy 0, policy_version 29510 (0.0008) [2023-10-08 01:04:49,747][52060] Updated weights for policy 0, policy_version 29520 (0.0010) [2023-10-08 01:04:50,113][52060] Updated weights for policy 0, policy_version 29530 (0.0010) [2023-10-08 01:04:51,211][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 60882944. Throughput: 0: 1717.6, 1: 1738.5. Samples: 15222780. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 01:04:51,212][50642] Avg episode reward: [(0, '21.260'), (1, '18.380')] [2023-10-08 01:04:53,046][52059] Updated weights for policy 1, policy_version 29922 (0.0010) [2023-10-08 01:04:53,410][52059] Updated weights for policy 1, policy_version 29932 (0.0007) [2023-10-08 01:04:53,777][52059] Updated weights for policy 1, policy_version 29942 (0.0008) [2023-10-08 01:04:54,132][52060] Updated weights for policy 0, policy_version 29540 (0.0009) [2023-10-08 01:04:54,144][52059] Updated weights for policy 1, policy_version 29952 (0.0007) [2023-10-08 01:04:54,509][52060] Updated weights for policy 0, policy_version 29550 (0.0007) [2023-10-08 01:04:54,878][52060] Updated weights for policy 0, policy_version 29560 (0.0009) [2023-10-08 01:04:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 60948480. Throughput: 0: 1698.3, 1: 1723.1. Samples: 15242590. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 01:04:56,211][50642] Avg episode reward: [(0, '19.420'), (1, '21.090')] [2023-10-08 01:04:57,862][52059] Updated weights for policy 1, policy_version 29962 (0.0010) [2023-10-08 01:04:58,222][52059] Updated weights for policy 1, policy_version 29972 (0.0008) [2023-10-08 01:04:58,596][52059] Updated weights for policy 1, policy_version 29982 (0.0008) [2023-10-08 01:04:58,902][52060] Updated weights for policy 0, policy_version 29570 (0.0008) [2023-10-08 01:04:59,270][52060] Updated weights for policy 0, policy_version 29580 (0.0009) [2023-10-08 01:04:59,642][52060] Updated weights for policy 0, policy_version 29590 (0.0008) [2023-10-08 01:05:00,012][52060] Updated weights for policy 0, policy_version 29600 (0.0008) [2023-10-08 01:05:01,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 61014016. Throughput: 0: 1704.3, 1: 1737.1. Samples: 15263678. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 01:05:01,211][50642] Avg episode reward: [(0, '20.300'), (1, '20.120')] [2023-10-08 01:05:02,721][52059] Updated weights for policy 1, policy_version 29992 (0.0008) [2023-10-08 01:05:03,091][52059] Updated weights for policy 1, policy_version 30002 (0.0010) [2023-10-08 01:05:03,461][52059] Updated weights for policy 1, policy_version 30012 (0.0010) [2023-10-08 01:05:03,881][52060] Updated weights for policy 0, policy_version 29610 (0.0010) [2023-10-08 01:05:04,255][52060] Updated weights for policy 0, policy_version 29620 (0.0008) [2023-10-08 01:05:04,621][52060] Updated weights for policy 0, policy_version 29630 (0.0008) [2023-10-08 01:05:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 61079552. Throughput: 0: 1712.8, 1: 1725.0. Samples: 15273974. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 01:05:06,211][50642] Avg episode reward: [(0, '20.060'), (1, '18.270')] [2023-10-08 01:05:07,183][52059] Updated weights for policy 1, policy_version 30022 (0.0007) [2023-10-08 01:05:07,555][52059] Updated weights for policy 1, policy_version 30032 (0.0008) [2023-10-08 01:05:07,928][52059] Updated weights for policy 1, policy_version 30042 (0.0009) [2023-10-08 01:05:08,547][52060] Updated weights for policy 0, policy_version 29640 (0.0010) [2023-10-08 01:05:08,910][52060] Updated weights for policy 0, policy_version 29650 (0.0007) [2023-10-08 01:05:09,294][52060] Updated weights for policy 0, policy_version 29660 (0.0009) [2023-10-08 01:05:11,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 61145088. Throughput: 0: 1696.0, 1: 1728.1. Samples: 15294484. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 01:05:11,211][50642] Avg episode reward: [(0, '19.090'), (1, '18.080')] [2023-10-08 01:05:11,903][52059] Updated weights for policy 1, policy_version 30052 (0.0007) [2023-10-08 01:05:12,271][52059] Updated weights for policy 1, policy_version 30062 (0.0009) [2023-10-08 01:05:12,637][52059] Updated weights for policy 1, policy_version 30072 (0.0008) [2023-10-08 01:05:13,317][52060] Updated weights for policy 0, policy_version 29670 (0.0011) [2023-10-08 01:05:13,673][52060] Updated weights for policy 0, policy_version 29680 (0.0010) [2023-10-08 01:05:14,041][52060] Updated weights for policy 0, policy_version 29690 (0.0010) [2023-10-08 01:05:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 61210624. Throughput: 0: 1722.8, 1: 1760.6. Samples: 15315914. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 01:05:16,211][50642] Avg episode reward: [(0, '19.290'), (1, '17.550')] [2023-10-08 01:05:16,588][52059] Updated weights for policy 1, policy_version 30082 (0.0007) [2023-10-08 01:05:16,955][52059] Updated weights for policy 1, policy_version 30092 (0.0007) [2023-10-08 01:05:17,316][52059] Updated weights for policy 1, policy_version 30102 (0.0007) [2023-10-08 01:05:17,676][52059] Updated weights for policy 1, policy_version 30112 (0.0008) [2023-10-08 01:05:18,035][52060] Updated weights for policy 0, policy_version 29700 (0.0010) [2023-10-08 01:05:18,401][52060] Updated weights for policy 0, policy_version 29710 (0.0008) [2023-10-08 01:05:18,773][52060] Updated weights for policy 0, policy_version 29720 (0.0008) [2023-10-08 01:05:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 61276160. Throughput: 0: 1702.4, 1: 1733.3. Samples: 15325670. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 01:05:21,211][50642] Avg episode reward: [(0, '19.660'), (1, '19.230')] [2023-10-08 01:05:21,620][52059] Updated weights for policy 1, policy_version 30122 (0.0011) [2023-10-08 01:05:21,986][52059] Updated weights for policy 1, policy_version 30132 (0.0009) [2023-10-08 01:05:22,353][52059] Updated weights for policy 1, policy_version 30142 (0.0007) [2023-10-08 01:05:22,758][52060] Updated weights for policy 0, policy_version 29730 (0.0008) [2023-10-08 01:05:23,130][52060] Updated weights for policy 0, policy_version 29740 (0.0008) [2023-10-08 01:05:23,498][52060] Updated weights for policy 0, policy_version 29750 (0.0008) [2023-10-08 01:05:23,870][52060] Updated weights for policy 0, policy_version 29760 (0.0008) [2023-10-08 01:05:26,112][52059] Updated weights for policy 1, policy_version 30152 (0.0008) [2023-10-08 01:05:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 61341696. Throughput: 0: 1708.0, 1: 1750.1. Samples: 15346918. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 01:05:26,211][50642] Avg episode reward: [(0, '18.340'), (1, '16.420')] [2023-10-08 01:05:26,492][52059] Updated weights for policy 1, policy_version 30162 (0.0008) [2023-10-08 01:05:26,857][52059] Updated weights for policy 1, policy_version 30172 (0.0009) [2023-10-08 01:05:27,722][52060] Updated weights for policy 0, policy_version 29770 (0.0010) [2023-10-08 01:05:28,094][52060] Updated weights for policy 0, policy_version 29780 (0.0009) [2023-10-08 01:05:28,464][52060] Updated weights for policy 0, policy_version 29790 (0.0009) [2023-10-08 01:05:30,808][52059] Updated weights for policy 1, policy_version 30182 (0.0010) [2023-10-08 01:05:31,183][52059] Updated weights for policy 1, policy_version 30192 (0.0008) [2023-10-08 01:05:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 61407232. Throughput: 0: 1728.0, 1: 1749.7. Samples: 15367792. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 01:05:31,211][50642] Avg episode reward: [(0, '18.920'), (1, '16.850')] [2023-10-08 01:05:31,542][52059] Updated weights for policy 1, policy_version 30202 (0.0008) [2023-10-08 01:05:32,535][52060] Updated weights for policy 0, policy_version 29800 (0.0010) [2023-10-08 01:05:32,912][52060] Updated weights for policy 0, policy_version 29810 (0.0009) [2023-10-08 01:05:33,285][52060] Updated weights for policy 0, policy_version 29820 (0.0008) [2023-10-08 01:05:35,478][52059] Updated weights for policy 1, policy_version 30212 (0.0010) [2023-10-08 01:05:35,848][52059] Updated weights for policy 1, policy_version 30222 (0.0010) [2023-10-08 01:05:36,207][52059] Updated weights for policy 1, policy_version 30232 (0.0009) [2023-10-08 01:05:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 61472768. Throughput: 0: 1695.0, 1: 1741.4. Samples: 15377420. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 01:05:36,211][50642] Avg episode reward: [(0, '20.040'), (1, '19.360')] [2023-10-08 01:05:37,124][52060] Updated weights for policy 0, policy_version 29830 (0.0009) [2023-10-08 01:05:37,501][52060] Updated weights for policy 0, policy_version 29840 (0.0009) [2023-10-08 01:05:37,864][52060] Updated weights for policy 0, policy_version 29850 (0.0007) [2023-10-08 01:05:40,055][52059] Updated weights for policy 1, policy_version 30242 (0.0008) [2023-10-08 01:05:40,424][52059] Updated weights for policy 1, policy_version 30252 (0.0009) [2023-10-08 01:05:40,778][52059] Updated weights for policy 1, policy_version 30262 (0.0008) [2023-10-08 01:05:41,143][52059] Updated weights for policy 1, policy_version 30272 (0.0008) [2023-10-08 01:05:41,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 61571072. Throughput: 0: 1715.2, 1: 1760.3. Samples: 15398986. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-10-08 01:05:41,211][50642] Avg episode reward: [(0, '20.090'), (1, '18.620')] [2023-10-08 01:05:41,889][52060] Updated weights for policy 0, policy_version 29860 (0.0008) [2023-10-08 01:05:42,269][52060] Updated weights for policy 0, policy_version 29870 (0.0008) [2023-10-08 01:05:42,635][52060] Updated weights for policy 0, policy_version 29880 (0.0009) [2023-10-08 01:05:45,123][52059] Updated weights for policy 1, policy_version 30282 (0.0007) [2023-10-08 01:05:45,487][52059] Updated weights for policy 1, policy_version 30292 (0.0009) [2023-10-08 01:05:45,856][52059] Updated weights for policy 1, policy_version 30302 (0.0008) [2023-10-08 01:05:46,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 61636608. Throughput: 0: 1727.9, 1: 1728.4. Samples: 15419216. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-10-08 01:05:46,211][50642] Avg episode reward: [(0, '19.290'), (1, '16.660')] [2023-10-08 01:05:46,453][52060] Updated weights for policy 0, policy_version 29890 (0.0008) [2023-10-08 01:05:46,826][52060] Updated weights for policy 0, policy_version 29900 (0.0008) [2023-10-08 01:05:47,209][52060] Updated weights for policy 0, policy_version 29910 (0.0007) [2023-10-08 01:05:47,581][52060] Updated weights for policy 0, policy_version 29920 (0.0008) [2023-10-08 01:05:49,946][52059] Updated weights for policy 1, policy_version 30312 (0.0008) [2023-10-08 01:05:50,313][52059] Updated weights for policy 1, policy_version 30322 (0.0010) [2023-10-08 01:05:50,674][52059] Updated weights for policy 1, policy_version 30332 (0.0010) [2023-10-08 01:05:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 61702144. Throughput: 0: 1700.4, 1: 1756.7. Samples: 15429540. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-10-08 01:05:51,211][50642] Avg episode reward: [(0, '20.110'), (1, '20.450')] [2023-10-08 01:05:51,617][52060] Updated weights for policy 0, policy_version 29930 (0.0007) [2023-10-08 01:05:51,985][52060] Updated weights for policy 0, policy_version 29940 (0.0008) [2023-10-08 01:05:52,357][52060] Updated weights for policy 0, policy_version 29950 (0.0007) [2023-10-08 01:05:54,573][52059] Updated weights for policy 1, policy_version 30342 (0.0009) [2023-10-08 01:05:54,939][52059] Updated weights for policy 1, policy_version 30352 (0.0008) [2023-10-08 01:05:55,304][52059] Updated weights for policy 1, policy_version 30362 (0.0008) [2023-10-08 01:05:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 61767680. Throughput: 0: 1722.9, 1: 1737.5. Samples: 15450200. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-10-08 01:05:56,211][50642] Avg episode reward: [(0, '19.810'), (1, '20.720')] [2023-10-08 01:05:56,315][52060] Updated weights for policy 0, policy_version 29960 (0.0008) [2023-10-08 01:05:56,689][52060] Updated weights for policy 0, policy_version 29970 (0.0008) [2023-10-08 01:05:57,059][52060] Updated weights for policy 0, policy_version 29980 (0.0008) [2023-10-08 01:05:59,146][52059] Updated weights for policy 1, policy_version 30372 (0.0009) [2023-10-08 01:05:59,522][52059] Updated weights for policy 1, policy_version 30382 (0.0009) [2023-10-08 01:05:59,886][52059] Updated weights for policy 1, policy_version 30392 (0.0009) [2023-10-08 01:06:00,977][52060] Updated weights for policy 0, policy_version 29990 (0.0010) [2023-10-08 01:06:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 61833216. Throughput: 0: 1724.7, 1: 1722.8. Samples: 15471048. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) [2023-10-08 01:06:01,211][50642] Avg episode reward: [(0, '19.420'), (1, '16.550')] [2023-10-08 01:06:01,347][52060] Updated weights for policy 0, policy_version 30000 (0.0008) [2023-10-08 01:06:01,717][52060] Updated weights for policy 0, policy_version 30010 (0.0010) [2023-10-08 01:06:03,858][52059] Updated weights for policy 1, policy_version 30402 (0.0009) [2023-10-08 01:06:04,230][52059] Updated weights for policy 1, policy_version 30412 (0.0009) [2023-10-08 01:06:04,591][52059] Updated weights for policy 1, policy_version 30422 (0.0007) [2023-10-08 01:06:04,957][52059] Updated weights for policy 1, policy_version 30432 (0.0008) [2023-10-08 01:06:05,835][52060] Updated weights for policy 0, policy_version 30020 (0.0009) [2023-10-08 01:06:06,196][52060] Updated weights for policy 0, policy_version 30030 (0.0008) [2023-10-08 01:06:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 61898752. Throughput: 0: 1715.5, 1: 1749.5. Samples: 15481592. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:06:06,211][50642] Avg episode reward: [(0, '20.570'), (1, '20.190')] [2023-10-08 01:06:06,567][52060] Updated weights for policy 0, policy_version 30040 (0.0008) [2023-10-08 01:06:08,691][52059] Updated weights for policy 1, policy_version 30442 (0.0007) [2023-10-08 01:06:09,059][52059] Updated weights for policy 1, policy_version 30452 (0.0009) [2023-10-08 01:06:09,419][52059] Updated weights for policy 1, policy_version 30462 (0.0010) [2023-10-08 01:06:10,497][52060] Updated weights for policy 0, policy_version 30050 (0.0009) [2023-10-08 01:06:10,872][52060] Updated weights for policy 0, policy_version 30060 (0.0008) [2023-10-08 01:06:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 61964288. Throughput: 0: 1724.8, 1: 1722.2. Samples: 15502032. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:06:11,211][50642] Avg episode reward: [(0, '19.730'), (1, '19.600')] [2023-10-08 01:06:11,241][52060] Updated weights for policy 0, policy_version 30070 (0.0008) [2023-10-08 01:06:11,601][52060] Updated weights for policy 0, policy_version 30080 (0.0009) [2023-10-08 01:06:13,312][52059] Updated weights for policy 1, policy_version 30472 (0.0008) [2023-10-08 01:06:13,678][52059] Updated weights for policy 1, policy_version 30482 (0.0007) [2023-10-08 01:06:14,039][52059] Updated weights for policy 1, policy_version 30492 (0.0007) [2023-10-08 01:06:15,682][52060] Updated weights for policy 0, policy_version 30090 (0.0009) [2023-10-08 01:06:16,041][52060] Updated weights for policy 0, policy_version 30100 (0.0010) [2023-10-08 01:06:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 62029824. Throughput: 0: 1719.0, 1: 1736.0. Samples: 15523266. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:06:16,211][50642] Avg episode reward: [(0, '19.870'), (1, '17.510')] [2023-10-08 01:06:16,405][52060] Updated weights for policy 0, policy_version 30110 (0.0008) [2023-10-08 01:06:17,933][52059] Updated weights for policy 1, policy_version 30502 (0.0007) [2023-10-08 01:06:18,297][52059] Updated weights for policy 1, policy_version 30512 (0.0008) [2023-10-08 01:06:18,658][52059] Updated weights for policy 1, policy_version 30522 (0.0008) [2023-10-08 01:06:20,140][52060] Updated weights for policy 0, policy_version 30120 (0.0008) [2023-10-08 01:06:20,507][52060] Updated weights for policy 0, policy_version 30130 (0.0007) [2023-10-08 01:06:20,865][52060] Updated weights for policy 0, policy_version 30140 (0.0009) [2023-10-08 01:06:21,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 62128128. Throughput: 0: 1736.8, 1: 1731.3. Samples: 15533486. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:06:21,211][50642] Avg episode reward: [(0, '20.270'), (1, '17.290')] [2023-10-08 01:06:22,606][52059] Updated weights for policy 1, policy_version 30532 (0.0007) [2023-10-08 01:06:22,974][52059] Updated weights for policy 1, policy_version 30542 (0.0008) [2023-10-08 01:06:23,355][52059] Updated weights for policy 1, policy_version 30552 (0.0008) [2023-10-08 01:06:24,764][52060] Updated weights for policy 0, policy_version 30150 (0.0009) [2023-10-08 01:06:25,137][52060] Updated weights for policy 0, policy_version 30160 (0.0008) [2023-10-08 01:06:25,500][52060] Updated weights for policy 0, policy_version 30170 (0.0009) [2023-10-08 01:06:26,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 62193664. Throughput: 0: 1730.6, 1: 1728.0. Samples: 15554626. Policy #0 lag: (min: 5.0, avg: 13.4, max: 37.0) [2023-10-08 01:06:26,211][50642] Avg episode reward: [(0, '18.770'), (1, '19.340')] [2023-10-08 01:06:27,243][52059] Updated weights for policy 1, policy_version 30562 (0.0008) [2023-10-08 01:06:27,609][52059] Updated weights for policy 1, policy_version 30572 (0.0007) [2023-10-08 01:06:27,967][52059] Updated weights for policy 1, policy_version 30582 (0.0009) [2023-10-08 01:06:28,337][52059] Updated weights for policy 1, policy_version 30592 (0.0008) [2023-10-08 01:06:29,615][52060] Updated weights for policy 0, policy_version 30180 (0.0009) [2023-10-08 01:06:29,980][52060] Updated weights for policy 0, policy_version 30190 (0.0008) [2023-10-08 01:06:30,348][52060] Updated weights for policy 0, policy_version 30200 (0.0007) [2023-10-08 01:06:31,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 62259200. Throughput: 0: 1703.8, 1: 1756.5. Samples: 15574932. Policy #0 lag: (min: 5.0, avg: 13.4, max: 37.0) [2023-10-08 01:06:31,211][50642] Avg episode reward: [(0, '19.650'), (1, '18.360')] [2023-10-08 01:06:32,201][52059] Updated weights for policy 1, policy_version 30602 (0.0007) [2023-10-08 01:06:32,572][52059] Updated weights for policy 1, policy_version 30612 (0.0009) [2023-10-08 01:06:32,931][52059] Updated weights for policy 1, policy_version 30622 (0.0007) [2023-10-08 01:06:34,175][52060] Updated weights for policy 0, policy_version 30210 (0.0008) [2023-10-08 01:06:34,541][52060] Updated weights for policy 0, policy_version 30220 (0.0010) [2023-10-08 01:06:34,911][52060] Updated weights for policy 0, policy_version 30230 (0.0010) [2023-10-08 01:06:35,272][52060] Updated weights for policy 0, policy_version 30240 (0.0009) [2023-10-08 01:06:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 62324736. Throughput: 0: 1739.5, 1: 1732.0. Samples: 15585758. Policy #0 lag: (min: 5.0, avg: 13.4, max: 37.0) [2023-10-08 01:06:36,211][50642] Avg episode reward: [(0, '20.790'), (1, '19.990')] [2023-10-08 01:06:36,992][52059] Updated weights for policy 1, policy_version 30632 (0.0008) [2023-10-08 01:06:37,363][52059] Updated weights for policy 1, policy_version 30642 (0.0010) [2023-10-08 01:06:37,730][52059] Updated weights for policy 1, policy_version 30652 (0.0007) [2023-10-08 01:06:39,321][52060] Updated weights for policy 0, policy_version 30250 (0.0009) [2023-10-08 01:06:39,689][52060] Updated weights for policy 0, policy_version 30260 (0.0011) [2023-10-08 01:06:40,070][52060] Updated weights for policy 0, policy_version 30270 (0.0011) [2023-10-08 01:06:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 62390272. Throughput: 0: 1720.2, 1: 1748.4. Samples: 15606288. Policy #0 lag: (min: 5.0, avg: 13.4, max: 37.0) [2023-10-08 01:06:41,211][50642] Avg episode reward: [(0, '20.110'), (1, '20.220')] [2023-10-08 01:06:41,555][52059] Updated weights for policy 1, policy_version 30662 (0.0008) [2023-10-08 01:06:41,924][52059] Updated weights for policy 1, policy_version 30672 (0.0009) [2023-10-08 01:06:42,285][52059] Updated weights for policy 1, policy_version 30682 (0.0008) [2023-10-08 01:06:44,057][52060] Updated weights for policy 0, policy_version 30280 (0.0009) [2023-10-08 01:06:44,438][52060] Updated weights for policy 0, policy_version 30290 (0.0008) [2023-10-08 01:06:44,801][52060] Updated weights for policy 0, policy_version 30300 (0.0007) [2023-10-08 01:06:46,179][52059] Updated weights for policy 1, policy_version 30692 (0.0008) [2023-10-08 01:06:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 62455808. Throughput: 0: 1706.0, 1: 1759.4. Samples: 15626990. Policy #0 lag: (min: 5.0, avg: 13.4, max: 37.0) [2023-10-08 01:06:46,211][50642] Avg episode reward: [(0, '19.690'), (1, '19.650')] [2023-10-08 01:06:46,217][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000030304_31031296.pth... [2023-10-08 01:06:46,252][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000028704_29392896.pth [2023-10-08 01:06:46,537][52059] Updated weights for policy 1, policy_version 30702 (0.0007) [2023-10-08 01:06:46,904][52059] Updated weights for policy 1, policy_version 30712 (0.0007) [2023-10-08 01:06:47,186][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000030720_31457280.pth... [2023-10-08 01:06:47,224][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000029088_29786112.pth [2023-10-08 01:06:48,853][52060] Updated weights for policy 0, policy_version 30310 (0.0009) [2023-10-08 01:06:49,221][52060] Updated weights for policy 0, policy_version 30320 (0.0008) [2023-10-08 01:06:49,590][52060] Updated weights for policy 0, policy_version 30330 (0.0007) [2023-10-08 01:06:50,760][52059] Updated weights for policy 1, policy_version 30722 (0.0008) [2023-10-08 01:06:51,130][52059] Updated weights for policy 1, policy_version 30732 (0.0007) [2023-10-08 01:06:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 62521344. Throughput: 0: 1727.4, 1: 1731.0. Samples: 15637220. Policy #0 lag: (min: 5.0, avg: 5.8, max: 25.0) [2023-10-08 01:06:51,211][50642] Avg episode reward: [(0, '20.370'), (1, '19.860')] [2023-10-08 01:06:51,494][52059] Updated weights for policy 1, policy_version 30742 (0.0007) [2023-10-08 01:06:51,857][52059] Updated weights for policy 1, policy_version 30752 (0.0011) [2023-10-08 01:06:53,683][52060] Updated weights for policy 0, policy_version 30340 (0.0010) [2023-10-08 01:06:54,056][52060] Updated weights for policy 0, policy_version 30350 (0.0010) [2023-10-08 01:06:54,429][52060] Updated weights for policy 0, policy_version 30360 (0.0008) [2023-10-08 01:06:55,905][52059] Updated weights for policy 1, policy_version 30762 (0.0010) [2023-10-08 01:06:56,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 62586880. Throughput: 0: 1692.8, 1: 1754.1. Samples: 15657142. Policy #0 lag: (min: 5.0, avg: 5.8, max: 25.0) [2023-10-08 01:06:56,211][50642] Avg episode reward: [(0, '20.380'), (1, '19.910')] [2023-10-08 01:06:56,271][52059] Updated weights for policy 1, policy_version 30772 (0.0010) [2023-10-08 01:06:56,633][52059] Updated weights for policy 1, policy_version 30782 (0.0008) [2023-10-08 01:06:58,328][52060] Updated weights for policy 0, policy_version 30370 (0.0008) [2023-10-08 01:06:58,688][52060] Updated weights for policy 0, policy_version 30380 (0.0010) [2023-10-08 01:06:59,056][52060] Updated weights for policy 0, policy_version 30390 (0.0010) [2023-10-08 01:06:59,421][52060] Updated weights for policy 0, policy_version 30400 (0.0010) [2023-10-08 01:07:00,636][52059] Updated weights for policy 1, policy_version 30792 (0.0009) [2023-10-08 01:07:01,005][52059] Updated weights for policy 1, policy_version 30802 (0.0011) [2023-10-08 01:07:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 62652416. Throughput: 0: 1703.3, 1: 1734.2. Samples: 15677952. Policy #0 lag: (min: 5.0, avg: 5.8, max: 25.0) [2023-10-08 01:07:01,211][50642] Avg episode reward: [(0, '20.480'), (1, '19.660')] [2023-10-08 01:07:01,369][52059] Updated weights for policy 1, policy_version 30812 (0.0011) [2023-10-08 01:07:03,576][52060] Updated weights for policy 0, policy_version 30410 (0.0007) [2023-10-08 01:07:03,948][52060] Updated weights for policy 0, policy_version 30420 (0.0007) [2023-10-08 01:07:04,321][52060] Updated weights for policy 0, policy_version 30430 (0.0007) [2023-10-08 01:07:05,257][52059] Updated weights for policy 1, policy_version 30822 (0.0008) [2023-10-08 01:07:05,617][52059] Updated weights for policy 1, policy_version 30832 (0.0010) [2023-10-08 01:07:05,979][52059] Updated weights for policy 1, policy_version 30842 (0.0007) [2023-10-08 01:07:06,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 62750720. Throughput: 0: 1697.7, 1: 1747.4. Samples: 15688514. Policy #0 lag: (min: 5.0, avg: 5.8, max: 25.0) [2023-10-08 01:07:06,211][50642] Avg episode reward: [(0, '20.190'), (1, '18.990')] [2023-10-08 01:07:08,069][52060] Updated weights for policy 0, policy_version 30440 (0.0008) [2023-10-08 01:07:08,441][52060] Updated weights for policy 0, policy_version 30450 (0.0009) [2023-10-08 01:07:08,804][52060] Updated weights for policy 0, policy_version 30460 (0.0008) [2023-10-08 01:07:09,803][52059] Updated weights for policy 1, policy_version 30852 (0.0009) [2023-10-08 01:07:10,170][52059] Updated weights for policy 1, policy_version 30862 (0.0008) [2023-10-08 01:07:10,539][52059] Updated weights for policy 1, policy_version 30872 (0.0009) [2023-10-08 01:07:11,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 62816256. Throughput: 0: 1693.8, 1: 1744.3. Samples: 15709338. Policy #0 lag: (min: 5.0, avg: 5.8, max: 25.0) [2023-10-08 01:07:11,212][50642] Avg episode reward: [(0, '20.000'), (1, '20.440')] [2023-10-08 01:07:12,728][52060] Updated weights for policy 0, policy_version 30470 (0.0007) [2023-10-08 01:07:13,099][52060] Updated weights for policy 0, policy_version 30480 (0.0008) [2023-10-08 01:07:13,465][52060] Updated weights for policy 0, policy_version 30490 (0.0007) [2023-10-08 01:07:14,596][52059] Updated weights for policy 1, policy_version 30882 (0.0009) [2023-10-08 01:07:14,963][52059] Updated weights for policy 1, policy_version 30892 (0.0008) [2023-10-08 01:07:15,324][52059] Updated weights for policy 1, policy_version 30902 (0.0009) [2023-10-08 01:07:15,690][52059] Updated weights for policy 1, policy_version 30912 (0.0008) [2023-10-08 01:07:16,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 62881792. Throughput: 0: 1725.8, 1: 1716.2. Samples: 15729822. Policy #0 lag: (min: 30.0, avg: 42.4, max: 62.0) [2023-10-08 01:07:16,211][50642] Avg episode reward: [(0, '20.530'), (1, '19.620')] [2023-10-08 01:07:17,269][52060] Updated weights for policy 0, policy_version 30500 (0.0007) [2023-10-08 01:07:17,640][52060] Updated weights for policy 0, policy_version 30510 (0.0007) [2023-10-08 01:07:18,003][52060] Updated weights for policy 0, policy_version 30520 (0.0009) [2023-10-08 01:07:19,593][52059] Updated weights for policy 1, policy_version 30922 (0.0008) [2023-10-08 01:07:19,953][52059] Updated weights for policy 1, policy_version 30932 (0.0007) [2023-10-08 01:07:20,320][52059] Updated weights for policy 1, policy_version 30942 (0.0009) [2023-10-08 01:07:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 62947328. Throughput: 0: 1693.9, 1: 1751.3. Samples: 15740794. Policy #0 lag: (min: 30.0, avg: 42.4, max: 62.0) [2023-10-08 01:07:21,211][50642] Avg episode reward: [(0, '19.060'), (1, '20.050')] [2023-10-08 01:07:21,861][52060] Updated weights for policy 0, policy_version 30530 (0.0009) [2023-10-08 01:07:22,242][52060] Updated weights for policy 0, policy_version 30540 (0.0010) [2023-10-08 01:07:22,613][52060] Updated weights for policy 0, policy_version 30550 (0.0009) [2023-10-08 01:07:22,986][52060] Updated weights for policy 0, policy_version 30560 (0.0008) [2023-10-08 01:07:24,264][52059] Updated weights for policy 1, policy_version 30952 (0.0008) [2023-10-08 01:07:24,643][52059] Updated weights for policy 1, policy_version 30962 (0.0007) [2023-10-08 01:07:25,010][52059] Updated weights for policy 1, policy_version 30972 (0.0008) [2023-10-08 01:07:26,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 63012864. Throughput: 0: 1716.8, 1: 1727.8. Samples: 15761292. Policy #0 lag: (min: 30.0, avg: 42.4, max: 62.0) [2023-10-08 01:07:26,211][50642] Avg episode reward: [(0, '19.460'), (1, '18.130')] [2023-10-08 01:07:26,870][52060] Updated weights for policy 0, policy_version 30570 (0.0007) [2023-10-08 01:07:27,233][52060] Updated weights for policy 0, policy_version 30580 (0.0008) [2023-10-08 01:07:27,600][52060] Updated weights for policy 0, policy_version 30590 (0.0007) [2023-10-08 01:07:28,862][52059] Updated weights for policy 1, policy_version 30982 (0.0008) [2023-10-08 01:07:29,230][52059] Updated weights for policy 1, policy_version 30992 (0.0007) [2023-10-08 01:07:29,584][52059] Updated weights for policy 1, policy_version 31002 (0.0007) [2023-10-08 01:07:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 63078400. Throughput: 0: 1734.1, 1: 1721.0. Samples: 15782468. Policy #0 lag: (min: 30.0, avg: 42.4, max: 62.0) [2023-10-08 01:07:31,211][50642] Avg episode reward: [(0, '17.950'), (1, '19.680')] [2023-10-08 01:07:31,440][52060] Updated weights for policy 0, policy_version 30600 (0.0008) [2023-10-08 01:07:31,819][52060] Updated weights for policy 0, policy_version 30610 (0.0008) [2023-10-08 01:07:32,193][52060] Updated weights for policy 0, policy_version 30620 (0.0008) [2023-10-08 01:07:33,626][52059] Updated weights for policy 1, policy_version 31012 (0.0007) [2023-10-08 01:07:33,989][52059] Updated weights for policy 1, policy_version 31022 (0.0009) [2023-10-08 01:07:34,353][52059] Updated weights for policy 1, policy_version 31032 (0.0009) [2023-10-08 01:07:36,117][52060] Updated weights for policy 0, policy_version 30630 (0.0009) [2023-10-08 01:07:36,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 63143936. Throughput: 0: 1713.5, 1: 1742.6. Samples: 15792746. Policy #0 lag: (min: 30.0, avg: 42.4, max: 62.0) [2023-10-08 01:07:36,211][50642] Avg episode reward: [(0, '18.840'), (1, '21.080')] [2023-10-08 01:07:36,487][52060] Updated weights for policy 0, policy_version 30640 (0.0008) [2023-10-08 01:07:36,864][52060] Updated weights for policy 0, policy_version 30650 (0.0008) [2023-10-08 01:07:38,073][52059] Updated weights for policy 1, policy_version 31042 (0.0008) [2023-10-08 01:07:38,439][52059] Updated weights for policy 1, policy_version 31052 (0.0007) [2023-10-08 01:07:38,801][52059] Updated weights for policy 1, policy_version 31062 (0.0007) [2023-10-08 01:07:39,163][52059] Updated weights for policy 1, policy_version 31072 (0.0008) [2023-10-08 01:07:40,925][52060] Updated weights for policy 0, policy_version 30660 (0.0008) [2023-10-08 01:07:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 63209472. Throughput: 0: 1746.8, 1: 1723.7. Samples: 15813310. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:07:41,211][50642] Avg episode reward: [(0, '20.190'), (1, '18.890')] [2023-10-08 01:07:41,289][52060] Updated weights for policy 0, policy_version 30670 (0.0009) [2023-10-08 01:07:41,655][52060] Updated weights for policy 0, policy_version 30680 (0.0011) [2023-10-08 01:07:43,065][52059] Updated weights for policy 1, policy_version 31082 (0.0009) [2023-10-08 01:07:43,428][52059] Updated weights for policy 1, policy_version 31092 (0.0009) [2023-10-08 01:07:43,789][52059] Updated weights for policy 1, policy_version 31102 (0.0008) [2023-10-08 01:07:45,545][52060] Updated weights for policy 0, policy_version 30690 (0.0010) [2023-10-08 01:07:45,925][52060] Updated weights for policy 0, policy_version 30700 (0.0007) [2023-10-08 01:07:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 63275008. Throughput: 0: 1738.8, 1: 1735.1. Samples: 15834278. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:07:46,211][50642] Avg episode reward: [(0, '18.030'), (1, '20.070')] [2023-10-08 01:07:46,294][52060] Updated weights for policy 0, policy_version 30710 (0.0009) [2023-10-08 01:07:46,650][52060] Updated weights for policy 0, policy_version 30720 (0.0009) [2023-10-08 01:07:47,697][52059] Updated weights for policy 1, policy_version 31112 (0.0008) [2023-10-08 01:07:48,069][52059] Updated weights for policy 1, policy_version 31122 (0.0007) [2023-10-08 01:07:48,425][52059] Updated weights for policy 1, policy_version 31132 (0.0009) [2023-10-08 01:07:50,605][52060] Updated weights for policy 0, policy_version 30730 (0.0009) [2023-10-08 01:07:50,972][52060] Updated weights for policy 0, policy_version 30740 (0.0008) [2023-10-08 01:07:51,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 63340544. Throughput: 0: 1736.6, 1: 1720.1. Samples: 15844064. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:07:51,211][50642] Avg episode reward: [(0, '19.860'), (1, '21.400')] [2023-10-08 01:07:51,344][52060] Updated weights for policy 0, policy_version 30750 (0.0008) [2023-10-08 01:07:52,364][52059] Updated weights for policy 1, policy_version 31142 (0.0008) [2023-10-08 01:07:52,728][52059] Updated weights for policy 1, policy_version 31152 (0.0010) [2023-10-08 01:07:53,091][52059] Updated weights for policy 1, policy_version 31162 (0.0009) [2023-10-08 01:07:55,378][52060] Updated weights for policy 0, policy_version 30760 (0.0008) [2023-10-08 01:07:55,752][52060] Updated weights for policy 0, policy_version 30770 (0.0011) [2023-10-08 01:07:56,116][52060] Updated weights for policy 0, policy_version 30780 (0.0007) [2023-10-08 01:07:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 63406080. Throughput: 0: 1746.3, 1: 1724.9. Samples: 15865542. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:07:56,211][50642] Avg episode reward: [(0, '21.260'), (1, '19.200')] [2023-10-08 01:07:56,970][52059] Updated weights for policy 1, policy_version 31172 (0.0008) [2023-10-08 01:07:57,345][52059] Updated weights for policy 1, policy_version 31182 (0.0007) [2023-10-08 01:07:57,719][52059] Updated weights for policy 1, policy_version 31192 (0.0007) [2023-10-08 01:07:59,977][52060] Updated weights for policy 0, policy_version 30790 (0.0008) [2023-10-08 01:08:00,347][52060] Updated weights for policy 0, policy_version 30800 (0.0007) [2023-10-08 01:08:00,712][52060] Updated weights for policy 0, policy_version 30810 (0.0008) [2023-10-08 01:08:01,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 63504384. Throughput: 0: 1712.8, 1: 1756.2. Samples: 15885930. Policy #0 lag: (min: 5.0, avg: 11.0, max: 37.0) [2023-10-08 01:08:01,211][50642] Avg episode reward: [(0, '17.170'), (1, '18.980')] [2023-10-08 01:08:01,663][52059] Updated weights for policy 1, policy_version 31202 (0.0009) [2023-10-08 01:08:02,033][52059] Updated weights for policy 1, policy_version 31212 (0.0008) [2023-10-08 01:08:02,408][52059] Updated weights for policy 1, policy_version 31222 (0.0008) [2023-10-08 01:08:02,765][52059] Updated weights for policy 1, policy_version 31232 (0.0007) [2023-10-08 01:08:04,614][52060] Updated weights for policy 0, policy_version 30820 (0.0008) [2023-10-08 01:08:04,991][52060] Updated weights for policy 0, policy_version 30830 (0.0007) [2023-10-08 01:08:05,358][52060] Updated weights for policy 0, policy_version 30840 (0.0009) [2023-10-08 01:08:06,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 63569920. Throughput: 0: 1738.1, 1: 1722.1. Samples: 15896502. Policy #0 lag: (min: 5.0, avg: 11.0, max: 37.0) [2023-10-08 01:08:06,211][50642] Avg episode reward: [(0, '19.570'), (1, '21.010')] [2023-10-08 01:08:06,609][52059] Updated weights for policy 1, policy_version 31242 (0.0007) [2023-10-08 01:08:06,972][52059] Updated weights for policy 1, policy_version 31252 (0.0009) [2023-10-08 01:08:07,335][52059] Updated weights for policy 1, policy_version 31262 (0.0011) [2023-10-08 01:08:09,348][52060] Updated weights for policy 0, policy_version 30850 (0.0010) [2023-10-08 01:08:09,713][52060] Updated weights for policy 0, policy_version 30860 (0.0008) [2023-10-08 01:08:10,085][52060] Updated weights for policy 0, policy_version 30870 (0.0010) [2023-10-08 01:08:10,450][52060] Updated weights for policy 0, policy_version 30880 (0.0007) [2023-10-08 01:08:11,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 63635456. Throughput: 0: 1722.3, 1: 1746.6. Samples: 15917390. Policy #0 lag: (min: 5.0, avg: 11.0, max: 37.0) [2023-10-08 01:08:11,211][50642] Avg episode reward: [(0, '19.980'), (1, '18.760')] [2023-10-08 01:08:11,414][52059] Updated weights for policy 1, policy_version 31272 (0.0008) [2023-10-08 01:08:11,784][52059] Updated weights for policy 1, policy_version 31282 (0.0009) [2023-10-08 01:08:12,148][52059] Updated weights for policy 1, policy_version 31292 (0.0007) [2023-10-08 01:08:14,404][52060] Updated weights for policy 0, policy_version 30890 (0.0008) [2023-10-08 01:08:14,766][52060] Updated weights for policy 0, policy_version 30900 (0.0007) [2023-10-08 01:08:15,135][52060] Updated weights for policy 0, policy_version 30910 (0.0008) [2023-10-08 01:08:15,755][52059] Updated weights for policy 1, policy_version 31302 (0.0009) [2023-10-08 01:08:16,123][52059] Updated weights for policy 1, policy_version 31312 (0.0007) [2023-10-08 01:08:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 63700992. Throughput: 0: 1697.9, 1: 1752.0. Samples: 15937716. Policy #0 lag: (min: 5.0, avg: 11.0, max: 37.0) [2023-10-08 01:08:16,211][50642] Avg episode reward: [(0, '16.380'), (1, '18.730')] [2023-10-08 01:08:16,485][52059] Updated weights for policy 1, policy_version 31322 (0.0009) [2023-10-08 01:08:19,140][52060] Updated weights for policy 0, policy_version 30920 (0.0009) [2023-10-08 01:08:19,515][52060] Updated weights for policy 0, policy_version 30930 (0.0008) [2023-10-08 01:08:19,899][52060] Updated weights for policy 0, policy_version 30940 (0.0008) [2023-10-08 01:08:20,494][52059] Updated weights for policy 1, policy_version 31332 (0.0008) [2023-10-08 01:08:20,856][52059] Updated weights for policy 1, policy_version 31342 (0.0008) [2023-10-08 01:08:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 63766528. Throughput: 0: 1733.6, 1: 1738.0. Samples: 15948970. Policy #0 lag: (min: 5.0, avg: 11.0, max: 37.0) [2023-10-08 01:08:21,211][50642] Avg episode reward: [(0, '19.830'), (1, '19.200')] [2023-10-08 01:08:21,227][52059] Updated weights for policy 1, policy_version 31352 (0.0008) [2023-10-08 01:08:23,766][52060] Updated weights for policy 0, policy_version 30950 (0.0007) [2023-10-08 01:08:24,145][52060] Updated weights for policy 0, policy_version 30960 (0.0008) [2023-10-08 01:08:24,516][52060] Updated weights for policy 0, policy_version 30970 (0.0008) [2023-10-08 01:08:25,177][52059] Updated weights for policy 1, policy_version 31362 (0.0007) [2023-10-08 01:08:25,536][52059] Updated weights for policy 1, policy_version 31372 (0.0009) [2023-10-08 01:08:25,907][52059] Updated weights for policy 1, policy_version 31382 (0.0008) [2023-10-08 01:08:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 63832064. Throughput: 0: 1700.8, 1: 1757.2. Samples: 15968922. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 01:08:26,211][50642] Avg episode reward: [(0, '19.310'), (1, '19.540')] [2023-10-08 01:08:26,256][52059] Updated weights for policy 1, policy_version 31392 (0.0009) [2023-10-08 01:08:28,560][52060] Updated weights for policy 0, policy_version 30980 (0.0008) [2023-10-08 01:08:28,923][52060] Updated weights for policy 0, policy_version 30990 (0.0008) [2023-10-08 01:08:29,293][52060] Updated weights for policy 0, policy_version 31000 (0.0008) [2023-10-08 01:08:29,978][52059] Updated weights for policy 1, policy_version 31402 (0.0008) [2023-10-08 01:08:30,342][52059] Updated weights for policy 1, policy_version 31412 (0.0007) [2023-10-08 01:08:30,708][52059] Updated weights for policy 1, policy_version 31422 (0.0007) [2023-10-08 01:08:31,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 63930368. Throughput: 0: 1707.1, 1: 1732.4. Samples: 15989056. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 01:08:31,211][50642] Avg episode reward: [(0, '16.670'), (1, '17.740')] [2023-10-08 01:08:33,260][52060] Updated weights for policy 0, policy_version 31010 (0.0008) [2023-10-08 01:08:33,634][52060] Updated weights for policy 0, policy_version 31020 (0.0010) [2023-10-08 01:08:34,005][52060] Updated weights for policy 0, policy_version 31030 (0.0010) [2023-10-08 01:08:34,384][52060] Updated weights for policy 0, policy_version 31040 (0.0010) [2023-10-08 01:08:34,763][52059] Updated weights for policy 1, policy_version 31432 (0.0007) [2023-10-08 01:08:35,129][52059] Updated weights for policy 1, policy_version 31442 (0.0010) [2023-10-08 01:08:35,505][52059] Updated weights for policy 1, policy_version 31452 (0.0010) [2023-10-08 01:08:36,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.4, 300 sec: 13884.8). Total num frames: 63995904. Throughput: 0: 1712.0, 1: 1759.2. Samples: 16000268. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 01:08:36,211][50642] Avg episode reward: [(0, '20.740'), (1, '19.680')] [2023-10-08 01:08:38,541][52060] Updated weights for policy 0, policy_version 31050 (0.0008) [2023-10-08 01:08:38,911][52060] Updated weights for policy 0, policy_version 31060 (0.0010) [2023-10-08 01:08:39,283][52060] Updated weights for policy 0, policy_version 31070 (0.0009) [2023-10-08 01:08:39,411][52059] Updated weights for policy 1, policy_version 31462 (0.0009) [2023-10-08 01:08:39,776][52059] Updated weights for policy 1, policy_version 31472 (0.0008) [2023-10-08 01:08:40,152][52059] Updated weights for policy 1, policy_version 31482 (0.0008) [2023-10-08 01:08:41,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 64061440. Throughput: 0: 1695.1, 1: 1740.1. Samples: 16020128. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 01:08:41,211][50642] Avg episode reward: [(0, '18.750'), (1, '19.540')] [2023-10-08 01:08:43,027][52060] Updated weights for policy 0, policy_version 31080 (0.0007) [2023-10-08 01:08:43,398][52060] Updated weights for policy 0, policy_version 31090 (0.0010) [2023-10-08 01:08:43,768][52060] Updated weights for policy 0, policy_version 31100 (0.0007) [2023-10-08 01:08:44,068][52059] Updated weights for policy 1, policy_version 31492 (0.0009) [2023-10-08 01:08:44,437][52059] Updated weights for policy 1, policy_version 31502 (0.0008) [2023-10-08 01:08:44,810][52059] Updated weights for policy 1, policy_version 31512 (0.0008) [2023-10-08 01:08:46,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 64126976. Throughput: 0: 1720.5, 1: 1724.7. Samples: 16040966. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 01:08:46,211][50642] Avg episode reward: [(0, '18.180'), (1, '16.920')] [2023-10-08 01:08:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000031520_32276480.pth... [2023-10-08 01:08:46,222][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000031104_31850496.pth... [2023-10-08 01:08:46,254][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000029504_30212096.pth [2023-10-08 01:08:46,257][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000029888_30605312.pth [2023-10-08 01:08:47,829][52060] Updated weights for policy 0, policy_version 31110 (0.0007) [2023-10-08 01:08:48,204][52060] Updated weights for policy 0, policy_version 31120 (0.0008) [2023-10-08 01:08:48,563][52060] Updated weights for policy 0, policy_version 31130 (0.0009) [2023-10-08 01:08:48,642][52059] Updated weights for policy 1, policy_version 31522 (0.0007) [2023-10-08 01:08:49,008][52059] Updated weights for policy 1, policy_version 31532 (0.0008) [2023-10-08 01:08:49,385][52059] Updated weights for policy 1, policy_version 31542 (0.0009) [2023-10-08 01:08:49,749][52059] Updated weights for policy 1, policy_version 31552 (0.0010) [2023-10-08 01:08:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 64192512. Throughput: 0: 1698.0, 1: 1748.9. Samples: 16051614. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 01:08:51,211][50642] Avg episode reward: [(0, '20.750'), (1, '18.190')] [2023-10-08 01:08:52,504][52060] Updated weights for policy 0, policy_version 31140 (0.0007) [2023-10-08 01:08:52,861][52060] Updated weights for policy 0, policy_version 31150 (0.0008) [2023-10-08 01:08:53,226][52060] Updated weights for policy 0, policy_version 31160 (0.0010) [2023-10-08 01:08:53,678][52059] Updated weights for policy 1, policy_version 31562 (0.0007) [2023-10-08 01:08:54,039][52059] Updated weights for policy 1, policy_version 31572 (0.0008) [2023-10-08 01:08:54,399][52059] Updated weights for policy 1, policy_version 31582 (0.0007) [2023-10-08 01:08:56,210][50642] Fps is (10 sec: 13107.7, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 64258048. Throughput: 0: 1714.0, 1: 1721.9. Samples: 16072004. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 01:08:56,211][50642] Avg episode reward: [(0, '17.120'), (1, '20.500')] [2023-10-08 01:08:57,057][52060] Updated weights for policy 0, policy_version 31170 (0.0009) [2023-10-08 01:08:57,421][52060] Updated weights for policy 0, policy_version 31180 (0.0007) [2023-10-08 01:08:57,795][52060] Updated weights for policy 0, policy_version 31190 (0.0008) [2023-10-08 01:08:58,157][52060] Updated weights for policy 0, policy_version 31200 (0.0010) [2023-10-08 01:08:58,271][52059] Updated weights for policy 1, policy_version 31592 (0.0009) [2023-10-08 01:08:58,627][52059] Updated weights for policy 1, policy_version 31602 (0.0010) [2023-10-08 01:08:58,999][52059] Updated weights for policy 1, policy_version 31612 (0.0008) [2023-10-08 01:09:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 64323584. Throughput: 0: 1733.7, 1: 1724.9. Samples: 16093354. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 01:09:01,211][50642] Avg episode reward: [(0, '19.420'), (1, '16.870')] [2023-10-08 01:09:02,141][52060] Updated weights for policy 0, policy_version 31210 (0.0010) [2023-10-08 01:09:02,510][52060] Updated weights for policy 0, policy_version 31220 (0.0010) [2023-10-08 01:09:02,878][52060] Updated weights for policy 0, policy_version 31230 (0.0010) [2023-10-08 01:09:03,050][52059] Updated weights for policy 1, policy_version 31622 (0.0007) [2023-10-08 01:09:03,412][52059] Updated weights for policy 1, policy_version 31632 (0.0007) [2023-10-08 01:09:03,785][52059] Updated weights for policy 1, policy_version 31642 (0.0009) [2023-10-08 01:09:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 64389120. Throughput: 0: 1698.8, 1: 1725.7. Samples: 16103076. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 01:09:06,211][50642] Avg episode reward: [(0, '21.490'), (1, '19.030')] [2023-10-08 01:09:06,212][51605] Saving new best policy, reward=21.490! [2023-10-08 01:09:06,793][52060] Updated weights for policy 0, policy_version 31240 (0.0007) [2023-10-08 01:09:07,154][52060] Updated weights for policy 0, policy_version 31250 (0.0010) [2023-10-08 01:09:07,525][52060] Updated weights for policy 0, policy_version 31260 (0.0010) [2023-10-08 01:09:07,703][52059] Updated weights for policy 1, policy_version 31652 (0.0009) [2023-10-08 01:09:08,061][52059] Updated weights for policy 1, policy_version 31662 (0.0010) [2023-10-08 01:09:08,426][52059] Updated weights for policy 1, policy_version 31672 (0.0012) [2023-10-08 01:09:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13884.7). Total num frames: 64454656. Throughput: 0: 1732.6, 1: 1719.6. Samples: 16124270. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 01:09:11,211][50642] Avg episode reward: [(0, '17.830'), (1, '19.030')] [2023-10-08 01:09:11,452][52060] Updated weights for policy 0, policy_version 31270 (0.0008) [2023-10-08 01:09:11,815][52060] Updated weights for policy 0, policy_version 31280 (0.0010) [2023-10-08 01:09:12,193][52060] Updated weights for policy 0, policy_version 31290 (0.0009) [2023-10-08 01:09:12,312][52059] Updated weights for policy 1, policy_version 31682 (0.0009) [2023-10-08 01:09:12,679][52059] Updated weights for policy 1, policy_version 31692 (0.0008) [2023-10-08 01:09:13,048][52059] Updated weights for policy 1, policy_version 31702 (0.0008) [2023-10-08 01:09:13,408][52059] Updated weights for policy 1, policy_version 31712 (0.0010) [2023-10-08 01:09:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 64520192. Throughput: 0: 1735.6, 1: 1746.5. Samples: 16145752. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:09:16,211][50642] Avg episode reward: [(0, '19.850'), (1, '19.670')] [2023-10-08 01:09:16,352][52060] Updated weights for policy 0, policy_version 31300 (0.0007) [2023-10-08 01:09:16,728][52060] Updated weights for policy 0, policy_version 31310 (0.0007) [2023-10-08 01:09:17,095][52060] Updated weights for policy 0, policy_version 31320 (0.0007) [2023-10-08 01:09:17,304][52059] Updated weights for policy 1, policy_version 31722 (0.0008) [2023-10-08 01:09:17,676][52059] Updated weights for policy 1, policy_version 31732 (0.0008) [2023-10-08 01:09:18,047][52059] Updated weights for policy 1, policy_version 31742 (0.0010) [2023-10-08 01:09:20,936][52060] Updated weights for policy 0, policy_version 31330 (0.0008) [2023-10-08 01:09:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 64585728. Throughput: 0: 1724.3, 1: 1719.8. Samples: 16155252. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:09:21,211][50642] Avg episode reward: [(0, '20.700'), (1, '16.620')] [2023-10-08 01:09:21,295][52060] Updated weights for policy 0, policy_version 31340 (0.0007) [2023-10-08 01:09:21,673][52060] Updated weights for policy 0, policy_version 31350 (0.0007) [2023-10-08 01:09:21,952][52059] Updated weights for policy 1, policy_version 31752 (0.0008) [2023-10-08 01:09:22,037][52060] Updated weights for policy 0, policy_version 31360 (0.0007) [2023-10-08 01:09:22,314][52059] Updated weights for policy 1, policy_version 31762 (0.0007) [2023-10-08 01:09:22,675][52059] Updated weights for policy 1, policy_version 31772 (0.0007) [2023-10-08 01:09:25,966][52060] Updated weights for policy 0, policy_version 31370 (0.0008) [2023-10-08 01:09:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 64651264. Throughput: 0: 1741.8, 1: 1736.9. Samples: 16176670. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:09:26,211][50642] Avg episode reward: [(0, '17.290'), (1, '19.750')] [2023-10-08 01:09:26,347][52060] Updated weights for policy 0, policy_version 31380 (0.0011) [2023-10-08 01:09:26,653][52059] Updated weights for policy 1, policy_version 31782 (0.0007) [2023-10-08 01:09:26,704][52060] Updated weights for policy 0, policy_version 31390 (0.0011) [2023-10-08 01:09:27,020][52059] Updated weights for policy 1, policy_version 31792 (0.0007) [2023-10-08 01:09:27,383][52059] Updated weights for policy 1, policy_version 31802 (0.0007) [2023-10-08 01:09:30,706][52060] Updated weights for policy 0, policy_version 31400 (0.0008) [2023-10-08 01:09:31,067][52060] Updated weights for policy 0, policy_version 31410 (0.0011) [2023-10-08 01:09:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 64716800. Throughput: 0: 1730.9, 1: 1749.2. Samples: 16197568. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:09:31,211][50642] Avg episode reward: [(0, '19.590'), (1, '19.220')] [2023-10-08 01:09:31,374][52059] Updated weights for policy 1, policy_version 31812 (0.0007) [2023-10-08 01:09:31,438][52060] Updated weights for policy 0, policy_version 31420 (0.0009) [2023-10-08 01:09:31,737][52059] Updated weights for policy 1, policy_version 31822 (0.0008) [2023-10-08 01:09:32,094][52059] Updated weights for policy 1, policy_version 31832 (0.0009) [2023-10-08 01:09:35,309][52060] Updated weights for policy 0, policy_version 31430 (0.0008) [2023-10-08 01:09:35,677][52060] Updated weights for policy 0, policy_version 31440 (0.0010) [2023-10-08 01:09:35,905][52059] Updated weights for policy 1, policy_version 31842 (0.0007) [2023-10-08 01:09:36,046][52060] Updated weights for policy 0, policy_version 31450 (0.0008) [2023-10-08 01:09:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 64782336. Throughput: 0: 1739.4, 1: 1729.3. Samples: 16207706. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:09:36,211][50642] Avg episode reward: [(0, '19.670'), (1, '19.250')] [2023-10-08 01:09:36,271][52059] Updated weights for policy 1, policy_version 31852 (0.0008) [2023-10-08 01:09:36,635][52059] Updated weights for policy 1, policy_version 31862 (0.0010) [2023-10-08 01:09:37,003][52059] Updated weights for policy 1, policy_version 31872 (0.0010) [2023-10-08 01:09:40,021][52060] Updated weights for policy 0, policy_version 31460 (0.0007) [2023-10-08 01:09:40,400][52060] Updated weights for policy 0, policy_version 31470 (0.0008) [2023-10-08 01:09:40,769][52060] Updated weights for policy 0, policy_version 31480 (0.0008) [2023-10-08 01:09:40,824][52059] Updated weights for policy 1, policy_version 31882 (0.0007) [2023-10-08 01:09:41,189][52059] Updated weights for policy 1, policy_version 31892 (0.0008) [2023-10-08 01:09:41,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 64880640. Throughput: 0: 1729.0, 1: 1760.2. Samples: 16229018. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:09:41,211][50642] Avg episode reward: [(0, '17.080'), (1, '18.550')] [2023-10-08 01:09:41,550][52059] Updated weights for policy 1, policy_version 31902 (0.0010) [2023-10-08 01:09:44,814][52060] Updated weights for policy 0, policy_version 31490 (0.0009) [2023-10-08 01:09:45,189][52060] Updated weights for policy 0, policy_version 31500 (0.0009) [2023-10-08 01:09:45,547][52059] Updated weights for policy 1, policy_version 31912 (0.0008) [2023-10-08 01:09:45,554][52060] Updated weights for policy 0, policy_version 31510 (0.0009) [2023-10-08 01:09:45,921][52060] Updated weights for policy 0, policy_version 31520 (0.0009) [2023-10-08 01:09:45,923][52059] Updated weights for policy 1, policy_version 31922 (0.0009) [2023-10-08 01:09:46,210][50642] Fps is (10 sec: 16383.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 64946176. Throughput: 0: 1700.3, 1: 1744.4. Samples: 16248364. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:09:46,211][50642] Avg episode reward: [(0, '20.380'), (1, '20.650')] [2023-10-08 01:09:46,282][52059] Updated weights for policy 1, policy_version 31932 (0.0008) [2023-10-08 01:09:49,891][52060] Updated weights for policy 0, policy_version 31530 (0.0009) [2023-10-08 01:09:50,106][52059] Updated weights for policy 1, policy_version 31942 (0.0008) [2023-10-08 01:09:50,251][52060] Updated weights for policy 0, policy_version 31540 (0.0008) [2023-10-08 01:09:50,467][52059] Updated weights for policy 1, policy_version 31952 (0.0008) [2023-10-08 01:09:50,619][52060] Updated weights for policy 0, policy_version 31550 (0.0008) [2023-10-08 01:09:50,830][52059] Updated weights for policy 1, policy_version 31962 (0.0010) [2023-10-08 01:09:51,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 65044480. Throughput: 0: 1725.7, 1: 1749.0. Samples: 16259438. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:09:51,211][50642] Avg episode reward: [(0, '20.230'), (1, '18.050')] [2023-10-08 01:09:54,581][52060] Updated weights for policy 0, policy_version 31560 (0.0008) [2023-10-08 01:09:54,698][52059] Updated weights for policy 1, policy_version 31972 (0.0010) [2023-10-08 01:09:54,950][52060] Updated weights for policy 0, policy_version 31570 (0.0008) [2023-10-08 01:09:55,066][52059] Updated weights for policy 1, policy_version 31982 (0.0009) [2023-10-08 01:09:55,318][52060] Updated weights for policy 0, policy_version 31580 (0.0008) [2023-10-08 01:09:55,428][52059] Updated weights for policy 1, policy_version 31992 (0.0009) [2023-10-08 01:09:56,210][50642] Fps is (10 sec: 16384.5, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 65110016. Throughput: 0: 1710.0, 1: 1749.5. Samples: 16279948. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:09:56,211][50642] Avg episode reward: [(0, '17.670'), (1, '17.250')] [2023-10-08 01:09:59,407][52060] Updated weights for policy 0, policy_version 31590 (0.0008) [2023-10-08 01:09:59,489][52059] Updated weights for policy 1, policy_version 32002 (0.0010) [2023-10-08 01:09:59,780][52060] Updated weights for policy 0, policy_version 31600 (0.0008) [2023-10-08 01:09:59,850][52059] Updated weights for policy 1, policy_version 32012 (0.0009) [2023-10-08 01:10:00,155][52060] Updated weights for policy 0, policy_version 31610 (0.0008) [2023-10-08 01:10:00,215][52059] Updated weights for policy 1, policy_version 32022 (0.0007) [2023-10-08 01:10:00,575][52059] Updated weights for policy 1, policy_version 32032 (0.0008) [2023-10-08 01:10:01,210][50642] Fps is (10 sec: 13106.7, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 65175552. Throughput: 0: 1685.7, 1: 1723.8. Samples: 16299180. Policy #0 lag: (min: 10.0, avg: 15.4, max: 42.0) [2023-10-08 01:10:01,212][50642] Avg episode reward: [(0, '20.150'), (1, '21.010')] [2023-10-08 01:10:04,178][52060] Updated weights for policy 0, policy_version 31620 (0.0008) [2023-10-08 01:10:04,548][52060] Updated weights for policy 0, policy_version 31630 (0.0010) [2023-10-08 01:10:04,584][52059] Updated weights for policy 1, policy_version 32042 (0.0008) [2023-10-08 01:10:04,913][52060] Updated weights for policy 0, policy_version 31640 (0.0008) [2023-10-08 01:10:04,950][52059] Updated weights for policy 1, policy_version 32052 (0.0009) [2023-10-08 01:10:05,315][52059] Updated weights for policy 1, policy_version 32062 (0.0009) [2023-10-08 01:10:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 65241088. Throughput: 0: 1711.2, 1: 1751.1. Samples: 16311052. Policy #0 lag: (min: 10.0, avg: 15.4, max: 42.0) [2023-10-08 01:10:06,211][50642] Avg episode reward: [(0, '19.830'), (1, '16.980')] [2023-10-08 01:10:08,846][52060] Updated weights for policy 0, policy_version 31650 (0.0010) [2023-10-08 01:10:09,208][52060] Updated weights for policy 0, policy_version 31660 (0.0007) [2023-10-08 01:10:09,217][52059] Updated weights for policy 1, policy_version 32072 (0.0007) [2023-10-08 01:10:09,571][52060] Updated weights for policy 0, policy_version 31670 (0.0007) [2023-10-08 01:10:09,581][52059] Updated weights for policy 1, policy_version 32082 (0.0007) [2023-10-08 01:10:09,947][52059] Updated weights for policy 1, policy_version 32092 (0.0007) [2023-10-08 01:10:09,949][52060] Updated weights for policy 0, policy_version 31680 (0.0008) [2023-10-08 01:10:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 65306624. Throughput: 0: 1687.5, 1: 1727.7. Samples: 16330354. Policy #0 lag: (min: 10.0, avg: 15.4, max: 42.0) [2023-10-08 01:10:11,211][50642] Avg episode reward: [(0, '16.710'), (1, '17.300')] [2023-10-08 01:10:13,806][52059] Updated weights for policy 1, policy_version 32102 (0.0009) [2023-10-08 01:10:14,094][52060] Updated weights for policy 0, policy_version 31690 (0.0010) [2023-10-08 01:10:14,167][52059] Updated weights for policy 1, policy_version 32112 (0.0009) [2023-10-08 01:10:14,462][52060] Updated weights for policy 0, policy_version 31700 (0.0009) [2023-10-08 01:10:14,535][52059] Updated weights for policy 1, policy_version 32122 (0.0009) [2023-10-08 01:10:14,838][52060] Updated weights for policy 0, policy_version 31710 (0.0008) [2023-10-08 01:10:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 65372160. Throughput: 0: 1692.9, 1: 1722.5. Samples: 16351262. Policy #0 lag: (min: 10.0, avg: 15.4, max: 42.0) [2023-10-08 01:10:16,211][50642] Avg episode reward: [(0, '21.010'), (1, '19.490')] [2023-10-08 01:10:18,469][52059] Updated weights for policy 1, policy_version 32132 (0.0007) [2023-10-08 01:10:18,710][52060] Updated weights for policy 0, policy_version 31720 (0.0007) [2023-10-08 01:10:18,830][52059] Updated weights for policy 1, policy_version 32142 (0.0008) [2023-10-08 01:10:19,082][52060] Updated weights for policy 0, policy_version 31730 (0.0009) [2023-10-08 01:10:19,194][52059] Updated weights for policy 1, policy_version 32152 (0.0009) [2023-10-08 01:10:19,441][52060] Updated weights for policy 0, policy_version 31740 (0.0008) [2023-10-08 01:10:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 65437696. Throughput: 0: 1700.6, 1: 1737.6. Samples: 16362424. Policy #0 lag: (min: 10.0, avg: 15.4, max: 42.0) [2023-10-08 01:10:21,211][50642] Avg episode reward: [(0, '17.820'), (1, '16.140')] [2023-10-08 01:10:23,252][52059] Updated weights for policy 1, policy_version 32162 (0.0008) [2023-10-08 01:10:23,549][52060] Updated weights for policy 0, policy_version 31750 (0.0008) [2023-10-08 01:10:23,612][52059] Updated weights for policy 1, policy_version 32172 (0.0008) [2023-10-08 01:10:23,922][52060] Updated weights for policy 0, policy_version 31760 (0.0008) [2023-10-08 01:10:23,977][52059] Updated weights for policy 1, policy_version 32182 (0.0010) [2023-10-08 01:10:24,278][52060] Updated weights for policy 0, policy_version 31770 (0.0010) [2023-10-08 01:10:24,333][52059] Updated weights for policy 1, policy_version 32192 (0.0008) [2023-10-08 01:10:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 65503232. Throughput: 0: 1684.9, 1: 1715.2. Samples: 16382026. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 01:10:26,211][50642] Avg episode reward: [(0, '16.780'), (1, '13.950')] [2023-10-08 01:10:28,158][52060] Updated weights for policy 0, policy_version 31780 (0.0009) [2023-10-08 01:10:28,195][52059] Updated weights for policy 1, policy_version 32202 (0.0009) [2023-10-08 01:10:28,520][52060] Updated weights for policy 0, policy_version 31790 (0.0010) [2023-10-08 01:10:28,559][52059] Updated weights for policy 1, policy_version 32212 (0.0008) [2023-10-08 01:10:28,886][52060] Updated weights for policy 0, policy_version 31800 (0.0009) [2023-10-08 01:10:28,926][52059] Updated weights for policy 1, policy_version 32222 (0.0007) [2023-10-08 01:10:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 65568768. Throughput: 0: 1712.0, 1: 1731.4. Samples: 16403316. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 01:10:31,211][50642] Avg episode reward: [(0, '21.510'), (1, '19.370')] [2023-10-08 01:10:31,224][51605] Saving new best policy, reward=21.510! [2023-10-08 01:10:32,911][52060] Updated weights for policy 0, policy_version 31810 (0.0008) [2023-10-08 01:10:32,959][52059] Updated weights for policy 1, policy_version 32232 (0.0008) [2023-10-08 01:10:33,277][52060] Updated weights for policy 0, policy_version 31820 (0.0007) [2023-10-08 01:10:33,320][52059] Updated weights for policy 1, policy_version 32242 (0.0010) [2023-10-08 01:10:33,647][52060] Updated weights for policy 0, policy_version 31830 (0.0007) [2023-10-08 01:10:33,687][52059] Updated weights for policy 1, policy_version 32252 (0.0008) [2023-10-08 01:10:34,001][52060] Updated weights for policy 0, policy_version 31840 (0.0010) [2023-10-08 01:10:36,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 65634304. Throughput: 0: 1690.8, 1: 1716.4. Samples: 16412766. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 01:10:36,211][50642] Avg episode reward: [(0, '18.180'), (1, '17.960')] [2023-10-08 01:10:37,598][52059] Updated weights for policy 1, policy_version 32262 (0.0008) [2023-10-08 01:10:37,841][52060] Updated weights for policy 0, policy_version 31850 (0.0007) [2023-10-08 01:10:37,966][52059] Updated weights for policy 1, policy_version 32272 (0.0008) [2023-10-08 01:10:38,205][52060] Updated weights for policy 0, policy_version 31860 (0.0008) [2023-10-08 01:10:38,322][52059] Updated weights for policy 1, policy_version 32282 (0.0007) [2023-10-08 01:10:38,570][52060] Updated weights for policy 0, policy_version 31870 (0.0009) [2023-10-08 01:10:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 65699840. Throughput: 0: 1700.7, 1: 1718.9. Samples: 16433832. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 01:10:41,211][50642] Avg episode reward: [(0, '17.530'), (1, '17.110')] [2023-10-08 01:10:42,325][52059] Updated weights for policy 1, policy_version 32292 (0.0009) [2023-10-08 01:10:42,692][52059] Updated weights for policy 1, policy_version 32302 (0.0007) [2023-10-08 01:10:42,744][52060] Updated weights for policy 0, policy_version 31880 (0.0010) [2023-10-08 01:10:43,050][52059] Updated weights for policy 1, policy_version 32312 (0.0007) [2023-10-08 01:10:43,115][52060] Updated weights for policy 0, policy_version 31890 (0.0008) [2023-10-08 01:10:43,480][52060] Updated weights for policy 0, policy_version 31900 (0.0007) [2023-10-08 01:10:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 65765376. Throughput: 0: 1716.5, 1: 1741.4. Samples: 16454784. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 01:10:46,211][50642] Avg episode reward: [(0, '21.260'), (1, '18.850')] [2023-10-08 01:10:46,218][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000032320_33095680.pth... [2023-10-08 01:10:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000031904_32669696.pth... [2023-10-08 01:10:46,248][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000030720_31457280.pth [2023-10-08 01:10:46,252][51710] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p1/milestones/checkpoint_000032320_33095680.pth [2023-10-08 01:10:46,252][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000030304_31031296.pth [2023-10-08 01:10:46,256][51605] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p0/milestones/checkpoint_000031904_32669696.pth [2023-10-08 01:10:47,041][52059] Updated weights for policy 1, policy_version 32322 (0.0008) [2023-10-08 01:10:47,406][52059] Updated weights for policy 1, policy_version 32332 (0.0008) [2023-10-08 01:10:47,422][52060] Updated weights for policy 0, policy_version 31910 (0.0009) [2023-10-08 01:10:47,769][52059] Updated weights for policy 1, policy_version 32342 (0.0008) [2023-10-08 01:10:47,792][52060] Updated weights for policy 0, policy_version 31920 (0.0009) [2023-10-08 01:10:48,133][52059] Updated weights for policy 1, policy_version 32352 (0.0007) [2023-10-08 01:10:48,163][52060] Updated weights for policy 0, policy_version 31930 (0.0009) [2023-10-08 01:10:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 65830912. Throughput: 0: 1689.6, 1: 1710.5. Samples: 16464058. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:10:51,211][50642] Avg episode reward: [(0, '17.760'), (1, '20.950')] [2023-10-08 01:10:52,179][52059] Updated weights for policy 1, policy_version 32362 (0.0009) [2023-10-08 01:10:52,198][52060] Updated weights for policy 0, policy_version 31940 (0.0009) [2023-10-08 01:10:52,554][52059] Updated weights for policy 1, policy_version 32372 (0.0007) [2023-10-08 01:10:52,568][52060] Updated weights for policy 0, policy_version 31950 (0.0010) [2023-10-08 01:10:52,925][52060] Updated weights for policy 0, policy_version 31960 (0.0008) [2023-10-08 01:10:52,929][52059] Updated weights for policy 1, policy_version 32382 (0.0009) [2023-10-08 01:10:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 65896448. Throughput: 0: 1712.3, 1: 1724.2. Samples: 16484994. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:10:56,211][50642] Avg episode reward: [(0, '18.400'), (1, '17.750')] [2023-10-08 01:10:56,887][52059] Updated weights for policy 1, policy_version 32392 (0.0007) [2023-10-08 01:10:57,102][52060] Updated weights for policy 0, policy_version 31970 (0.0007) [2023-10-08 01:10:57,249][52059] Updated weights for policy 1, policy_version 32402 (0.0007) [2023-10-08 01:10:57,474][52060] Updated weights for policy 0, policy_version 31980 (0.0007) [2023-10-08 01:10:57,615][52059] Updated weights for policy 1, policy_version 32412 (0.0007) [2023-10-08 01:10:57,829][52060] Updated weights for policy 0, policy_version 31990 (0.0008) [2023-10-08 01:10:58,196][52060] Updated weights for policy 0, policy_version 32000 (0.0008) [2023-10-08 01:11:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13773.7). Total num frames: 65961984. Throughput: 0: 1709.4, 1: 1726.7. Samples: 16505888. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:11:01,211][50642] Avg episode reward: [(0, '21.140'), (1, '19.510')] [2023-10-08 01:11:01,562][52059] Updated weights for policy 1, policy_version 32422 (0.0008) [2023-10-08 01:11:01,928][52059] Updated weights for policy 1, policy_version 32432 (0.0007) [2023-10-08 01:11:02,296][52059] Updated weights for policy 1, policy_version 32442 (0.0007) [2023-10-08 01:11:02,532][52060] Updated weights for policy 0, policy_version 32010 (0.0007) [2023-10-08 01:11:02,901][52060] Updated weights for policy 0, policy_version 32020 (0.0008) [2023-10-08 01:11:03,263][52060] Updated weights for policy 0, policy_version 32030 (0.0009) [2023-10-08 01:11:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 66027520. Throughput: 0: 1684.8, 1: 1708.4. Samples: 16515118. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:11:06,211][50642] Avg episode reward: [(0, '18.120'), (1, '19.640')] [2023-10-08 01:11:06,250][52059] Updated weights for policy 1, policy_version 32452 (0.0007) [2023-10-08 01:11:06,617][52059] Updated weights for policy 1, policy_version 32462 (0.0007) [2023-10-08 01:11:06,990][52059] Updated weights for policy 1, policy_version 32472 (0.0009) [2023-10-08 01:11:07,104][52060] Updated weights for policy 0, policy_version 32040 (0.0008) [2023-10-08 01:11:07,474][52060] Updated weights for policy 0, policy_version 32050 (0.0008) [2023-10-08 01:11:07,839][52060] Updated weights for policy 0, policy_version 32060 (0.0008) [2023-10-08 01:11:10,878][52059] Updated weights for policy 1, policy_version 32482 (0.0008) [2023-10-08 01:11:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13773.7). Total num frames: 66093056. Throughput: 0: 1708.3, 1: 1727.3. Samples: 16536624. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:11:11,211][50642] Avg episode reward: [(0, '19.930'), (1, '17.660')] [2023-10-08 01:11:11,248][52059] Updated weights for policy 1, policy_version 32492 (0.0007) [2023-10-08 01:11:11,610][52059] Updated weights for policy 1, policy_version 32502 (0.0009) [2023-10-08 01:11:11,802][52060] Updated weights for policy 0, policy_version 32070 (0.0008) [2023-10-08 01:11:11,972][52059] Updated weights for policy 1, policy_version 32512 (0.0007) [2023-10-08 01:11:12,170][52060] Updated weights for policy 0, policy_version 32080 (0.0010) [2023-10-08 01:11:12,545][52060] Updated weights for policy 0, policy_version 32090 (0.0010) [2023-10-08 01:11:15,874][52059] Updated weights for policy 1, policy_version 32522 (0.0009) [2023-10-08 01:11:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 66158592. Throughput: 0: 1707.7, 1: 1720.6. Samples: 16557586. Policy #0 lag: (min: 9.0, avg: 16.0, max: 41.0) [2023-10-08 01:11:16,211][50642] Avg episode reward: [(0, '18.780'), (1, '17.790')] [2023-10-08 01:11:16,236][52059] Updated weights for policy 1, policy_version 32532 (0.0009) [2023-10-08 01:11:16,589][52060] Updated weights for policy 0, policy_version 32100 (0.0008) [2023-10-08 01:11:16,594][52059] Updated weights for policy 1, policy_version 32542 (0.0007) [2023-10-08 01:11:16,963][52060] Updated weights for policy 0, policy_version 32110 (0.0008) [2023-10-08 01:11:17,319][52060] Updated weights for policy 0, policy_version 32120 (0.0007) [2023-10-08 01:11:20,745][52059] Updated weights for policy 1, policy_version 32552 (0.0008) [2023-10-08 01:11:21,117][52059] Updated weights for policy 1, policy_version 32562 (0.0010) [2023-10-08 01:11:21,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 66224128. Throughput: 0: 1702.7, 1: 1729.9. Samples: 16567234. Policy #0 lag: (min: 9.0, avg: 16.0, max: 41.0) [2023-10-08 01:11:21,211][50642] Avg episode reward: [(0, '17.480'), (1, '18.340')] [2023-10-08 01:11:21,255][52060] Updated weights for policy 0, policy_version 32130 (0.0007) [2023-10-08 01:11:21,486][52059] Updated weights for policy 1, policy_version 32572 (0.0008) [2023-10-08 01:11:21,624][52060] Updated weights for policy 0, policy_version 32140 (0.0009) [2023-10-08 01:11:21,993][52060] Updated weights for policy 0, policy_version 32150 (0.0008) [2023-10-08 01:11:22,369][52060] Updated weights for policy 0, policy_version 32160 (0.0010) [2023-10-08 01:11:25,502][52059] Updated weights for policy 1, policy_version 32582 (0.0009) [2023-10-08 01:11:25,865][52059] Updated weights for policy 1, policy_version 32592 (0.0010) [2023-10-08 01:11:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 66289664. Throughput: 0: 1704.9, 1: 1724.6. Samples: 16588160. Policy #0 lag: (min: 9.0, avg: 16.0, max: 41.0) [2023-10-08 01:11:26,211][50642] Avg episode reward: [(0, '20.230'), (1, '15.220')] [2023-10-08 01:11:26,223][52059] Updated weights for policy 1, policy_version 32602 (0.0007) [2023-10-08 01:11:26,333][52060] Updated weights for policy 0, policy_version 32170 (0.0009) [2023-10-08 01:11:26,708][52060] Updated weights for policy 0, policy_version 32180 (0.0007) [2023-10-08 01:11:27,082][52060] Updated weights for policy 0, policy_version 32190 (0.0010) [2023-10-08 01:11:30,099][52059] Updated weights for policy 1, policy_version 32612 (0.0008) [2023-10-08 01:11:30,465][52059] Updated weights for policy 1, policy_version 32622 (0.0010) [2023-10-08 01:11:30,823][52059] Updated weights for policy 1, policy_version 32632 (0.0009) [2023-10-08 01:11:31,022][52060] Updated weights for policy 0, policy_version 32200 (0.0008) [2023-10-08 01:11:31,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 66387968. Throughput: 0: 1707.7, 1: 1705.7. Samples: 16608388. Policy #0 lag: (min: 9.0, avg: 16.0, max: 41.0) [2023-10-08 01:11:31,211][50642] Avg episode reward: [(0, '18.080'), (1, '16.600')] [2023-10-08 01:11:31,394][52060] Updated weights for policy 0, policy_version 32210 (0.0009) [2023-10-08 01:11:31,757][52060] Updated weights for policy 0, policy_version 32220 (0.0010) [2023-10-08 01:11:34,650][52059] Updated weights for policy 1, policy_version 32642 (0.0007) [2023-10-08 01:11:35,019][52059] Updated weights for policy 1, policy_version 32652 (0.0007) [2023-10-08 01:11:35,376][52059] Updated weights for policy 1, policy_version 32662 (0.0009) [2023-10-08 01:11:35,729][52059] Updated weights for policy 1, policy_version 32672 (0.0009) [2023-10-08 01:11:35,734][52060] Updated weights for policy 0, policy_version 32230 (0.0009) [2023-10-08 01:11:36,097][52060] Updated weights for policy 0, policy_version 32240 (0.0011) [2023-10-08 01:11:36,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 66453504. Throughput: 0: 1710.8, 1: 1733.8. Samples: 16619068. Policy #0 lag: (min: 9.0, avg: 16.0, max: 41.0) [2023-10-08 01:11:36,211][50642] Avg episode reward: [(0, '17.830'), (1, '19.890')] [2023-10-08 01:11:36,471][52060] Updated weights for policy 0, policy_version 32250 (0.0008) [2023-10-08 01:11:39,726][52059] Updated weights for policy 1, policy_version 32682 (0.0008) [2023-10-08 01:11:40,104][52059] Updated weights for policy 1, policy_version 32692 (0.0008) [2023-10-08 01:11:40,299][52060] Updated weights for policy 0, policy_version 32260 (0.0008) [2023-10-08 01:11:40,470][52059] Updated weights for policy 1, policy_version 32702 (0.0007) [2023-10-08 01:11:40,665][52060] Updated weights for policy 0, policy_version 32270 (0.0008) [2023-10-08 01:11:41,039][52060] Updated weights for policy 0, policy_version 32280 (0.0007) [2023-10-08 01:11:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 66519040. Throughput: 0: 1715.5, 1: 1728.9. Samples: 16639990. Policy #0 lag: (min: 31.0, avg: 33.0, max: 62.0) [2023-10-08 01:11:41,211][50642] Avg episode reward: [(0, '21.370'), (1, '18.230')] [2023-10-08 01:11:44,223][52059] Updated weights for policy 1, policy_version 32712 (0.0008) [2023-10-08 01:11:44,595][52059] Updated weights for policy 1, policy_version 32722 (0.0007) [2023-10-08 01:11:44,955][52059] Updated weights for policy 1, policy_version 32732 (0.0007) [2023-10-08 01:11:45,190][52060] Updated weights for policy 0, policy_version 32290 (0.0007) [2023-10-08 01:11:45,554][52060] Updated weights for policy 0, policy_version 32300 (0.0008) [2023-10-08 01:11:45,921][52060] Updated weights for policy 0, policy_version 32310 (0.0009) [2023-10-08 01:11:46,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 66584576. Throughput: 0: 1703.8, 1: 1717.7. Samples: 16659856. Policy #0 lag: (min: 31.0, avg: 33.0, max: 62.0) [2023-10-08 01:11:46,211][50642] Avg episode reward: [(0, '19.770'), (1, '18.860')] [2023-10-08 01:11:46,288][52060] Updated weights for policy 0, policy_version 32320 (0.0008) [2023-10-08 01:11:48,904][52059] Updated weights for policy 1, policy_version 32742 (0.0009) [2023-10-08 01:11:49,254][52059] Updated weights for policy 1, policy_version 32752 (0.0010) [2023-10-08 01:11:49,616][52059] Updated weights for policy 1, policy_version 32762 (0.0008) [2023-10-08 01:11:50,426][52060] Updated weights for policy 0, policy_version 32330 (0.0010) [2023-10-08 01:11:50,801][52060] Updated weights for policy 0, policy_version 32340 (0.0009) [2023-10-08 01:11:51,167][52060] Updated weights for policy 0, policy_version 32350 (0.0008) [2023-10-08 01:11:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 66650112. Throughput: 0: 1725.2, 1: 1743.6. Samples: 16671210. Policy #0 lag: (min: 31.0, avg: 33.0, max: 62.0) [2023-10-08 01:11:51,211][50642] Avg episode reward: [(0, '18.310'), (1, '21.790')] [2023-10-08 01:11:53,685][52059] Updated weights for policy 1, policy_version 32772 (0.0009) [2023-10-08 01:11:54,042][52059] Updated weights for policy 1, policy_version 32782 (0.0007) [2023-10-08 01:11:54,404][52059] Updated weights for policy 1, policy_version 32792 (0.0007) [2023-10-08 01:11:55,115][52060] Updated weights for policy 0, policy_version 32360 (0.0007) [2023-10-08 01:11:55,482][52060] Updated weights for policy 0, policy_version 32370 (0.0008) [2023-10-08 01:11:55,854][52060] Updated weights for policy 0, policy_version 32380 (0.0009) [2023-10-08 01:11:56,210][50642] Fps is (10 sec: 16384.5, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 66748416. Throughput: 0: 1717.8, 1: 1716.0. Samples: 16691144. Policy #0 lag: (min: 31.0, avg: 33.0, max: 62.0) [2023-10-08 01:11:56,211][50642] Avg episode reward: [(0, '22.100'), (1, '18.210')] [2023-10-08 01:11:56,211][51605] Saving new best policy, reward=22.100! [2023-10-08 01:11:58,088][52059] Updated weights for policy 1, policy_version 32802 (0.0008) [2023-10-08 01:11:58,460][52059] Updated weights for policy 1, policy_version 32812 (0.0008) [2023-10-08 01:11:58,818][52059] Updated weights for policy 1, policy_version 32822 (0.0009) [2023-10-08 01:11:59,179][52059] Updated weights for policy 1, policy_version 32832 (0.0009) [2023-10-08 01:11:59,589][52060] Updated weights for policy 0, policy_version 32390 (0.0007) [2023-10-08 01:11:59,961][52060] Updated weights for policy 0, policy_version 32400 (0.0009) [2023-10-08 01:12:00,332][52060] Updated weights for policy 0, policy_version 32410 (0.0010) [2023-10-08 01:12:01,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 66813952. Throughput: 0: 1690.4, 1: 1730.0. Samples: 16711508. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 01:12:01,211][50642] Avg episode reward: [(0, '19.210'), (1, '17.490')] [2023-10-08 01:12:03,155][52059] Updated weights for policy 1, policy_version 32842 (0.0007) [2023-10-08 01:12:03,526][52059] Updated weights for policy 1, policy_version 32852 (0.0009) [2023-10-08 01:12:03,876][52059] Updated weights for policy 1, policy_version 32862 (0.0010) [2023-10-08 01:12:04,283][52060] Updated weights for policy 0, policy_version 32420 (0.0008) [2023-10-08 01:12:04,651][52060] Updated weights for policy 0, policy_version 32430 (0.0008) [2023-10-08 01:12:05,014][52060] Updated weights for policy 0, policy_version 32440 (0.0007) [2023-10-08 01:12:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 66879488. Throughput: 0: 1722.9, 1: 1724.1. Samples: 16722344. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 01:12:06,211][50642] Avg episode reward: [(0, '18.590'), (1, '19.580')] [2023-10-08 01:12:07,953][52059] Updated weights for policy 1, policy_version 32872 (0.0009) [2023-10-08 01:12:08,335][52059] Updated weights for policy 1, policy_version 32882 (0.0007) [2023-10-08 01:12:08,712][52059] Updated weights for policy 1, policy_version 32892 (0.0008) [2023-10-08 01:12:09,075][52060] Updated weights for policy 0, policy_version 32450 (0.0009) [2023-10-08 01:12:09,440][52060] Updated weights for policy 0, policy_version 32460 (0.0011) [2023-10-08 01:12:09,811][52060] Updated weights for policy 0, policy_version 32470 (0.0007) [2023-10-08 01:12:10,173][52060] Updated weights for policy 0, policy_version 32480 (0.0009) [2023-10-08 01:12:11,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 66945024. Throughput: 0: 1705.1, 1: 1727.3. Samples: 16742616. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 01:12:11,211][50642] Avg episode reward: [(0, '22.020'), (1, '18.250')] [2023-10-08 01:12:12,586][52059] Updated weights for policy 1, policy_version 32902 (0.0009) [2023-10-08 01:12:12,958][52059] Updated weights for policy 1, policy_version 32912 (0.0008) [2023-10-08 01:12:13,322][52059] Updated weights for policy 1, policy_version 32922 (0.0007) [2023-10-08 01:12:14,074][52060] Updated weights for policy 0, policy_version 32490 (0.0007) [2023-10-08 01:12:14,449][52060] Updated weights for policy 0, policy_version 32500 (0.0007) [2023-10-08 01:12:14,814][52060] Updated weights for policy 0, policy_version 32510 (0.0008) [2023-10-08 01:12:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 67010560. Throughput: 0: 1699.2, 1: 1747.3. Samples: 16763482. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 01:12:16,211][50642] Avg episode reward: [(0, '18.580'), (1, '17.970')] [2023-10-08 01:12:17,245][52059] Updated weights for policy 1, policy_version 32932 (0.0009) [2023-10-08 01:12:17,619][52059] Updated weights for policy 1, policy_version 32942 (0.0007) [2023-10-08 01:12:17,976][52059] Updated weights for policy 1, policy_version 32952 (0.0011) [2023-10-08 01:12:18,753][52060] Updated weights for policy 0, policy_version 32520 (0.0009) [2023-10-08 01:12:19,122][52060] Updated weights for policy 0, policy_version 32530 (0.0009) [2023-10-08 01:12:19,497][52060] Updated weights for policy 0, policy_version 32540 (0.0009) [2023-10-08 01:12:21,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 67076096. Throughput: 0: 1718.6, 1: 1721.0. Samples: 16773852. Policy #0 lag: (min: 29.0, avg: 29.0, max: 29.0) [2023-10-08 01:12:21,211][50642] Avg episode reward: [(0, '18.380'), (1, '19.460')] [2023-10-08 01:12:21,807][52059] Updated weights for policy 1, policy_version 32962 (0.0010) [2023-10-08 01:12:22,184][52059] Updated weights for policy 1, policy_version 32972 (0.0008) [2023-10-08 01:12:22,559][52059] Updated weights for policy 1, policy_version 32982 (0.0007) [2023-10-08 01:12:22,919][52059] Updated weights for policy 1, policy_version 32992 (0.0007) [2023-10-08 01:12:23,485][52060] Updated weights for policy 0, policy_version 32550 (0.0007) [2023-10-08 01:12:23,859][52060] Updated weights for policy 0, policy_version 32560 (0.0007) [2023-10-08 01:12:24,215][52060] Updated weights for policy 0, policy_version 32570 (0.0009) [2023-10-08 01:12:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 67141632. Throughput: 0: 1695.5, 1: 1735.9. Samples: 16794404. Policy #0 lag: (min: 1.0, avg: 11.5, max: 33.0) [2023-10-08 01:12:26,211][50642] Avg episode reward: [(0, '21.960'), (1, '18.870')] [2023-10-08 01:12:26,896][52059] Updated weights for policy 1, policy_version 33002 (0.0010) [2023-10-08 01:12:27,269][52059] Updated weights for policy 1, policy_version 33012 (0.0010) [2023-10-08 01:12:27,639][52059] Updated weights for policy 1, policy_version 33022 (0.0007) [2023-10-08 01:12:28,222][52060] Updated weights for policy 0, policy_version 32580 (0.0007) [2023-10-08 01:12:28,593][52060] Updated weights for policy 0, policy_version 32590 (0.0007) [2023-10-08 01:12:28,967][52060] Updated weights for policy 0, policy_version 32600 (0.0008) [2023-10-08 01:12:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 67207168. Throughput: 0: 1712.9, 1: 1745.8. Samples: 16815496. Policy #0 lag: (min: 1.0, avg: 11.5, max: 33.0) [2023-10-08 01:12:31,211][50642] Avg episode reward: [(0, '18.270'), (1, '19.630')] [2023-10-08 01:12:31,563][52059] Updated weights for policy 1, policy_version 33032 (0.0010) [2023-10-08 01:12:31,934][52059] Updated weights for policy 1, policy_version 33042 (0.0009) [2023-10-08 01:12:32,306][52059] Updated weights for policy 1, policy_version 33052 (0.0009) [2023-10-08 01:12:32,900][52060] Updated weights for policy 0, policy_version 32610 (0.0010) [2023-10-08 01:12:33,260][52060] Updated weights for policy 0, policy_version 32620 (0.0008) [2023-10-08 01:12:33,634][52060] Updated weights for policy 0, policy_version 32630 (0.0009) [2023-10-08 01:12:34,004][52060] Updated weights for policy 0, policy_version 32640 (0.0010) [2023-10-08 01:12:36,167][52059] Updated weights for policy 1, policy_version 33062 (0.0007) [2023-10-08 01:12:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 67272704. Throughput: 0: 1701.1, 1: 1718.5. Samples: 16825094. Policy #0 lag: (min: 1.0, avg: 11.5, max: 33.0) [2023-10-08 01:12:36,211][50642] Avg episode reward: [(0, '18.430'), (1, '19.700')] [2023-10-08 01:12:36,534][52059] Updated weights for policy 1, policy_version 33072 (0.0007) [2023-10-08 01:12:36,895][52059] Updated weights for policy 1, policy_version 33082 (0.0009) [2023-10-08 01:12:37,769][52060] Updated weights for policy 0, policy_version 32650 (0.0007) [2023-10-08 01:12:38,140][52060] Updated weights for policy 0, policy_version 32660 (0.0008) [2023-10-08 01:12:38,513][52060] Updated weights for policy 0, policy_version 32670 (0.0008) [2023-10-08 01:12:40,880][52059] Updated weights for policy 1, policy_version 33092 (0.0008) [2023-10-08 01:12:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 67338240. Throughput: 0: 1702.4, 1: 1745.3. Samples: 16846294. Policy #0 lag: (min: 1.0, avg: 11.5, max: 33.0) [2023-10-08 01:12:41,211][50642] Avg episode reward: [(0, '22.400'), (1, '22.080')] [2023-10-08 01:12:41,211][51605] Saving new best policy, reward=22.400! [2023-10-08 01:12:41,243][52059] Updated weights for policy 1, policy_version 33102 (0.0009) [2023-10-08 01:12:41,613][52059] Updated weights for policy 1, policy_version 33112 (0.0008) [2023-10-08 01:12:42,409][52060] Updated weights for policy 0, policy_version 32680 (0.0007) [2023-10-08 01:12:42,771][52060] Updated weights for policy 0, policy_version 32690 (0.0007) [2023-10-08 01:12:43,142][52060] Updated weights for policy 0, policy_version 32700 (0.0007) [2023-10-08 01:12:45,438][52059] Updated weights for policy 1, policy_version 33122 (0.0008) [2023-10-08 01:12:45,809][52059] Updated weights for policy 1, policy_version 33132 (0.0008) [2023-10-08 01:12:46,164][52059] Updated weights for policy 1, policy_version 33142 (0.0010) [2023-10-08 01:12:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 67403776. Throughput: 0: 1731.9, 1: 1725.7. Samples: 16867102. Policy #0 lag: (min: 1.0, avg: 11.5, max: 33.0) [2023-10-08 01:12:46,211][50642] Avg episode reward: [(0, '18.470'), (1, '19.940')] [2023-10-08 01:12:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000032704_33488896.pth... [2023-10-08 01:12:46,252][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000031104_31850496.pth [2023-10-08 01:12:46,526][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000033152_33947648.pth... [2023-10-08 01:12:46,531][52059] Updated weights for policy 1, policy_version 33152 (0.0010) [2023-10-08 01:12:46,555][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000031520_32276480.pth [2023-10-08 01:12:47,074][52060] Updated weights for policy 0, policy_version 32710 (0.0007) [2023-10-08 01:12:47,447][52060] Updated weights for policy 0, policy_version 32720 (0.0010) [2023-10-08 01:12:47,830][52060] Updated weights for policy 0, policy_version 32730 (0.0010) [2023-10-08 01:12:50,633][52059] Updated weights for policy 1, policy_version 33162 (0.0008) [2023-10-08 01:12:50,997][52059] Updated weights for policy 1, policy_version 33172 (0.0009) [2023-10-08 01:12:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 67469312. Throughput: 0: 1704.1, 1: 1734.0. Samples: 16877062. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:12:51,211][50642] Avg episode reward: [(0, '18.150'), (1, '19.060')] [2023-10-08 01:12:51,364][52059] Updated weights for policy 1, policy_version 33182 (0.0009) [2023-10-08 01:12:51,861][52060] Updated weights for policy 0, policy_version 32740 (0.0010) [2023-10-08 01:12:52,235][52060] Updated weights for policy 0, policy_version 32750 (0.0008) [2023-10-08 01:12:52,601][52060] Updated weights for policy 0, policy_version 32760 (0.0007) [2023-10-08 01:12:55,316][52059] Updated weights for policy 1, policy_version 33192 (0.0008) [2023-10-08 01:12:55,693][52059] Updated weights for policy 1, policy_version 33202 (0.0007) [2023-10-08 01:12:56,067][52059] Updated weights for policy 1, policy_version 33212 (0.0007) [2023-10-08 01:12:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 67534848. Throughput: 0: 1726.2, 1: 1742.8. Samples: 16898722. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:12:56,211][50642] Avg episode reward: [(0, '23.070'), (1, '19.100')] [2023-10-08 01:12:56,213][51605] Saving new best policy, reward=23.070! [2023-10-08 01:12:56,608][52060] Updated weights for policy 0, policy_version 32770 (0.0008) [2023-10-08 01:12:56,974][52060] Updated weights for policy 0, policy_version 32780 (0.0010) [2023-10-08 01:12:57,344][52060] Updated weights for policy 0, policy_version 32790 (0.0010) [2023-10-08 01:12:57,706][52060] Updated weights for policy 0, policy_version 32800 (0.0011) [2023-10-08 01:12:59,908][52059] Updated weights for policy 1, policy_version 33222 (0.0009) [2023-10-08 01:13:00,262][52059] Updated weights for policy 1, policy_version 33232 (0.0008) [2023-10-08 01:13:00,630][52059] Updated weights for policy 1, policy_version 33242 (0.0009) [2023-10-08 01:13:01,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 67633152. Throughput: 0: 1732.9, 1: 1712.5. Samples: 16918526. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:13:01,211][50642] Avg episode reward: [(0, '18.160'), (1, '20.280')] [2023-10-08 01:13:01,648][52060] Updated weights for policy 0, policy_version 32810 (0.0007) [2023-10-08 01:13:02,022][52060] Updated weights for policy 0, policy_version 32820 (0.0010) [2023-10-08 01:13:02,397][52060] Updated weights for policy 0, policy_version 32830 (0.0010) [2023-10-08 01:13:04,523][52059] Updated weights for policy 1, policy_version 33252 (0.0008) [2023-10-08 01:13:04,890][52059] Updated weights for policy 1, policy_version 33262 (0.0007) [2023-10-08 01:13:05,254][52059] Updated weights for policy 1, policy_version 33272 (0.0008) [2023-10-08 01:13:06,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 67698688. Throughput: 0: 1713.9, 1: 1742.7. Samples: 16929396. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:13:06,211][50642] Avg episode reward: [(0, '18.330'), (1, '16.730')] [2023-10-08 01:13:06,373][52060] Updated weights for policy 0, policy_version 32840 (0.0008) [2023-10-08 01:13:06,749][52060] Updated weights for policy 0, policy_version 32850 (0.0007) [2023-10-08 01:13:07,120][52060] Updated weights for policy 0, policy_version 32860 (0.0010) [2023-10-08 01:13:08,977][52059] Updated weights for policy 1, policy_version 33282 (0.0009) [2023-10-08 01:13:09,343][52059] Updated weights for policy 1, policy_version 33292 (0.0009) [2023-10-08 01:13:09,711][52059] Updated weights for policy 1, policy_version 33302 (0.0010) [2023-10-08 01:13:10,083][52059] Updated weights for policy 1, policy_version 33312 (0.0010) [2023-10-08 01:13:11,009][52060] Updated weights for policy 0, policy_version 32870 (0.0010) [2023-10-08 01:13:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 67764224. Throughput: 0: 1734.6, 1: 1726.1. Samples: 16950138. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:13:11,211][50642] Avg episode reward: [(0, '22.660'), (1, '18.120')] [2023-10-08 01:13:11,372][52060] Updated weights for policy 0, policy_version 32880 (0.0011) [2023-10-08 01:13:11,745][52060] Updated weights for policy 0, policy_version 32890 (0.0007) [2023-10-08 01:13:13,924][52059] Updated weights for policy 1, policy_version 33322 (0.0008) [2023-10-08 01:13:14,284][52059] Updated weights for policy 1, policy_version 33332 (0.0010) [2023-10-08 01:13:14,645][52059] Updated weights for policy 1, policy_version 33342 (0.0007) [2023-10-08 01:13:15,754][52060] Updated weights for policy 0, policy_version 32900 (0.0009) [2023-10-08 01:13:16,127][52060] Updated weights for policy 0, policy_version 32910 (0.0009) [2023-10-08 01:13:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 67829760. Throughput: 0: 1730.8, 1: 1725.0. Samples: 16971004. Policy #0 lag: (min: 17.0, avg: 29.1, max: 49.0) [2023-10-08 01:13:16,211][50642] Avg episode reward: [(0, '18.220'), (1, '21.100')] [2023-10-08 01:13:16,506][52060] Updated weights for policy 0, policy_version 32920 (0.0009) [2023-10-08 01:13:18,592][52059] Updated weights for policy 1, policy_version 33352 (0.0009) [2023-10-08 01:13:18,946][52059] Updated weights for policy 1, policy_version 33362 (0.0008) [2023-10-08 01:13:19,311][52059] Updated weights for policy 1, policy_version 33372 (0.0007) [2023-10-08 01:13:20,435][52060] Updated weights for policy 0, policy_version 32930 (0.0010) [2023-10-08 01:13:20,801][52060] Updated weights for policy 0, policy_version 32940 (0.0010) [2023-10-08 01:13:21,171][52060] Updated weights for policy 0, policy_version 32950 (0.0009) [2023-10-08 01:13:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 67895296. Throughput: 0: 1733.5, 1: 1741.8. Samples: 16981480. Policy #0 lag: (min: 17.0, avg: 29.1, max: 49.0) [2023-10-08 01:13:21,211][50642] Avg episode reward: [(0, '18.970'), (1, '19.150')] [2023-10-08 01:13:21,549][52060] Updated weights for policy 0, policy_version 32960 (0.0008) [2023-10-08 01:13:23,257][52059] Updated weights for policy 1, policy_version 33382 (0.0008) [2023-10-08 01:13:23,613][52059] Updated weights for policy 1, policy_version 33392 (0.0008) [2023-10-08 01:13:23,982][52059] Updated weights for policy 1, policy_version 33402 (0.0008) [2023-10-08 01:13:25,676][52060] Updated weights for policy 0, policy_version 32970 (0.0009) [2023-10-08 01:13:26,048][52060] Updated weights for policy 0, policy_version 32980 (0.0010) [2023-10-08 01:13:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 67960832. Throughput: 0: 1733.5, 1: 1726.2. Samples: 17001980. Policy #0 lag: (min: 17.0, avg: 29.1, max: 49.0) [2023-10-08 01:13:26,211][50642] Avg episode reward: [(0, '22.230'), (1, '18.430')] [2023-10-08 01:13:26,411][52060] Updated weights for policy 0, policy_version 32990 (0.0009) [2023-10-08 01:13:27,838][52059] Updated weights for policy 1, policy_version 33412 (0.0009) [2023-10-08 01:13:28,210][52059] Updated weights for policy 1, policy_version 33422 (0.0009) [2023-10-08 01:13:28,589][52059] Updated weights for policy 1, policy_version 33432 (0.0009) [2023-10-08 01:13:30,213][52060] Updated weights for policy 0, policy_version 33000 (0.0009) [2023-10-08 01:13:30,582][52060] Updated weights for policy 0, policy_version 33010 (0.0011) [2023-10-08 01:13:30,938][52060] Updated weights for policy 0, policy_version 33020 (0.0011) [2023-10-08 01:13:31,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 68059136. Throughput: 0: 1711.2, 1: 1742.1. Samples: 17022502. Policy #0 lag: (min: 17.0, avg: 29.1, max: 49.0) [2023-10-08 01:13:31,211][50642] Avg episode reward: [(0, '17.970'), (1, '22.690')] [2023-10-08 01:13:31,223][51710] Saving new best policy, reward=22.690! [2023-10-08 01:13:32,616][52059] Updated weights for policy 1, policy_version 33442 (0.0009) [2023-10-08 01:13:32,986][52059] Updated weights for policy 1, policy_version 33452 (0.0008) [2023-10-08 01:13:33,338][52059] Updated weights for policy 1, policy_version 33462 (0.0010) [2023-10-08 01:13:33,705][52059] Updated weights for policy 1, policy_version 33472 (0.0009) [2023-10-08 01:13:34,951][52060] Updated weights for policy 0, policy_version 33030 (0.0010) [2023-10-08 01:13:35,315][52060] Updated weights for policy 0, policy_version 33040 (0.0009) [2023-10-08 01:13:35,683][52060] Updated weights for policy 0, policy_version 33050 (0.0009) [2023-10-08 01:13:36,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 68124672. Throughput: 0: 1729.3, 1: 1732.4. Samples: 17032842. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:13:36,211][50642] Avg episode reward: [(0, '18.370'), (1, '19.200')] [2023-10-08 01:13:37,623][52059] Updated weights for policy 1, policy_version 33482 (0.0010) [2023-10-08 01:13:37,994][52059] Updated weights for policy 1, policy_version 33492 (0.0008) [2023-10-08 01:13:38,359][52059] Updated weights for policy 1, policy_version 33502 (0.0010) [2023-10-08 01:13:39,699][52060] Updated weights for policy 0, policy_version 33060 (0.0008) [2023-10-08 01:13:40,067][52060] Updated weights for policy 0, policy_version 33070 (0.0010) [2023-10-08 01:13:40,431][52060] Updated weights for policy 0, policy_version 33080 (0.0009) [2023-10-08 01:13:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 68190208. Throughput: 0: 1718.2, 1: 1727.0. Samples: 17053756. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:13:41,211][50642] Avg episode reward: [(0, '22.520'), (1, '16.920')] [2023-10-08 01:13:42,381][52059] Updated weights for policy 1, policy_version 33512 (0.0008) [2023-10-08 01:13:42,761][52059] Updated weights for policy 1, policy_version 33522 (0.0007) [2023-10-08 01:13:43,124][52059] Updated weights for policy 1, policy_version 33532 (0.0008) [2023-10-08 01:13:44,470][52060] Updated weights for policy 0, policy_version 33090 (0.0008) [2023-10-08 01:13:44,846][52060] Updated weights for policy 0, policy_version 33100 (0.0009) [2023-10-08 01:13:45,223][52060] Updated weights for policy 0, policy_version 33110 (0.0008) [2023-10-08 01:13:45,597][52060] Updated weights for policy 0, policy_version 33120 (0.0010) [2023-10-08 01:13:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 68255744. Throughput: 0: 1696.4, 1: 1757.3. Samples: 17073942. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:13:46,211][50642] Avg episode reward: [(0, '18.040'), (1, '22.430')] [2023-10-08 01:13:47,106][52059] Updated weights for policy 1, policy_version 33542 (0.0010) [2023-10-08 01:13:47,468][52059] Updated weights for policy 1, policy_version 33552 (0.0008) [2023-10-08 01:13:47,829][52059] Updated weights for policy 1, policy_version 33562 (0.0008) [2023-10-08 01:13:49,661][52060] Updated weights for policy 0, policy_version 33130 (0.0009) [2023-10-08 01:13:50,031][52060] Updated weights for policy 0, policy_version 33140 (0.0008) [2023-10-08 01:13:50,397][52060] Updated weights for policy 0, policy_version 33150 (0.0010) [2023-10-08 01:13:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 68321280. Throughput: 0: 1724.4, 1: 1724.8. Samples: 17084612. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:13:51,211][50642] Avg episode reward: [(0, '18.470'), (1, '20.920')] [2023-10-08 01:13:51,693][52059] Updated weights for policy 1, policy_version 33572 (0.0009) [2023-10-08 01:13:52,062][52059] Updated weights for policy 1, policy_version 33582 (0.0011) [2023-10-08 01:13:52,430][52059] Updated weights for policy 1, policy_version 33592 (0.0009) [2023-10-08 01:13:54,389][52060] Updated weights for policy 0, policy_version 33160 (0.0009) [2023-10-08 01:13:54,763][52060] Updated weights for policy 0, policy_version 33170 (0.0010) [2023-10-08 01:13:55,124][52060] Updated weights for policy 0, policy_version 33180 (0.0007) [2023-10-08 01:13:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 68386816. Throughput: 0: 1701.7, 1: 1738.3. Samples: 17104938. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:13:56,211][50642] Avg episode reward: [(0, '22.480'), (1, '17.480')] [2023-10-08 01:13:56,445][52059] Updated weights for policy 1, policy_version 33602 (0.0007) [2023-10-08 01:13:56,824][52059] Updated weights for policy 1, policy_version 33612 (0.0012) [2023-10-08 01:13:57,185][52059] Updated weights for policy 1, policy_version 33622 (0.0008) [2023-10-08 01:13:57,541][52059] Updated weights for policy 1, policy_version 33632 (0.0007) [2023-10-08 01:13:59,292][52060] Updated weights for policy 0, policy_version 33190 (0.0010) [2023-10-08 01:13:59,667][52060] Updated weights for policy 0, policy_version 33200 (0.0008) [2023-10-08 01:14:00,036][52060] Updated weights for policy 0, policy_version 33210 (0.0009) [2023-10-08 01:14:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 68452352. Throughput: 0: 1689.2, 1: 1747.3. Samples: 17125646. Policy #0 lag: (min: 31.0, avg: 33.6, max: 63.0) [2023-10-08 01:14:01,211][50642] Avg episode reward: [(0, '17.700'), (1, '19.560')] [2023-10-08 01:14:01,380][52059] Updated weights for policy 1, policy_version 33642 (0.0009) [2023-10-08 01:14:01,754][52059] Updated weights for policy 1, policy_version 33652 (0.0007) [2023-10-08 01:14:02,109][52059] Updated weights for policy 1, policy_version 33662 (0.0011) [2023-10-08 01:14:03,979][52060] Updated weights for policy 0, policy_version 33220 (0.0009) [2023-10-08 01:14:04,337][52060] Updated weights for policy 0, policy_version 33230 (0.0008) [2023-10-08 01:14:04,709][52060] Updated weights for policy 0, policy_version 33240 (0.0007) [2023-10-08 01:14:06,007][52059] Updated weights for policy 1, policy_version 33672 (0.0011) [2023-10-08 01:14:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 68517888. Throughput: 0: 1706.8, 1: 1728.1. Samples: 17136054. Policy #0 lag: (min: 31.0, avg: 33.6, max: 63.0) [2023-10-08 01:14:06,211][50642] Avg episode reward: [(0, '18.030'), (1, '21.020')] [2023-10-08 01:14:06,388][52059] Updated weights for policy 1, policy_version 33682 (0.0008) [2023-10-08 01:14:06,749][52059] Updated weights for policy 1, policy_version 33692 (0.0007) [2023-10-08 01:14:08,604][52060] Updated weights for policy 0, policy_version 33250 (0.0007) [2023-10-08 01:14:08,974][52060] Updated weights for policy 0, policy_version 33260 (0.0007) [2023-10-08 01:14:09,338][52060] Updated weights for policy 0, policy_version 33270 (0.0008) [2023-10-08 01:14:09,707][52060] Updated weights for policy 0, policy_version 33280 (0.0007) [2023-10-08 01:14:10,734][52059] Updated weights for policy 1, policy_version 33702 (0.0009) [2023-10-08 01:14:11,094][52059] Updated weights for policy 1, policy_version 33712 (0.0009) [2023-10-08 01:14:11,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 68583424. Throughput: 0: 1686.4, 1: 1742.0. Samples: 17156260. Policy #0 lag: (min: 31.0, avg: 33.6, max: 63.0) [2023-10-08 01:14:11,211][50642] Avg episode reward: [(0, '22.350'), (1, '17.680')] [2023-10-08 01:14:11,456][52059] Updated weights for policy 1, policy_version 33722 (0.0010) [2023-10-08 01:14:13,576][52060] Updated weights for policy 0, policy_version 33290 (0.0011) [2023-10-08 01:14:13,938][52060] Updated weights for policy 0, policy_version 33300 (0.0009) [2023-10-08 01:14:14,313][52060] Updated weights for policy 0, policy_version 33310 (0.0011) [2023-10-08 01:14:15,320][52059] Updated weights for policy 1, policy_version 33732 (0.0009) [2023-10-08 01:14:15,689][52059] Updated weights for policy 1, policy_version 33742 (0.0009) [2023-10-08 01:14:16,059][52059] Updated weights for policy 1, policy_version 33752 (0.0007) [2023-10-08 01:14:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 68648960. Throughput: 0: 1705.2, 1: 1724.4. Samples: 17176832. Policy #0 lag: (min: 31.0, avg: 33.6, max: 63.0) [2023-10-08 01:14:16,211][50642] Avg episode reward: [(0, '17.990'), (1, '19.090')] [2023-10-08 01:14:18,293][52060] Updated weights for policy 0, policy_version 33320 (0.0009) [2023-10-08 01:14:18,672][52060] Updated weights for policy 0, policy_version 33330 (0.0008) [2023-10-08 01:14:19,031][52060] Updated weights for policy 0, policy_version 33340 (0.0008) [2023-10-08 01:14:19,973][52059] Updated weights for policy 1, policy_version 33762 (0.0009) [2023-10-08 01:14:20,337][52059] Updated weights for policy 1, policy_version 33772 (0.0010) [2023-10-08 01:14:20,708][52059] Updated weights for policy 1, policy_version 33782 (0.0010) [2023-10-08 01:14:21,069][52059] Updated weights for policy 1, policy_version 33792 (0.0009) [2023-10-08 01:14:21,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 68747264. Throughput: 0: 1693.4, 1: 1743.7. Samples: 17187512. Policy #0 lag: (min: 31.0, avg: 33.6, max: 63.0) [2023-10-08 01:14:21,211][50642] Avg episode reward: [(0, '18.940'), (1, '20.590')] [2023-10-08 01:14:23,065][52060] Updated weights for policy 0, policy_version 33350 (0.0009) [2023-10-08 01:14:23,427][52060] Updated weights for policy 0, policy_version 33360 (0.0009) [2023-10-08 01:14:23,810][52060] Updated weights for policy 0, policy_version 33370 (0.0007) [2023-10-08 01:14:24,889][52059] Updated weights for policy 1, policy_version 33802 (0.0008) [2023-10-08 01:14:25,255][52059] Updated weights for policy 1, policy_version 33812 (0.0007) [2023-10-08 01:14:25,610][52059] Updated weights for policy 1, policy_version 33822 (0.0008) [2023-10-08 01:14:26,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 68812800. Throughput: 0: 1691.5, 1: 1738.4. Samples: 17208104. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 01:14:26,211][50642] Avg episode reward: [(0, '23.190'), (1, '19.890')] [2023-10-08 01:14:26,211][51605] Saving new best policy, reward=23.190! [2023-10-08 01:14:27,695][52060] Updated weights for policy 0, policy_version 33380 (0.0009) [2023-10-08 01:14:28,063][52060] Updated weights for policy 0, policy_version 33390 (0.0011) [2023-10-08 01:14:28,429][52060] Updated weights for policy 0, policy_version 33400 (0.0009) [2023-10-08 01:14:29,626][52059] Updated weights for policy 1, policy_version 33832 (0.0008) [2023-10-08 01:14:29,995][52059] Updated weights for policy 1, policy_version 33842 (0.0011) [2023-10-08 01:14:30,361][52059] Updated weights for policy 1, policy_version 33852 (0.0009) [2023-10-08 01:14:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 68878336. Throughput: 0: 1717.6, 1: 1715.4. Samples: 17228426. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 01:14:31,211][50642] Avg episode reward: [(0, '16.340'), (1, '18.360')] [2023-10-08 01:14:32,359][52060] Updated weights for policy 0, policy_version 33410 (0.0009) [2023-10-08 01:14:32,724][52060] Updated weights for policy 0, policy_version 33420 (0.0009) [2023-10-08 01:14:33,093][52060] Updated weights for policy 0, policy_version 33430 (0.0008) [2023-10-08 01:14:33,458][52060] Updated weights for policy 0, policy_version 33440 (0.0008) [2023-10-08 01:14:34,209][52059] Updated weights for policy 1, policy_version 33862 (0.0007) [2023-10-08 01:14:34,561][52059] Updated weights for policy 1, policy_version 33872 (0.0009) [2023-10-08 01:14:34,937][52059] Updated weights for policy 1, policy_version 33882 (0.0008) [2023-10-08 01:14:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 68943872. Throughput: 0: 1684.1, 1: 1751.6. Samples: 17239216. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 01:14:36,211][50642] Avg episode reward: [(0, '19.460'), (1, '19.060')] [2023-10-08 01:14:37,371][52060] Updated weights for policy 0, policy_version 33450 (0.0009) [2023-10-08 01:14:37,740][52060] Updated weights for policy 0, policy_version 33460 (0.0008) [2023-10-08 01:14:38,109][52060] Updated weights for policy 0, policy_version 33470 (0.0008) [2023-10-08 01:14:38,855][52059] Updated weights for policy 1, policy_version 33892 (0.0009) [2023-10-08 01:14:39,224][52059] Updated weights for policy 1, policy_version 33902 (0.0009) [2023-10-08 01:14:39,588][52059] Updated weights for policy 1, policy_version 33912 (0.0007) [2023-10-08 01:14:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 69009408. Throughput: 0: 1707.2, 1: 1727.9. Samples: 17259516. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 01:14:41,211][50642] Avg episode reward: [(0, '22.890'), (1, '22.060')] [2023-10-08 01:14:42,126][52060] Updated weights for policy 0, policy_version 33480 (0.0010) [2023-10-08 01:14:42,503][52060] Updated weights for policy 0, policy_version 33490 (0.0009) [2023-10-08 01:14:42,874][52060] Updated weights for policy 0, policy_version 33500 (0.0007) [2023-10-08 01:14:43,558][52059] Updated weights for policy 1, policy_version 33922 (0.0007) [2023-10-08 01:14:43,910][52059] Updated weights for policy 1, policy_version 33932 (0.0008) [2023-10-08 01:14:44,281][52059] Updated weights for policy 1, policy_version 33942 (0.0008) [2023-10-08 01:14:44,646][52059] Updated weights for policy 1, policy_version 33952 (0.0009) [2023-10-08 01:14:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 69074944. Throughput: 0: 1719.9, 1: 1716.7. Samples: 17280292. Policy #0 lag: (min: 27.0, avg: 27.0, max: 27.0) [2023-10-08 01:14:46,211][50642] Avg episode reward: [(0, '16.820'), (1, '16.480')] [2023-10-08 01:14:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000033504_34308096.pth... [2023-10-08 01:14:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000033952_34766848.pth... [2023-10-08 01:14:46,249][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000031904_32669696.pth [2023-10-08 01:14:46,251][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000032320_33095680.pth [2023-10-08 01:14:46,964][52060] Updated weights for policy 0, policy_version 33510 (0.0009) [2023-10-08 01:14:47,332][52060] Updated weights for policy 0, policy_version 33520 (0.0007) [2023-10-08 01:14:47,701][52060] Updated weights for policy 0, policy_version 33530 (0.0008) [2023-10-08 01:14:48,589][52059] Updated weights for policy 1, policy_version 33962 (0.0009) [2023-10-08 01:14:48,949][52059] Updated weights for policy 1, policy_version 33972 (0.0009) [2023-10-08 01:14:49,317][52059] Updated weights for policy 1, policy_version 33982 (0.0009) [2023-10-08 01:14:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 69140480. Throughput: 0: 1695.1, 1: 1734.0. Samples: 17290360. Policy #0 lag: (min: 1.0, avg: 11.1, max: 33.0) [2023-10-08 01:14:51,211][50642] Avg episode reward: [(0, '19.140'), (1, '16.310')] [2023-10-08 01:14:51,618][52060] Updated weights for policy 0, policy_version 33540 (0.0008) [2023-10-08 01:14:51,986][52060] Updated weights for policy 0, policy_version 33550 (0.0008) [2023-10-08 01:14:52,365][52060] Updated weights for policy 0, policy_version 33560 (0.0009) [2023-10-08 01:14:53,368][52059] Updated weights for policy 1, policy_version 33992 (0.0008) [2023-10-08 01:14:53,739][52059] Updated weights for policy 1, policy_version 34002 (0.0009) [2023-10-08 01:14:54,104][52059] Updated weights for policy 1, policy_version 34012 (0.0008) [2023-10-08 01:14:56,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 69206016. Throughput: 0: 1722.3, 1: 1719.5. Samples: 17311144. Policy #0 lag: (min: 1.0, avg: 11.1, max: 33.0) [2023-10-08 01:14:56,211][50642] Avg episode reward: [(0, '23.670'), (1, '21.370')] [2023-10-08 01:14:56,260][52060] Updated weights for policy 0, policy_version 33570 (0.0007) [2023-10-08 01:14:56,628][52060] Updated weights for policy 0, policy_version 33580 (0.0008) [2023-10-08 01:14:56,998][52060] Updated weights for policy 0, policy_version 33590 (0.0007) [2023-10-08 01:14:57,359][51605] Saving new best policy, reward=23.670! [2023-10-08 01:14:57,363][52060] Updated weights for policy 0, policy_version 33600 (0.0007) [2023-10-08 01:14:58,008][52059] Updated weights for policy 1, policy_version 34022 (0.0008) [2023-10-08 01:14:58,380][52059] Updated weights for policy 1, policy_version 34032 (0.0008) [2023-10-08 01:14:58,748][52059] Updated weights for policy 1, policy_version 34042 (0.0009) [2023-10-08 01:15:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 69271552. Throughput: 0: 1726.8, 1: 1734.2. Samples: 17332576. Policy #0 lag: (min: 1.0, avg: 11.1, max: 33.0) [2023-10-08 01:15:01,211][50642] Avg episode reward: [(0, '16.930'), (1, '15.290')] [2023-10-08 01:15:01,443][52060] Updated weights for policy 0, policy_version 33610 (0.0008) [2023-10-08 01:15:01,811][52060] Updated weights for policy 0, policy_version 33620 (0.0008) [2023-10-08 01:15:02,184][52060] Updated weights for policy 0, policy_version 33630 (0.0008) [2023-10-08 01:15:02,626][52059] Updated weights for policy 1, policy_version 34052 (0.0009) [2023-10-08 01:15:02,992][52059] Updated weights for policy 1, policy_version 34062 (0.0009) [2023-10-08 01:15:03,357][52059] Updated weights for policy 1, policy_version 34072 (0.0008) [2023-10-08 01:15:06,062][52060] Updated weights for policy 0, policy_version 33640 (0.0007) [2023-10-08 01:15:06,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 69337088. Throughput: 0: 1717.0, 1: 1715.3. Samples: 17341966. Policy #0 lag: (min: 1.0, avg: 11.1, max: 33.0) [2023-10-08 01:15:06,211][50642] Avg episode reward: [(0, '18.540'), (1, '17.300')] [2023-10-08 01:15:06,433][52060] Updated weights for policy 0, policy_version 33650 (0.0007) [2023-10-08 01:15:06,812][52060] Updated weights for policy 0, policy_version 33660 (0.0007) [2023-10-08 01:15:07,222][52059] Updated weights for policy 1, policy_version 34082 (0.0009) [2023-10-08 01:15:07,590][52059] Updated weights for policy 1, policy_version 34092 (0.0009) [2023-10-08 01:15:07,945][52059] Updated weights for policy 1, policy_version 34102 (0.0007) [2023-10-08 01:15:08,316][52059] Updated weights for policy 1, policy_version 34112 (0.0008) [2023-10-08 01:15:10,645][52060] Updated weights for policy 0, policy_version 33670 (0.0008) [2023-10-08 01:15:11,019][52060] Updated weights for policy 0, policy_version 33680 (0.0009) [2023-10-08 01:15:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 69402624. Throughput: 0: 1729.0, 1: 1727.4. Samples: 17363642. Policy #0 lag: (min: 1.0, avg: 11.1, max: 33.0) [2023-10-08 01:15:11,211][50642] Avg episode reward: [(0, '23.990'), (1, '21.920')] [2023-10-08 01:15:11,379][52060] Updated weights for policy 0, policy_version 33690 (0.0008) [2023-10-08 01:15:11,596][51605] Saving new best policy, reward=23.990! [2023-10-08 01:15:12,071][52059] Updated weights for policy 1, policy_version 34122 (0.0011) [2023-10-08 01:15:12,447][52059] Updated weights for policy 1, policy_version 34132 (0.0010) [2023-10-08 01:15:12,800][52059] Updated weights for policy 1, policy_version 34142 (0.0010) [2023-10-08 01:15:15,356][52060] Updated weights for policy 0, policy_version 33700 (0.0009) [2023-10-08 01:15:15,724][52060] Updated weights for policy 0, policy_version 33710 (0.0008) [2023-10-08 01:15:16,101][52060] Updated weights for policy 0, policy_version 33720 (0.0010) [2023-10-08 01:15:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 69468160. Throughput: 0: 1720.2, 1: 1748.9. Samples: 17384536. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:15:16,211][50642] Avg episode reward: [(0, '17.440'), (1, '13.740')] [2023-10-08 01:15:16,866][52059] Updated weights for policy 1, policy_version 34152 (0.0007) [2023-10-08 01:15:17,243][52059] Updated weights for policy 1, policy_version 34162 (0.0007) [2023-10-08 01:15:17,599][52059] Updated weights for policy 1, policy_version 34172 (0.0007) [2023-10-08 01:15:20,088][52060] Updated weights for policy 0, policy_version 33730 (0.0009) [2023-10-08 01:15:20,459][52060] Updated weights for policy 0, policy_version 33740 (0.0007) [2023-10-08 01:15:20,819][52060] Updated weights for policy 0, policy_version 33750 (0.0008) [2023-10-08 01:15:21,194][52060] Updated weights for policy 0, policy_version 33760 (0.0008) [2023-10-08 01:15:21,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 69566464. Throughput: 0: 1737.6, 1: 1714.8. Samples: 17394572. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:15:21,211][50642] Avg episode reward: [(0, '18.140'), (1, '16.830')] [2023-10-08 01:15:21,510][52059] Updated weights for policy 1, policy_version 34182 (0.0008) [2023-10-08 01:15:21,870][52059] Updated weights for policy 1, policy_version 34192 (0.0010) [2023-10-08 01:15:22,246][52059] Updated weights for policy 1, policy_version 34202 (0.0008) [2023-10-08 01:15:24,952][52060] Updated weights for policy 0, policy_version 33770 (0.0008) [2023-10-08 01:15:25,326][52060] Updated weights for policy 0, policy_version 33780 (0.0009) [2023-10-08 01:15:25,691][52060] Updated weights for policy 0, policy_version 33790 (0.0009) [2023-10-08 01:15:26,208][52059] Updated weights for policy 1, policy_version 34212 (0.0009) [2023-10-08 01:15:26,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 69632000. Throughput: 0: 1730.8, 1: 1739.0. Samples: 17415654. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:15:26,211][50642] Avg episode reward: [(0, '22.580'), (1, '19.680')] [2023-10-08 01:15:26,575][52059] Updated weights for policy 1, policy_version 34222 (0.0007) [2023-10-08 01:15:26,942][52059] Updated weights for policy 1, policy_version 34232 (0.0008) [2023-10-08 01:15:29,649][52060] Updated weights for policy 0, policy_version 33800 (0.0009) [2023-10-08 01:15:30,019][52060] Updated weights for policy 0, policy_version 33810 (0.0007) [2023-10-08 01:15:30,389][52060] Updated weights for policy 0, policy_version 33820 (0.0008) [2023-10-08 01:15:30,768][52059] Updated weights for policy 1, policy_version 34242 (0.0009) [2023-10-08 01:15:31,142][52059] Updated weights for policy 1, policy_version 34252 (0.0008) [2023-10-08 01:15:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 69697536. Throughput: 0: 1711.9, 1: 1746.3. Samples: 17435908. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:15:31,211][50642] Avg episode reward: [(0, '16.800'), (1, '16.050')] [2023-10-08 01:15:31,497][52059] Updated weights for policy 1, policy_version 34262 (0.0008) [2023-10-08 01:15:31,862][52059] Updated weights for policy 1, policy_version 34272 (0.0007) [2023-10-08 01:15:34,408][52060] Updated weights for policy 0, policy_version 33830 (0.0010) [2023-10-08 01:15:34,788][52060] Updated weights for policy 0, policy_version 33840 (0.0009) [2023-10-08 01:15:35,167][52060] Updated weights for policy 0, policy_version 33850 (0.0011) [2023-10-08 01:15:35,659][52059] Updated weights for policy 1, policy_version 34282 (0.0009) [2023-10-08 01:15:36,024][52059] Updated weights for policy 1, policy_version 34292 (0.0010) [2023-10-08 01:15:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 69763072. Throughput: 0: 1741.2, 1: 1739.2. Samples: 17446978. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:15:36,211][50642] Avg episode reward: [(0, '17.670'), (1, '15.170')] [2023-10-08 01:15:36,396][52059] Updated weights for policy 1, policy_version 34302 (0.0009) [2023-10-08 01:15:39,111][52060] Updated weights for policy 0, policy_version 33860 (0.0009) [2023-10-08 01:15:39,478][52060] Updated weights for policy 0, policy_version 33870 (0.0008) [2023-10-08 01:15:39,846][52060] Updated weights for policy 0, policy_version 33880 (0.0008) [2023-10-08 01:15:40,423][52059] Updated weights for policy 1, policy_version 34312 (0.0008) [2023-10-08 01:15:40,783][52059] Updated weights for policy 1, policy_version 34322 (0.0010) [2023-10-08 01:15:41,151][52059] Updated weights for policy 1, policy_version 34332 (0.0007) [2023-10-08 01:15:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 69828608. Throughput: 0: 1716.6, 1: 1753.9. Samples: 17467318. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:15:41,211][50642] Avg episode reward: [(0, '22.410'), (1, '18.520')] [2023-10-08 01:15:43,817][52060] Updated weights for policy 0, policy_version 33890 (0.0008) [2023-10-08 01:15:44,181][52060] Updated weights for policy 0, policy_version 33900 (0.0009) [2023-10-08 01:15:44,549][52060] Updated weights for policy 0, policy_version 33910 (0.0011) [2023-10-08 01:15:44,912][52060] Updated weights for policy 0, policy_version 33920 (0.0009) [2023-10-08 01:15:45,057][52059] Updated weights for policy 1, policy_version 34342 (0.0007) [2023-10-08 01:15:45,423][52059] Updated weights for policy 1, policy_version 34352 (0.0008) [2023-10-08 01:15:45,786][52059] Updated weights for policy 1, policy_version 34362 (0.0008) [2023-10-08 01:15:46,210][50642] Fps is (10 sec: 16383.5, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 69926912. Throughput: 0: 1707.7, 1: 1731.5. Samples: 17487344. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:15:46,211][50642] Avg episode reward: [(0, '17.580'), (1, '20.350')] [2023-10-08 01:15:48,957][52060] Updated weights for policy 0, policy_version 33930 (0.0008) [2023-10-08 01:15:49,327][52060] Updated weights for policy 0, policy_version 33940 (0.0007) [2023-10-08 01:15:49,693][52060] Updated weights for policy 0, policy_version 33950 (0.0008) [2023-10-08 01:15:49,734][52059] Updated weights for policy 1, policy_version 34372 (0.0010) [2023-10-08 01:15:50,096][52059] Updated weights for policy 1, policy_version 34382 (0.0008) [2023-10-08 01:15:50,455][52059] Updated weights for policy 1, policy_version 34392 (0.0010) [2023-10-08 01:15:51,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 69992448. Throughput: 0: 1730.5, 1: 1755.0. Samples: 17498812. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:15:51,211][50642] Avg episode reward: [(0, '18.620'), (1, '17.400')] [2023-10-08 01:15:53,810][52060] Updated weights for policy 0, policy_version 33960 (0.0010) [2023-10-08 01:15:54,168][52060] Updated weights for policy 0, policy_version 33970 (0.0009) [2023-10-08 01:15:54,339][52059] Updated weights for policy 1, policy_version 34402 (0.0008) [2023-10-08 01:15:54,541][52060] Updated weights for policy 0, policy_version 33980 (0.0007) [2023-10-08 01:15:54,708][52059] Updated weights for policy 1, policy_version 34412 (0.0009) [2023-10-08 01:15:55,081][52059] Updated weights for policy 1, policy_version 34422 (0.0011) [2023-10-08 01:15:55,437][52059] Updated weights for policy 1, policy_version 34432 (0.0009) [2023-10-08 01:15:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 70057984. Throughput: 0: 1702.1, 1: 1734.8. Samples: 17518302. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:15:56,211][50642] Avg episode reward: [(0, '22.170'), (1, '18.920')] [2023-10-08 01:15:58,476][52060] Updated weights for policy 0, policy_version 33990 (0.0008) [2023-10-08 01:15:58,851][52060] Updated weights for policy 0, policy_version 34000 (0.0008) [2023-10-08 01:15:59,211][52060] Updated weights for policy 0, policy_version 34010 (0.0007) [2023-10-08 01:15:59,447][52059] Updated weights for policy 1, policy_version 34442 (0.0008) [2023-10-08 01:15:59,816][52059] Updated weights for policy 1, policy_version 34452 (0.0011) [2023-10-08 01:16:00,189][52059] Updated weights for policy 1, policy_version 34462 (0.0009) [2023-10-08 01:16:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 70123520. Throughput: 0: 1713.2, 1: 1720.1. Samples: 17539034. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:16:01,211][50642] Avg episode reward: [(0, '17.900'), (1, '20.370')] [2023-10-08 01:16:03,273][52060] Updated weights for policy 0, policy_version 34020 (0.0008) [2023-10-08 01:16:03,642][52060] Updated weights for policy 0, policy_version 34030 (0.0009) [2023-10-08 01:16:04,011][52060] Updated weights for policy 0, policy_version 34040 (0.0009) [2023-10-08 01:16:04,242][52059] Updated weights for policy 1, policy_version 34472 (0.0008) [2023-10-08 01:16:04,613][52059] Updated weights for policy 1, policy_version 34482 (0.0007) [2023-10-08 01:16:04,981][52059] Updated weights for policy 1, policy_version 34492 (0.0007) [2023-10-08 01:16:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 70189056. Throughput: 0: 1709.4, 1: 1746.8. Samples: 17550100. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:16:06,211][50642] Avg episode reward: [(0, '18.070'), (1, '17.660')] [2023-10-08 01:16:08,023][52060] Updated weights for policy 0, policy_version 34050 (0.0008) [2023-10-08 01:16:08,389][52060] Updated weights for policy 0, policy_version 34060 (0.0009) [2023-10-08 01:16:08,757][52060] Updated weights for policy 0, policy_version 34070 (0.0009) [2023-10-08 01:16:08,923][52059] Updated weights for policy 1, policy_version 34502 (0.0009) [2023-10-08 01:16:09,126][52060] Updated weights for policy 0, policy_version 34080 (0.0008) [2023-10-08 01:16:09,274][52059] Updated weights for policy 1, policy_version 34512 (0.0009) [2023-10-08 01:16:09,639][52059] Updated weights for policy 1, policy_version 34522 (0.0007) [2023-10-08 01:16:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 70254592. Throughput: 0: 1701.5, 1: 1716.1. Samples: 17569446. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:16:11,211][50642] Avg episode reward: [(0, '21.940'), (1, '18.050')] [2023-10-08 01:16:13,163][52060] Updated weights for policy 0, policy_version 34090 (0.0008) [2023-10-08 01:16:13,485][52059] Updated weights for policy 1, policy_version 34532 (0.0007) [2023-10-08 01:16:13,548][52060] Updated weights for policy 0, policy_version 34100 (0.0009) [2023-10-08 01:16:13,840][52059] Updated weights for policy 1, policy_version 34542 (0.0008) [2023-10-08 01:16:13,910][52060] Updated weights for policy 0, policy_version 34110 (0.0009) [2023-10-08 01:16:14,211][52059] Updated weights for policy 1, policy_version 34552 (0.0009) [2023-10-08 01:16:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 70320128. Throughput: 0: 1720.2, 1: 1718.1. Samples: 17590632. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:16:16,211][50642] Avg episode reward: [(0, '18.470'), (1, '20.570')] [2023-10-08 01:16:17,825][52060] Updated weights for policy 0, policy_version 34120 (0.0009) [2023-10-08 01:16:18,116][52059] Updated weights for policy 1, policy_version 34562 (0.0009) [2023-10-08 01:16:18,207][52060] Updated weights for policy 0, policy_version 34130 (0.0009) [2023-10-08 01:16:18,480][52059] Updated weights for policy 1, policy_version 34572 (0.0008) [2023-10-08 01:16:18,568][52060] Updated weights for policy 0, policy_version 34140 (0.0010) [2023-10-08 01:16:18,844][52059] Updated weights for policy 1, policy_version 34582 (0.0009) [2023-10-08 01:16:19,214][52059] Updated weights for policy 1, policy_version 34592 (0.0009) [2023-10-08 01:16:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 70385664. Throughput: 0: 1686.6, 1: 1721.9. Samples: 17600360. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:16:21,211][50642] Avg episode reward: [(0, '17.690'), (1, '20.000')] [2023-10-08 01:16:22,528][52060] Updated weights for policy 0, policy_version 34150 (0.0009) [2023-10-08 01:16:22,899][52060] Updated weights for policy 0, policy_version 34160 (0.0009) [2023-10-08 01:16:23,266][52060] Updated weights for policy 0, policy_version 34170 (0.0007) [2023-10-08 01:16:23,292][52059] Updated weights for policy 1, policy_version 34602 (0.0007) [2023-10-08 01:16:23,666][52059] Updated weights for policy 1, policy_version 34612 (0.0008) [2023-10-08 01:16:24,022][52059] Updated weights for policy 1, policy_version 34622 (0.0011) [2023-10-08 01:16:26,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 70451200. Throughput: 0: 1709.9, 1: 1711.0. Samples: 17621256. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:16:26,211][50642] Avg episode reward: [(0, '22.080'), (1, '19.190')] [2023-10-08 01:16:27,196][52060] Updated weights for policy 0, policy_version 34180 (0.0007) [2023-10-08 01:16:27,566][52060] Updated weights for policy 0, policy_version 34190 (0.0007) [2023-10-08 01:16:27,941][52060] Updated weights for policy 0, policy_version 34200 (0.0007) [2023-10-08 01:16:27,960][52059] Updated weights for policy 1, policy_version 34632 (0.0007) [2023-10-08 01:16:28,329][52059] Updated weights for policy 1, policy_version 34642 (0.0010) [2023-10-08 01:16:28,682][52059] Updated weights for policy 1, policy_version 34652 (0.0007) [2023-10-08 01:16:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 70516736. Throughput: 0: 1714.0, 1: 1732.9. Samples: 17642452. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:16:31,211][50642] Avg episode reward: [(0, '19.310'), (1, '19.260')] [2023-10-08 01:16:31,870][52060] Updated weights for policy 0, policy_version 34210 (0.0008) [2023-10-08 01:16:32,235][52060] Updated weights for policy 0, policy_version 34220 (0.0010) [2023-10-08 01:16:32,604][52060] Updated weights for policy 0, policy_version 34230 (0.0008) [2023-10-08 01:16:32,684][52059] Updated weights for policy 1, policy_version 34662 (0.0009) [2023-10-08 01:16:32,975][52060] Updated weights for policy 0, policy_version 34240 (0.0007) [2023-10-08 01:16:33,053][52059] Updated weights for policy 1, policy_version 34672 (0.0008) [2023-10-08 01:16:33,419][52059] Updated weights for policy 1, policy_version 34682 (0.0008) [2023-10-08 01:16:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 70582272. Throughput: 0: 1692.0, 1: 1706.7. Samples: 17651754. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:16:36,211][50642] Avg episode reward: [(0, '18.250'), (1, '21.120')] [2023-10-08 01:16:36,999][52060] Updated weights for policy 0, policy_version 34250 (0.0008) [2023-10-08 01:16:37,363][52060] Updated weights for policy 0, policy_version 34260 (0.0008) [2023-10-08 01:16:37,450][52059] Updated weights for policy 1, policy_version 34692 (0.0009) [2023-10-08 01:16:37,736][52060] Updated weights for policy 0, policy_version 34270 (0.0007) [2023-10-08 01:16:37,802][52059] Updated weights for policy 1, policy_version 34702 (0.0009) [2023-10-08 01:16:38,168][52059] Updated weights for policy 1, policy_version 34712 (0.0009) [2023-10-08 01:16:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 70647808. Throughput: 0: 1722.1, 1: 1718.0. Samples: 17673106. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:16:41,211][50642] Avg episode reward: [(0, '22.230'), (1, '20.870')] [2023-10-08 01:16:41,552][52060] Updated weights for policy 0, policy_version 34280 (0.0008) [2023-10-08 01:16:41,912][52060] Updated weights for policy 0, policy_version 34290 (0.0009) [2023-10-08 01:16:42,117][52059] Updated weights for policy 1, policy_version 34722 (0.0010) [2023-10-08 01:16:42,283][52060] Updated weights for policy 0, policy_version 34300 (0.0009) [2023-10-08 01:16:42,491][52059] Updated weights for policy 1, policy_version 34732 (0.0009) [2023-10-08 01:16:42,855][52059] Updated weights for policy 1, policy_version 34742 (0.0008) [2023-10-08 01:16:43,216][52059] Updated weights for policy 1, policy_version 34752 (0.0007) [2023-10-08 01:16:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 70713344. Throughput: 0: 1713.0, 1: 1738.3. Samples: 17694342. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:16:46,211][50642] Avg episode reward: [(0, '19.800'), (1, '18.370')] [2023-10-08 01:16:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000034752_35586048.pth... [2023-10-08 01:16:46,251][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000033152_33947648.pth [2023-10-08 01:16:46,413][52060] Updated weights for policy 0, policy_version 34310 (0.0010) [2023-10-08 01:16:46,786][52060] Updated weights for policy 0, policy_version 34320 (0.0008) [2023-10-08 01:16:47,147][52059] Updated weights for policy 1, policy_version 34762 (0.0007) [2023-10-08 01:16:47,155][52060] Updated weights for policy 0, policy_version 34330 (0.0007) [2023-10-08 01:16:47,371][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000034336_35160064.pth... [2023-10-08 01:16:47,399][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000032704_33488896.pth [2023-10-08 01:16:47,513][52059] Updated weights for policy 1, policy_version 34772 (0.0008) [2023-10-08 01:16:47,885][52059] Updated weights for policy 1, policy_version 34782 (0.0009) [2023-10-08 01:16:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 70778880. Throughput: 0: 1696.2, 1: 1712.1. Samples: 17703472. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:16:51,211][50642] Avg episode reward: [(0, '19.050'), (1, '20.880')] [2023-10-08 01:16:51,225][52060] Updated weights for policy 0, policy_version 34340 (0.0009) [2023-10-08 01:16:51,596][52060] Updated weights for policy 0, policy_version 34350 (0.0008) [2023-10-08 01:16:51,760][52059] Updated weights for policy 1, policy_version 34792 (0.0010) [2023-10-08 01:16:51,967][52060] Updated weights for policy 0, policy_version 34360 (0.0009) [2023-10-08 01:16:52,117][52059] Updated weights for policy 1, policy_version 34802 (0.0008) [2023-10-08 01:16:52,486][52059] Updated weights for policy 1, policy_version 34812 (0.0009) [2023-10-08 01:16:56,073][52060] Updated weights for policy 0, policy_version 34370 (0.0010) [2023-10-08 01:16:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 70844416. Throughput: 0: 1710.7, 1: 1743.0. Samples: 17724862. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:16:56,211][50642] Avg episode reward: [(0, '21.060'), (1, '19.550')] [2023-10-08 01:16:56,443][52060] Updated weights for policy 0, policy_version 34380 (0.0010) [2023-10-08 01:16:56,536][52059] Updated weights for policy 1, policy_version 34822 (0.0008) [2023-10-08 01:16:56,817][52060] Updated weights for policy 0, policy_version 34390 (0.0008) [2023-10-08 01:16:56,897][52059] Updated weights for policy 1, policy_version 34832 (0.0010) [2023-10-08 01:16:57,186][52060] Updated weights for policy 0, policy_version 34400 (0.0010) [2023-10-08 01:16:57,264][52059] Updated weights for policy 1, policy_version 34842 (0.0010) [2023-10-08 01:17:01,015][52059] Updated weights for policy 1, policy_version 34852 (0.0009) [2023-10-08 01:17:01,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 70909952. Throughput: 0: 1712.4, 1: 1740.4. Samples: 17746006. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:17:01,211][50642] Avg episode reward: [(0, '19.790'), (1, '17.100')] [2023-10-08 01:17:01,242][52060] Updated weights for policy 0, policy_version 34410 (0.0009) [2023-10-08 01:17:01,380][52059] Updated weights for policy 1, policy_version 34862 (0.0007) [2023-10-08 01:17:01,617][52060] Updated weights for policy 0, policy_version 34420 (0.0009) [2023-10-08 01:17:01,739][52059] Updated weights for policy 1, policy_version 34872 (0.0007) [2023-10-08 01:17:01,979][52060] Updated weights for policy 0, policy_version 34430 (0.0008) [2023-10-08 01:17:05,783][52059] Updated weights for policy 1, policy_version 34882 (0.0009) [2023-10-08 01:17:05,984][52060] Updated weights for policy 0, policy_version 34440 (0.0007) [2023-10-08 01:17:06,154][52059] Updated weights for policy 1, policy_version 34892 (0.0008) [2023-10-08 01:17:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 70975488. Throughput: 0: 1714.6, 1: 1734.4. Samples: 17755566. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:17:06,211][50642] Avg episode reward: [(0, '18.010'), (1, '18.600')] [2023-10-08 01:17:06,358][52060] Updated weights for policy 0, policy_version 34450 (0.0007) [2023-10-08 01:17:06,504][52059] Updated weights for policy 1, policy_version 34902 (0.0008) [2023-10-08 01:17:06,731][52060] Updated weights for policy 0, policy_version 34460 (0.0009) [2023-10-08 01:17:06,877][52059] Updated weights for policy 1, policy_version 34912 (0.0007) [2023-10-08 01:17:10,593][52059] Updated weights for policy 1, policy_version 34922 (0.0007) [2023-10-08 01:17:10,636][52060] Updated weights for policy 0, policy_version 34470 (0.0007) [2023-10-08 01:17:10,956][52059] Updated weights for policy 1, policy_version 34932 (0.0007) [2023-10-08 01:17:10,997][52060] Updated weights for policy 0, policy_version 34480 (0.0007) [2023-10-08 01:17:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 71041024. Throughput: 0: 1712.2, 1: 1740.1. Samples: 17776612. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:17:11,211][50642] Avg episode reward: [(0, '21.270'), (1, '21.390')] [2023-10-08 01:17:11,310][52059] Updated weights for policy 1, policy_version 34942 (0.0008) [2023-10-08 01:17:11,376][52060] Updated weights for policy 0, policy_version 34490 (0.0009) [2023-10-08 01:17:15,357][52060] Updated weights for policy 0, policy_version 34500 (0.0008) [2023-10-08 01:17:15,424][52059] Updated weights for policy 1, policy_version 34952 (0.0008) [2023-10-08 01:17:15,727][52060] Updated weights for policy 0, policy_version 34510 (0.0009) [2023-10-08 01:17:15,788][52059] Updated weights for policy 1, policy_version 34962 (0.0008) [2023-10-08 01:17:16,095][52060] Updated weights for policy 0, policy_version 34520 (0.0009) [2023-10-08 01:17:16,157][52059] Updated weights for policy 1, policy_version 34972 (0.0007) [2023-10-08 01:17:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 71106560. Throughput: 0: 1700.0, 1: 1716.7. Samples: 17796202. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:17:16,211][50642] Avg episode reward: [(0, '19.620'), (1, '18.190')] [2023-10-08 01:17:20,052][52059] Updated weights for policy 1, policy_version 34982 (0.0008) [2023-10-08 01:17:20,079][52060] Updated weights for policy 0, policy_version 34530 (0.0009) [2023-10-08 01:17:20,419][52059] Updated weights for policy 1, policy_version 34992 (0.0009) [2023-10-08 01:17:20,444][52060] Updated weights for policy 0, policy_version 34540 (0.0009) [2023-10-08 01:17:20,775][52059] Updated weights for policy 1, policy_version 35002 (0.0008) [2023-10-08 01:17:20,812][52060] Updated weights for policy 0, policy_version 34550 (0.0008) [2023-10-08 01:17:21,185][52060] Updated weights for policy 0, policy_version 34560 (0.0009) [2023-10-08 01:17:21,210][50642] Fps is (10 sec: 19660.6, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 71237632. Throughput: 0: 1710.7, 1: 1739.0. Samples: 17806990. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:17:21,211][50642] Avg episode reward: [(0, '17.020'), (1, '18.120')] [2023-10-08 01:17:24,874][52059] Updated weights for policy 1, policy_version 35012 (0.0008) [2023-10-08 01:17:25,237][52059] Updated weights for policy 1, policy_version 35022 (0.0009) [2023-10-08 01:17:25,256][52060] Updated weights for policy 0, policy_version 34570 (0.0007) [2023-10-08 01:17:25,605][52059] Updated weights for policy 1, policy_version 35032 (0.0008) [2023-10-08 01:17:25,624][52060] Updated weights for policy 0, policy_version 34580 (0.0007) [2023-10-08 01:17:25,992][52060] Updated weights for policy 0, policy_version 34590 (0.0007) [2023-10-08 01:17:26,210][50642] Fps is (10 sec: 19660.7, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 71303168. Throughput: 0: 1707.6, 1: 1734.2. Samples: 17827988. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:17:26,211][50642] Avg episode reward: [(0, '20.460'), (1, '22.270')] [2023-10-08 01:17:29,517][52059] Updated weights for policy 1, policy_version 35042 (0.0009) [2023-10-08 01:17:29,885][52059] Updated weights for policy 1, policy_version 35052 (0.0007) [2023-10-08 01:17:29,996][52060] Updated weights for policy 0, policy_version 34600 (0.0007) [2023-10-08 01:17:30,242][52059] Updated weights for policy 1, policy_version 35062 (0.0009) [2023-10-08 01:17:30,362][52060] Updated weights for policy 0, policy_version 34610 (0.0008) [2023-10-08 01:17:30,607][52059] Updated weights for policy 1, policy_version 35072 (0.0008) [2023-10-08 01:17:30,724][52060] Updated weights for policy 0, policy_version 34620 (0.0010) [2023-10-08 01:17:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 71368704. Throughput: 0: 1684.6, 1: 1706.8. Samples: 17846954. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:17:31,211][50642] Avg episode reward: [(0, '20.440'), (1, '19.510')] [2023-10-08 01:17:34,501][52059] Updated weights for policy 1, policy_version 35082 (0.0009) [2023-10-08 01:17:34,659][52060] Updated weights for policy 0, policy_version 34630 (0.0009) [2023-10-08 01:17:34,871][52059] Updated weights for policy 1, policy_version 35092 (0.0008) [2023-10-08 01:17:35,020][52060] Updated weights for policy 0, policy_version 34640 (0.0007) [2023-10-08 01:17:35,237][52059] Updated weights for policy 1, policy_version 35102 (0.0008) [2023-10-08 01:17:35,382][52060] Updated weights for policy 0, policy_version 34650 (0.0008) [2023-10-08 01:17:36,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 71434240. Throughput: 0: 1716.8, 1: 1735.6. Samples: 17858828. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:17:36,211][50642] Avg episode reward: [(0, '17.050'), (1, '18.110')] [2023-10-08 01:17:39,301][52059] Updated weights for policy 1, policy_version 35112 (0.0010) [2023-10-08 01:17:39,357][52060] Updated weights for policy 0, policy_version 34660 (0.0009) [2023-10-08 01:17:39,678][52059] Updated weights for policy 1, policy_version 35122 (0.0007) [2023-10-08 01:17:39,732][52060] Updated weights for policy 0, policy_version 34670 (0.0010) [2023-10-08 01:17:40,040][52059] Updated weights for policy 1, policy_version 35132 (0.0007) [2023-10-08 01:17:40,106][52060] Updated weights for policy 0, policy_version 34680 (0.0009) [2023-10-08 01:17:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 71499776. Throughput: 0: 1701.5, 1: 1714.8. Samples: 17878592. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:17:41,211][50642] Avg episode reward: [(0, '20.430'), (1, '19.570')] [2023-10-08 01:17:43,811][52059] Updated weights for policy 1, policy_version 35142 (0.0008) [2023-10-08 01:17:43,984][52060] Updated weights for policy 0, policy_version 34690 (0.0009) [2023-10-08 01:17:44,174][52059] Updated weights for policy 1, policy_version 35152 (0.0008) [2023-10-08 01:17:44,353][52060] Updated weights for policy 0, policy_version 34700 (0.0008) [2023-10-08 01:17:44,537][52059] Updated weights for policy 1, policy_version 35162 (0.0008) [2023-10-08 01:17:44,721][52060] Updated weights for policy 0, policy_version 34710 (0.0009) [2023-10-08 01:17:45,091][52060] Updated weights for policy 0, policy_version 34720 (0.0009) [2023-10-08 01:17:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 71565312. Throughput: 0: 1693.8, 1: 1712.8. Samples: 17899304. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:17:46,211][50642] Avg episode reward: [(0, '20.370'), (1, '20.450')] [2023-10-08 01:17:48,384][52059] Updated weights for policy 1, policy_version 35172 (0.0007) [2023-10-08 01:17:48,755][52059] Updated weights for policy 1, policy_version 35182 (0.0008) [2023-10-08 01:17:49,112][52059] Updated weights for policy 1, policy_version 35192 (0.0007) [2023-10-08 01:17:49,161][52060] Updated weights for policy 0, policy_version 34730 (0.0007) [2023-10-08 01:17:49,524][52060] Updated weights for policy 0, policy_version 34740 (0.0008) [2023-10-08 01:17:49,897][52060] Updated weights for policy 0, policy_version 34750 (0.0009) [2023-10-08 01:17:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 71630848. Throughput: 0: 1724.4, 1: 1721.6. Samples: 17910634. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:17:51,211][50642] Avg episode reward: [(0, '16.260'), (1, '16.210')] [2023-10-08 01:17:53,083][52059] Updated weights for policy 1, policy_version 35202 (0.0007) [2023-10-08 01:17:53,446][52059] Updated weights for policy 1, policy_version 35212 (0.0007) [2023-10-08 01:17:53,762][52060] Updated weights for policy 0, policy_version 34760 (0.0009) [2023-10-08 01:17:53,809][52059] Updated weights for policy 1, policy_version 35222 (0.0007) [2023-10-08 01:17:54,134][52060] Updated weights for policy 0, policy_version 34770 (0.0010) [2023-10-08 01:17:54,170][52059] Updated weights for policy 1, policy_version 35232 (0.0008) [2023-10-08 01:17:54,497][52060] Updated weights for policy 0, policy_version 34780 (0.0010) [2023-10-08 01:17:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 71696384. Throughput: 0: 1697.1, 1: 1712.2. Samples: 17930028. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:17:56,211][50642] Avg episode reward: [(0, '20.030'), (1, '19.900')] [2023-10-08 01:17:58,126][52059] Updated weights for policy 1, policy_version 35242 (0.0009) [2023-10-08 01:17:58,493][52059] Updated weights for policy 1, policy_version 35252 (0.0008) [2023-10-08 01:17:58,499][52060] Updated weights for policy 0, policy_version 34790 (0.0008) [2023-10-08 01:17:58,857][52059] Updated weights for policy 1, policy_version 35262 (0.0007) [2023-10-08 01:17:58,858][52060] Updated weights for policy 0, policy_version 34800 (0.0008) [2023-10-08 01:17:59,238][52060] Updated weights for policy 0, policy_version 34810 (0.0009) [2023-10-08 01:18:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 71761920. Throughput: 0: 1713.5, 1: 1732.1. Samples: 17951256. Policy #0 lag: (min: 31.0, avg: 31.4, max: 45.0) [2023-10-08 01:18:01,211][50642] Avg episode reward: [(0, '21.740'), (1, '17.810')] [2023-10-08 01:18:02,813][52059] Updated weights for policy 1, policy_version 35272 (0.0008) [2023-10-08 01:18:03,062][52060] Updated weights for policy 0, policy_version 34820 (0.0007) [2023-10-08 01:18:03,178][52059] Updated weights for policy 1, policy_version 35282 (0.0008) [2023-10-08 01:18:03,431][52060] Updated weights for policy 0, policy_version 34830 (0.0008) [2023-10-08 01:18:03,544][52059] Updated weights for policy 1, policy_version 35292 (0.0007) [2023-10-08 01:18:03,811][52060] Updated weights for policy 0, policy_version 34840 (0.0008) [2023-10-08 01:18:06,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 71827456. Throughput: 0: 1707.9, 1: 1709.1. Samples: 17960758. Policy #0 lag: (min: 31.0, avg: 31.4, max: 45.0) [2023-10-08 01:18:06,211][50642] Avg episode reward: [(0, '17.030'), (1, '14.830')] [2023-10-08 01:18:07,425][52059] Updated weights for policy 1, policy_version 35302 (0.0007) [2023-10-08 01:18:07,699][52060] Updated weights for policy 0, policy_version 34850 (0.0008) [2023-10-08 01:18:07,790][52059] Updated weights for policy 1, policy_version 35312 (0.0007) [2023-10-08 01:18:08,065][52060] Updated weights for policy 0, policy_version 34860 (0.0009) [2023-10-08 01:18:08,151][52059] Updated weights for policy 1, policy_version 35322 (0.0009) [2023-10-08 01:18:08,443][52060] Updated weights for policy 0, policy_version 34870 (0.0008) [2023-10-08 01:18:08,810][52060] Updated weights for policy 0, policy_version 34880 (0.0008) [2023-10-08 01:18:11,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 71892992. Throughput: 0: 1699.6, 1: 1721.3. Samples: 17981932. Policy #0 lag: (min: 31.0, avg: 31.4, max: 45.0) [2023-10-08 01:18:11,211][50642] Avg episode reward: [(0, '19.280'), (1, '15.740')] [2023-10-08 01:18:12,215][52059] Updated weights for policy 1, policy_version 35332 (0.0009) [2023-10-08 01:18:12,576][52059] Updated weights for policy 1, policy_version 35342 (0.0008) [2023-10-08 01:18:12,921][52060] Updated weights for policy 0, policy_version 34890 (0.0007) [2023-10-08 01:18:12,948][52059] Updated weights for policy 1, policy_version 35352 (0.0007) [2023-10-08 01:18:13,300][52060] Updated weights for policy 0, policy_version 34900 (0.0007) [2023-10-08 01:18:13,666][52060] Updated weights for policy 0, policy_version 34910 (0.0008) [2023-10-08 01:18:16,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 71958528. Throughput: 0: 1724.1, 1: 1745.8. Samples: 18003098. Policy #0 lag: (min: 31.0, avg: 31.4, max: 45.0) [2023-10-08 01:18:16,211][50642] Avg episode reward: [(0, '22.900'), (1, '16.120')] [2023-10-08 01:18:16,711][52059] Updated weights for policy 1, policy_version 35362 (0.0007) [2023-10-08 01:18:17,071][52059] Updated weights for policy 1, policy_version 35372 (0.0009) [2023-10-08 01:18:17,440][52059] Updated weights for policy 1, policy_version 35382 (0.0007) [2023-10-08 01:18:17,616][52060] Updated weights for policy 0, policy_version 34920 (0.0008) [2023-10-08 01:18:17,799][52059] Updated weights for policy 1, policy_version 35392 (0.0007) [2023-10-08 01:18:17,979][52060] Updated weights for policy 0, policy_version 34930 (0.0007) [2023-10-08 01:18:18,353][52060] Updated weights for policy 0, policy_version 34940 (0.0010) [2023-10-08 01:18:21,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 72024064. Throughput: 0: 1695.6, 1: 1719.5. Samples: 18012506. Policy #0 lag: (min: 31.0, avg: 31.4, max: 45.0) [2023-10-08 01:18:21,211][50642] Avg episode reward: [(0, '17.460'), (1, '14.680')] [2023-10-08 01:18:21,723][52059] Updated weights for policy 1, policy_version 35402 (0.0010) [2023-10-08 01:18:22,090][52059] Updated weights for policy 1, policy_version 35412 (0.0010) [2023-10-08 01:18:22,412][52060] Updated weights for policy 0, policy_version 34950 (0.0007) [2023-10-08 01:18:22,464][52059] Updated weights for policy 1, policy_version 35422 (0.0009) [2023-10-08 01:18:22,787][52060] Updated weights for policy 0, policy_version 34960 (0.0007) [2023-10-08 01:18:23,162][52060] Updated weights for policy 0, policy_version 34970 (0.0007) [2023-10-08 01:18:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 72089600. Throughput: 0: 1703.2, 1: 1745.6. Samples: 18033788. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:18:26,211][50642] Avg episode reward: [(0, '19.010'), (1, '16.540')] [2023-10-08 01:18:26,548][52059] Updated weights for policy 1, policy_version 35432 (0.0008) [2023-10-08 01:18:26,915][52059] Updated weights for policy 1, policy_version 35442 (0.0010) [2023-10-08 01:18:27,243][52060] Updated weights for policy 0, policy_version 34980 (0.0008) [2023-10-08 01:18:27,277][52059] Updated weights for policy 1, policy_version 35452 (0.0009) [2023-10-08 01:18:27,608][52060] Updated weights for policy 0, policy_version 34990 (0.0008) [2023-10-08 01:18:27,978][52060] Updated weights for policy 0, policy_version 35000 (0.0007) [2023-10-08 01:18:31,197][52059] Updated weights for policy 1, policy_version 35462 (0.0008) [2023-10-08 01:18:31,210][50642] Fps is (10 sec: 13106.7, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 72155136. Throughput: 0: 1722.7, 1: 1741.1. Samples: 18055176. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:18:31,211][50642] Avg episode reward: [(0, '22.240'), (1, '19.270')] [2023-10-08 01:18:31,561][52059] Updated weights for policy 1, policy_version 35472 (0.0007) [2023-10-08 01:18:31,855][52060] Updated weights for policy 0, policy_version 35010 (0.0007) [2023-10-08 01:18:31,932][52059] Updated weights for policy 1, policy_version 35482 (0.0007) [2023-10-08 01:18:32,233][52060] Updated weights for policy 0, policy_version 35020 (0.0008) [2023-10-08 01:18:32,601][52060] Updated weights for policy 0, policy_version 35030 (0.0009) [2023-10-08 01:18:32,981][52060] Updated weights for policy 0, policy_version 35040 (0.0008) [2023-10-08 01:18:35,924][52059] Updated weights for policy 1, policy_version 35492 (0.0008) [2023-10-08 01:18:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 72220672. Throughput: 0: 1692.1, 1: 1726.4. Samples: 18064468. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:18:36,211][50642] Avg episode reward: [(0, '18.980'), (1, '18.590')] [2023-10-08 01:18:36,287][52059] Updated weights for policy 1, policy_version 35502 (0.0008) [2023-10-08 01:18:36,651][52059] Updated weights for policy 1, policy_version 35512 (0.0008) [2023-10-08 01:18:36,780][52060] Updated weights for policy 0, policy_version 35050 (0.0009) [2023-10-08 01:18:37,154][52060] Updated weights for policy 0, policy_version 35060 (0.0009) [2023-10-08 01:18:37,527][52060] Updated weights for policy 0, policy_version 35070 (0.0009) [2023-10-08 01:18:40,518][52059] Updated weights for policy 1, policy_version 35522 (0.0008) [2023-10-08 01:18:40,876][52059] Updated weights for policy 1, policy_version 35532 (0.0008) [2023-10-08 01:18:41,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 72286208. Throughput: 0: 1723.1, 1: 1743.7. Samples: 18086036. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:18:41,211][50642] Avg episode reward: [(0, '19.370'), (1, '19.740')] [2023-10-08 01:18:41,242][52059] Updated weights for policy 1, policy_version 35542 (0.0007) [2023-10-08 01:18:41,433][52060] Updated weights for policy 0, policy_version 35080 (0.0008) [2023-10-08 01:18:41,608][52059] Updated weights for policy 1, policy_version 35552 (0.0010) [2023-10-08 01:18:41,793][52060] Updated weights for policy 0, policy_version 35090 (0.0008) [2023-10-08 01:18:42,165][52060] Updated weights for policy 0, policy_version 35100 (0.0008) [2023-10-08 01:18:45,476][52059] Updated weights for policy 1, policy_version 35562 (0.0008) [2023-10-08 01:18:45,834][52059] Updated weights for policy 1, policy_version 35572 (0.0008) [2023-10-08 01:18:46,002][52060] Updated weights for policy 0, policy_version 35110 (0.0010) [2023-10-08 01:18:46,207][52059] Updated weights for policy 1, policy_version 35582 (0.0007) [2023-10-08 01:18:46,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 72351744. Throughput: 0: 1725.0, 1: 1732.6. Samples: 18106846. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:18:46,211][50642] Avg episode reward: [(0, '21.830'), (1, '20.990')] [2023-10-08 01:18:46,278][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000035584_36438016.pth... [2023-10-08 01:18:46,315][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000033952_34766848.pth [2023-10-08 01:18:46,370][52060] Updated weights for policy 0, policy_version 35120 (0.0007) [2023-10-08 01:18:46,742][52060] Updated weights for policy 0, policy_version 35130 (0.0009) [2023-10-08 01:18:46,959][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000035136_35979264.pth... [2023-10-08 01:18:46,989][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000033504_34308096.pth [2023-10-08 01:18:50,149][52059] Updated weights for policy 1, policy_version 35592 (0.0010) [2023-10-08 01:18:50,527][52059] Updated weights for policy 1, policy_version 35602 (0.0011) [2023-10-08 01:18:50,775][52060] Updated weights for policy 0, policy_version 35140 (0.0008) [2023-10-08 01:18:50,888][52059] Updated weights for policy 1, policy_version 35612 (0.0009) [2023-10-08 01:18:51,140][52060] Updated weights for policy 0, policy_version 35150 (0.0008) [2023-10-08 01:18:51,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 72450048. Throughput: 0: 1721.7, 1: 1751.7. Samples: 18117058. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 01:18:51,211][50642] Avg episode reward: [(0, '19.160'), (1, '19.920')] [2023-10-08 01:18:51,511][52060] Updated weights for policy 0, policy_version 35160 (0.0009) [2023-10-08 01:18:54,915][52059] Updated weights for policy 1, policy_version 35622 (0.0008) [2023-10-08 01:18:55,292][52059] Updated weights for policy 1, policy_version 35632 (0.0009) [2023-10-08 01:18:55,602][52060] Updated weights for policy 0, policy_version 35170 (0.0009) [2023-10-08 01:18:55,653][52059] Updated weights for policy 1, policy_version 35642 (0.0008) [2023-10-08 01:18:55,970][52060] Updated weights for policy 0, policy_version 35180 (0.0008) [2023-10-08 01:18:56,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 72515584. Throughput: 0: 1729.2, 1: 1739.9. Samples: 18138038. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 01:18:56,211][50642] Avg episode reward: [(0, '19.460'), (1, '18.300')] [2023-10-08 01:18:56,341][52060] Updated weights for policy 0, policy_version 35190 (0.0007) [2023-10-08 01:18:56,711][52060] Updated weights for policy 0, policy_version 35200 (0.0009) [2023-10-08 01:18:59,560][52059] Updated weights for policy 1, policy_version 35652 (0.0010) [2023-10-08 01:18:59,930][52059] Updated weights for policy 1, policy_version 35662 (0.0008) [2023-10-08 01:19:00,298][52059] Updated weights for policy 1, policy_version 35672 (0.0007) [2023-10-08 01:19:00,677][52060] Updated weights for policy 0, policy_version 35210 (0.0008) [2023-10-08 01:19:01,043][52060] Updated weights for policy 0, policy_version 35220 (0.0011) [2023-10-08 01:19:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 72581120. Throughput: 0: 1716.0, 1: 1717.8. Samples: 18157620. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 01:19:01,211][50642] Avg episode reward: [(0, '20.620'), (1, '17.630')] [2023-10-08 01:19:01,408][52060] Updated weights for policy 0, policy_version 35230 (0.0010) [2023-10-08 01:19:04,324][52059] Updated weights for policy 1, policy_version 35682 (0.0007) [2023-10-08 01:19:04,685][52059] Updated weights for policy 1, policy_version 35692 (0.0007) [2023-10-08 01:19:05,054][52059] Updated weights for policy 1, policy_version 35702 (0.0011) [2023-10-08 01:19:05,416][52059] Updated weights for policy 1, policy_version 35712 (0.0009) [2023-10-08 01:19:05,444][52060] Updated weights for policy 0, policy_version 35240 (0.0009) [2023-10-08 01:19:05,807][52060] Updated weights for policy 0, policy_version 35250 (0.0010) [2023-10-08 01:19:06,176][52060] Updated weights for policy 0, policy_version 35260 (0.0009) [2023-10-08 01:19:06,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 72646656. Throughput: 0: 1728.9, 1: 1742.7. Samples: 18168728. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 01:19:06,211][50642] Avg episode reward: [(0, '18.160'), (1, '22.070')] [2023-10-08 01:19:09,335][52059] Updated weights for policy 1, policy_version 35722 (0.0007) [2023-10-08 01:19:09,689][52059] Updated weights for policy 1, policy_version 35732 (0.0008) [2023-10-08 01:19:10,050][52059] Updated weights for policy 1, policy_version 35742 (0.0009) [2023-10-08 01:19:10,138][52060] Updated weights for policy 0, policy_version 35270 (0.0010) [2023-10-08 01:19:10,507][52060] Updated weights for policy 0, policy_version 35280 (0.0009) [2023-10-08 01:19:10,873][52060] Updated weights for policy 0, policy_version 35290 (0.0011) [2023-10-08 01:19:11,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 72744960. Throughput: 0: 1733.7, 1: 1720.5. Samples: 18189228. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) [2023-10-08 01:19:11,211][50642] Avg episode reward: [(0, '17.400'), (1, '19.840')] [2023-10-08 01:19:14,109][52059] Updated weights for policy 1, policy_version 35752 (0.0008) [2023-10-08 01:19:14,476][52059] Updated weights for policy 1, policy_version 35762 (0.0009) [2023-10-08 01:19:14,844][52059] Updated weights for policy 1, policy_version 35772 (0.0008) [2023-10-08 01:19:14,880][52060] Updated weights for policy 0, policy_version 35300 (0.0009) [2023-10-08 01:19:15,249][52060] Updated weights for policy 0, policy_version 35310 (0.0008) [2023-10-08 01:19:15,615][52060] Updated weights for policy 0, policy_version 35320 (0.0008) [2023-10-08 01:19:16,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 72810496. Throughput: 0: 1700.3, 1: 1713.7. Samples: 18208804. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) [2023-10-08 01:19:16,211][50642] Avg episode reward: [(0, '20.030'), (1, '19.200')] [2023-10-08 01:19:18,654][52059] Updated weights for policy 1, policy_version 35782 (0.0008) [2023-10-08 01:19:19,023][52059] Updated weights for policy 1, policy_version 35792 (0.0008) [2023-10-08 01:19:19,377][52059] Updated weights for policy 1, policy_version 35802 (0.0007) [2023-10-08 01:19:19,510][52060] Updated weights for policy 0, policy_version 35330 (0.0008) [2023-10-08 01:19:19,891][52060] Updated weights for policy 0, policy_version 35340 (0.0009) [2023-10-08 01:19:20,257][52060] Updated weights for policy 0, policy_version 35350 (0.0008) [2023-10-08 01:19:20,622][52060] Updated weights for policy 0, policy_version 35360 (0.0009) [2023-10-08 01:19:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 72876032. Throughput: 0: 1728.0, 1: 1733.8. Samples: 18220248. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) [2023-10-08 01:19:21,211][50642] Avg episode reward: [(0, '18.920'), (1, '21.240')] [2023-10-08 01:19:23,401][52059] Updated weights for policy 1, policy_version 35812 (0.0008) [2023-10-08 01:19:23,767][52059] Updated weights for policy 1, policy_version 35822 (0.0009) [2023-10-08 01:19:24,132][52059] Updated weights for policy 1, policy_version 35832 (0.0008) [2023-10-08 01:19:24,528][52060] Updated weights for policy 0, policy_version 35370 (0.0010) [2023-10-08 01:19:24,892][52060] Updated weights for policy 0, policy_version 35380 (0.0009) [2023-10-08 01:19:25,262][52060] Updated weights for policy 0, policy_version 35390 (0.0007) [2023-10-08 01:19:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 72941568. Throughput: 0: 1712.1, 1: 1704.5. Samples: 18239786. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) [2023-10-08 01:19:26,211][50642] Avg episode reward: [(0, '18.550'), (1, '21.270')] [2023-10-08 01:19:27,963][52059] Updated weights for policy 1, policy_version 35842 (0.0008) [2023-10-08 01:19:28,333][52059] Updated weights for policy 1, policy_version 35852 (0.0008) [2023-10-08 01:19:28,694][52059] Updated weights for policy 1, policy_version 35862 (0.0007) [2023-10-08 01:19:29,051][52059] Updated weights for policy 1, policy_version 35872 (0.0009) [2023-10-08 01:19:29,133][52060] Updated weights for policy 0, policy_version 35400 (0.0008) [2023-10-08 01:19:29,517][52060] Updated weights for policy 0, policy_version 35410 (0.0009) [2023-10-08 01:19:29,881][52060] Updated weights for policy 0, policy_version 35420 (0.0007) [2023-10-08 01:19:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.6, 300 sec: 13773.7). Total num frames: 73007104. Throughput: 0: 1696.2, 1: 1722.7. Samples: 18260694. Policy #0 lag: (min: 22.0, avg: 24.9, max: 54.0) [2023-10-08 01:19:31,211][50642] Avg episode reward: [(0, '19.950'), (1, '17.450')] [2023-10-08 01:19:32,872][52059] Updated weights for policy 1, policy_version 35882 (0.0009) [2023-10-08 01:19:33,245][52059] Updated weights for policy 1, policy_version 35892 (0.0007) [2023-10-08 01:19:33,613][52059] Updated weights for policy 1, policy_version 35902 (0.0007) [2023-10-08 01:19:33,954][52060] Updated weights for policy 0, policy_version 35430 (0.0009) [2023-10-08 01:19:34,321][52060] Updated weights for policy 0, policy_version 35440 (0.0007) [2023-10-08 01:19:34,691][52060] Updated weights for policy 0, policy_version 35450 (0.0010) [2023-10-08 01:19:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 73072640. Throughput: 0: 1719.4, 1: 1708.8. Samples: 18271324. Policy #0 lag: (min: 4.0, avg: 12.0, max: 36.0) [2023-10-08 01:19:36,211][50642] Avg episode reward: [(0, '20.630'), (1, '19.890')] [2023-10-08 01:19:37,594][52059] Updated weights for policy 1, policy_version 35912 (0.0008) [2023-10-08 01:19:37,970][52059] Updated weights for policy 1, policy_version 35922 (0.0010) [2023-10-08 01:19:38,323][52059] Updated weights for policy 1, policy_version 35932 (0.0010) [2023-10-08 01:19:38,739][52060] Updated weights for policy 0, policy_version 35460 (0.0008) [2023-10-08 01:19:39,119][52060] Updated weights for policy 0, policy_version 35470 (0.0010) [2023-10-08 01:19:39,483][52060] Updated weights for policy 0, policy_version 35480 (0.0011) [2023-10-08 01:19:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 73138176. Throughput: 0: 1691.4, 1: 1717.1. Samples: 18291420. Policy #0 lag: (min: 4.0, avg: 12.0, max: 36.0) [2023-10-08 01:19:41,211][50642] Avg episode reward: [(0, '17.510'), (1, '20.680')] [2023-10-08 01:19:42,267][52059] Updated weights for policy 1, policy_version 35942 (0.0009) [2023-10-08 01:19:42,628][52059] Updated weights for policy 1, policy_version 35952 (0.0007) [2023-10-08 01:19:42,988][52059] Updated weights for policy 1, policy_version 35962 (0.0007) [2023-10-08 01:19:43,345][52060] Updated weights for policy 0, policy_version 35490 (0.0008) [2023-10-08 01:19:43,711][52060] Updated weights for policy 0, policy_version 35500 (0.0007) [2023-10-08 01:19:44,091][52060] Updated weights for policy 0, policy_version 35510 (0.0010) [2023-10-08 01:19:44,462][52060] Updated weights for policy 0, policy_version 35520 (0.0007) [2023-10-08 01:19:46,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 73203712. Throughput: 0: 1713.1, 1: 1737.7. Samples: 18312910. Policy #0 lag: (min: 4.0, avg: 12.0, max: 36.0) [2023-10-08 01:19:46,211][50642] Avg episode reward: [(0, '18.340'), (1, '18.400')] [2023-10-08 01:19:46,860][52059] Updated weights for policy 1, policy_version 35972 (0.0007) [2023-10-08 01:19:47,228][52059] Updated weights for policy 1, policy_version 35982 (0.0007) [2023-10-08 01:19:47,598][52059] Updated weights for policy 1, policy_version 35992 (0.0008) [2023-10-08 01:19:48,538][52060] Updated weights for policy 0, policy_version 35530 (0.0010) [2023-10-08 01:19:48,912][52060] Updated weights for policy 0, policy_version 35540 (0.0010) [2023-10-08 01:19:49,287][52060] Updated weights for policy 0, policy_version 35550 (0.0008) [2023-10-08 01:19:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 73269248. Throughput: 0: 1710.4, 1: 1711.3. Samples: 18322704. Policy #0 lag: (min: 4.0, avg: 12.0, max: 36.0) [2023-10-08 01:19:51,211][50642] Avg episode reward: [(0, '20.830'), (1, '20.670')] [2023-10-08 01:19:51,397][52059] Updated weights for policy 1, policy_version 36002 (0.0009) [2023-10-08 01:19:51,757][52059] Updated weights for policy 1, policy_version 36012 (0.0009) [2023-10-08 01:19:52,126][52059] Updated weights for policy 1, policy_version 36022 (0.0009) [2023-10-08 01:19:52,495][52059] Updated weights for policy 1, policy_version 36032 (0.0010) [2023-10-08 01:19:53,074][52060] Updated weights for policy 0, policy_version 35560 (0.0011) [2023-10-08 01:19:53,453][52060] Updated weights for policy 0, policy_version 35570 (0.0010) [2023-10-08 01:19:53,827][52060] Updated weights for policy 0, policy_version 35580 (0.0008) [2023-10-08 01:19:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 73334784. Throughput: 0: 1700.5, 1: 1737.8. Samples: 18343954. Policy #0 lag: (min: 4.0, avg: 12.0, max: 36.0) [2023-10-08 01:19:56,211][50642] Avg episode reward: [(0, '18.050'), (1, '21.210')] [2023-10-08 01:19:56,379][52059] Updated weights for policy 1, policy_version 36042 (0.0009) [2023-10-08 01:19:56,748][52059] Updated weights for policy 1, policy_version 36052 (0.0008) [2023-10-08 01:19:57,119][52059] Updated weights for policy 1, policy_version 36062 (0.0007) [2023-10-08 01:19:57,794][52060] Updated weights for policy 0, policy_version 35590 (0.0009) [2023-10-08 01:19:58,168][52060] Updated weights for policy 0, policy_version 35600 (0.0007) [2023-10-08 01:19:58,539][52060] Updated weights for policy 0, policy_version 35610 (0.0007) [2023-10-08 01:20:01,108][52059] Updated weights for policy 1, policy_version 36072 (0.0011) [2023-10-08 01:20:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 73400320. Throughput: 0: 1726.2, 1: 1751.3. Samples: 18365290. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:20:01,211][50642] Avg episode reward: [(0, '19.440'), (1, '20.690')] [2023-10-08 01:20:01,492][52059] Updated weights for policy 1, policy_version 36082 (0.0010) [2023-10-08 01:20:01,857][52059] Updated weights for policy 1, policy_version 36092 (0.0008) [2023-10-08 01:20:02,532][52060] Updated weights for policy 0, policy_version 35620 (0.0007) [2023-10-08 01:20:02,903][52060] Updated weights for policy 0, policy_version 35630 (0.0008) [2023-10-08 01:20:03,272][52060] Updated weights for policy 0, policy_version 35640 (0.0009) [2023-10-08 01:20:05,787][52059] Updated weights for policy 1, policy_version 36102 (0.0009) [2023-10-08 01:20:06,164][52059] Updated weights for policy 1, policy_version 36112 (0.0008) [2023-10-08 01:20:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 73465856. Throughput: 0: 1698.3, 1: 1730.9. Samples: 18374562. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:20:06,211][50642] Avg episode reward: [(0, '19.290'), (1, '19.210')] [2023-10-08 01:20:06,528][52059] Updated weights for policy 1, policy_version 36122 (0.0009) [2023-10-08 01:20:07,121][52060] Updated weights for policy 0, policy_version 35650 (0.0009) [2023-10-08 01:20:07,487][52060] Updated weights for policy 0, policy_version 35660 (0.0008) [2023-10-08 01:20:07,857][52060] Updated weights for policy 0, policy_version 35670 (0.0008) [2023-10-08 01:20:08,234][52060] Updated weights for policy 0, policy_version 35680 (0.0009) [2023-10-08 01:20:10,469][52059] Updated weights for policy 1, policy_version 36132 (0.0007) [2023-10-08 01:20:10,837][52059] Updated weights for policy 1, policy_version 36142 (0.0008) [2023-10-08 01:20:11,197][52059] Updated weights for policy 1, policy_version 36152 (0.0009) [2023-10-08 01:20:11,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 73531392. Throughput: 0: 1718.4, 1: 1756.8. Samples: 18396172. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:20:11,211][50642] Avg episode reward: [(0, '18.500'), (1, '19.380')] [2023-10-08 01:20:12,212][52060] Updated weights for policy 0, policy_version 35690 (0.0010) [2023-10-08 01:20:12,586][52060] Updated weights for policy 0, policy_version 35700 (0.0009) [2023-10-08 01:20:12,952][52060] Updated weights for policy 0, policy_version 35710 (0.0008) [2023-10-08 01:20:15,133][52059] Updated weights for policy 1, policy_version 36162 (0.0009) [2023-10-08 01:20:15,496][52059] Updated weights for policy 1, policy_version 36172 (0.0010) [2023-10-08 01:20:15,862][52059] Updated weights for policy 1, policy_version 36182 (0.0011) [2023-10-08 01:20:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 73596928. Throughput: 0: 1733.7, 1: 1733.9. Samples: 18416736. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:20:16,211][50642] Avg episode reward: [(0, '19.770'), (1, '23.110')] [2023-10-08 01:20:16,231][52059] Updated weights for policy 1, policy_version 36192 (0.0010) [2023-10-08 01:20:16,231][51710] Saving new best policy, reward=23.110! [2023-10-08 01:20:16,938][52060] Updated weights for policy 0, policy_version 35720 (0.0007) [2023-10-08 01:20:17,301][52060] Updated weights for policy 0, policy_version 35730 (0.0009) [2023-10-08 01:20:17,670][52060] Updated weights for policy 0, policy_version 35740 (0.0009) [2023-10-08 01:20:20,241][52059] Updated weights for policy 1, policy_version 36202 (0.0007) [2023-10-08 01:20:20,607][52059] Updated weights for policy 1, policy_version 36212 (0.0007) [2023-10-08 01:20:20,965][52059] Updated weights for policy 1, policy_version 36222 (0.0008) [2023-10-08 01:20:21,210][50642] Fps is (10 sec: 16384.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 73695232. Throughput: 0: 1709.1, 1: 1752.0. Samples: 18427074. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:20:21,211][50642] Avg episode reward: [(0, '20.950'), (1, '20.150')] [2023-10-08 01:20:21,656][52060] Updated weights for policy 0, policy_version 35750 (0.0008) [2023-10-08 01:20:22,029][52060] Updated weights for policy 0, policy_version 35760 (0.0008) [2023-10-08 01:20:22,397][52060] Updated weights for policy 0, policy_version 35770 (0.0008) [2023-10-08 01:20:24,787][52059] Updated weights for policy 1, policy_version 36232 (0.0009) [2023-10-08 01:20:25,156][52059] Updated weights for policy 1, policy_version 36242 (0.0009) [2023-10-08 01:20:25,517][52059] Updated weights for policy 1, policy_version 36252 (0.0010) [2023-10-08 01:20:26,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 73760768. Throughput: 0: 1741.9, 1: 1743.6. Samples: 18448270. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:20:26,211][50642] Avg episode reward: [(0, '19.720'), (1, '20.200')] [2023-10-08 01:20:26,225][52060] Updated weights for policy 0, policy_version 35780 (0.0007) [2023-10-08 01:20:26,592][52060] Updated weights for policy 0, policy_version 35790 (0.0011) [2023-10-08 01:20:26,962][52060] Updated weights for policy 0, policy_version 35800 (0.0010) [2023-10-08 01:20:29,520][52059] Updated weights for policy 1, policy_version 36262 (0.0009) [2023-10-08 01:20:29,892][52059] Updated weights for policy 1, policy_version 36272 (0.0010) [2023-10-08 01:20:30,256][52059] Updated weights for policy 1, policy_version 36282 (0.0010) [2023-10-08 01:20:31,071][52060] Updated weights for policy 0, policy_version 35810 (0.0009) [2023-10-08 01:20:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 73826304. Throughput: 0: 1737.2, 1: 1728.1. Samples: 18468846. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:20:31,211][50642] Avg episode reward: [(0, '20.380'), (1, '20.050')] [2023-10-08 01:20:31,446][52060] Updated weights for policy 0, policy_version 35820 (0.0009) [2023-10-08 01:20:31,816][52060] Updated weights for policy 0, policy_version 35830 (0.0008) [2023-10-08 01:20:32,192][52060] Updated weights for policy 0, policy_version 35840 (0.0008) [2023-10-08 01:20:34,310][52059] Updated weights for policy 1, policy_version 36292 (0.0009) [2023-10-08 01:20:34,662][52059] Updated weights for policy 1, policy_version 36302 (0.0010) [2023-10-08 01:20:35,031][52059] Updated weights for policy 1, policy_version 36312 (0.0008) [2023-10-08 01:20:36,174][52060] Updated weights for policy 0, policy_version 35850 (0.0010) [2023-10-08 01:20:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 73891840. Throughput: 0: 1725.9, 1: 1756.7. Samples: 18479420. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:20:36,211][50642] Avg episode reward: [(0, '22.230'), (1, '20.530')] [2023-10-08 01:20:36,546][52060] Updated weights for policy 0, policy_version 35860 (0.0007) [2023-10-08 01:20:36,918][52060] Updated weights for policy 0, policy_version 35870 (0.0007) [2023-10-08 01:20:38,701][52059] Updated weights for policy 1, policy_version 36322 (0.0010) [2023-10-08 01:20:39,064][52059] Updated weights for policy 1, policy_version 36332 (0.0008) [2023-10-08 01:20:39,419][52059] Updated weights for policy 1, policy_version 36342 (0.0008) [2023-10-08 01:20:39,785][52059] Updated weights for policy 1, policy_version 36352 (0.0008) [2023-10-08 01:20:40,785][52060] Updated weights for policy 0, policy_version 35880 (0.0008) [2023-10-08 01:20:41,151][52060] Updated weights for policy 0, policy_version 35890 (0.0007) [2023-10-08 01:20:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 73957376. Throughput: 0: 1732.6, 1: 1720.3. Samples: 18499336. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:20:41,211][50642] Avg episode reward: [(0, '19.520'), (1, '19.830')] [2023-10-08 01:20:41,523][52060] Updated weights for policy 0, policy_version 35900 (0.0007) [2023-10-08 01:20:43,733][52059] Updated weights for policy 1, policy_version 36362 (0.0009) [2023-10-08 01:20:44,094][52059] Updated weights for policy 1, policy_version 36372 (0.0007) [2023-10-08 01:20:44,459][52059] Updated weights for policy 1, policy_version 36382 (0.0010) [2023-10-08 01:20:45,437][52060] Updated weights for policy 0, policy_version 35910 (0.0010) [2023-10-08 01:20:45,810][52060] Updated weights for policy 0, policy_version 35920 (0.0009) [2023-10-08 01:20:46,180][52060] Updated weights for policy 0, policy_version 35930 (0.0007) [2023-10-08 01:20:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 74022912. Throughput: 0: 1719.2, 1: 1717.6. Samples: 18519942. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:20:46,211][50642] Avg episode reward: [(0, '18.830'), (1, '19.330')] [2023-10-08 01:20:46,218][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000036384_37257216.pth... [2023-10-08 01:20:46,248][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000034752_35586048.pth [2023-10-08 01:20:46,400][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000035936_36798464.pth... [2023-10-08 01:20:46,430][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000034336_35160064.pth [2023-10-08 01:20:48,362][52059] Updated weights for policy 1, policy_version 36392 (0.0010) [2023-10-08 01:20:48,734][52059] Updated weights for policy 1, policy_version 36402 (0.0010) [2023-10-08 01:20:49,094][52059] Updated weights for policy 1, policy_version 36412 (0.0011) [2023-10-08 01:20:50,154][52060] Updated weights for policy 0, policy_version 35940 (0.0008) [2023-10-08 01:20:50,530][52060] Updated weights for policy 0, policy_version 35950 (0.0009) [2023-10-08 01:20:50,892][52060] Updated weights for policy 0, policy_version 35960 (0.0011) [2023-10-08 01:20:51,210][50642] Fps is (10 sec: 16383.6, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 74121216. Throughput: 0: 1732.3, 1: 1728.8. Samples: 18530312. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:20:51,211][50642] Avg episode reward: [(0, '20.060'), (1, '19.500')] [2023-10-08 01:20:53,055][52059] Updated weights for policy 1, policy_version 36422 (0.0010) [2023-10-08 01:20:53,419][52059] Updated weights for policy 1, policy_version 36432 (0.0009) [2023-10-08 01:20:53,783][52059] Updated weights for policy 1, policy_version 36442 (0.0008) [2023-10-08 01:20:55,015][52060] Updated weights for policy 0, policy_version 35970 (0.0010) [2023-10-08 01:20:55,381][52060] Updated weights for policy 0, policy_version 35980 (0.0008) [2023-10-08 01:20:55,743][52060] Updated weights for policy 0, policy_version 35990 (0.0009) [2023-10-08 01:20:56,116][52060] Updated weights for policy 0, policy_version 36000 (0.0009) [2023-10-08 01:20:56,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 74186752. Throughput: 0: 1721.2, 1: 1720.9. Samples: 18551066. Policy #0 lag: (min: 0.0, avg: 24.2, max: 32.0) [2023-10-08 01:20:56,211][50642] Avg episode reward: [(0, '18.350'), (1, '20.400')] [2023-10-08 01:20:57,608][52059] Updated weights for policy 1, policy_version 36452 (0.0010) [2023-10-08 01:20:57,958][52059] Updated weights for policy 1, policy_version 36462 (0.0010) [2023-10-08 01:20:58,322][52059] Updated weights for policy 1, policy_version 36472 (0.0010) [2023-10-08 01:20:59,995][52060] Updated weights for policy 0, policy_version 36010 (0.0008) [2023-10-08 01:21:00,361][52060] Updated weights for policy 0, policy_version 36020 (0.0008) [2023-10-08 01:21:00,745][52060] Updated weights for policy 0, policy_version 36030 (0.0010) [2023-10-08 01:21:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 74252288. Throughput: 0: 1696.9, 1: 1742.1. Samples: 18571490. Policy #0 lag: (min: 0.0, avg: 24.2, max: 32.0) [2023-10-08 01:21:01,211][50642] Avg episode reward: [(0, '18.900'), (1, '19.230')] [2023-10-08 01:21:02,225][52059] Updated weights for policy 1, policy_version 36482 (0.0010) [2023-10-08 01:21:02,591][52059] Updated weights for policy 1, policy_version 36492 (0.0007) [2023-10-08 01:21:02,962][52059] Updated weights for policy 1, policy_version 36502 (0.0008) [2023-10-08 01:21:03,324][52059] Updated weights for policy 1, policy_version 36512 (0.0008) [2023-10-08 01:21:04,540][52060] Updated weights for policy 0, policy_version 36040 (0.0009) [2023-10-08 01:21:04,913][52060] Updated weights for policy 0, policy_version 36050 (0.0008) [2023-10-08 01:21:05,285][52060] Updated weights for policy 0, policy_version 36060 (0.0010) [2023-10-08 01:21:06,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 74317824. Throughput: 0: 1725.6, 1: 1720.1. Samples: 18582132. Policy #0 lag: (min: 0.0, avg: 24.2, max: 32.0) [2023-10-08 01:21:06,211][50642] Avg episode reward: [(0, '19.690'), (1, '18.470')] [2023-10-08 01:21:07,285][52059] Updated weights for policy 1, policy_version 36522 (0.0009) [2023-10-08 01:21:07,652][52059] Updated weights for policy 1, policy_version 36532 (0.0007) [2023-10-08 01:21:08,009][52059] Updated weights for policy 1, policy_version 36542 (0.0009) [2023-10-08 01:21:09,355][52060] Updated weights for policy 0, policy_version 36070 (0.0009) [2023-10-08 01:21:09,711][52060] Updated weights for policy 0, policy_version 36080 (0.0008) [2023-10-08 01:21:10,082][52060] Updated weights for policy 0, policy_version 36090 (0.0007) [2023-10-08 01:21:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 74383360. Throughput: 0: 1703.2, 1: 1734.5. Samples: 18602968. Policy #0 lag: (min: 0.0, avg: 24.2, max: 32.0) [2023-10-08 01:21:11,211][50642] Avg episode reward: [(0, '20.540'), (1, '19.940')] [2023-10-08 01:21:11,870][52059] Updated weights for policy 1, policy_version 36552 (0.0008) [2023-10-08 01:21:12,235][52059] Updated weights for policy 1, policy_version 36562 (0.0007) [2023-10-08 01:21:12,602][52059] Updated weights for policy 1, policy_version 36572 (0.0008) [2023-10-08 01:21:14,059][52060] Updated weights for policy 0, policy_version 36100 (0.0010) [2023-10-08 01:21:14,420][52060] Updated weights for policy 0, policy_version 36110 (0.0007) [2023-10-08 01:21:14,798][52060] Updated weights for policy 0, policy_version 36120 (0.0010) [2023-10-08 01:21:16,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 74448896. Throughput: 0: 1691.3, 1: 1755.5. Samples: 18623952. Policy #0 lag: (min: 0.0, avg: 24.2, max: 32.0) [2023-10-08 01:21:16,211][50642] Avg episode reward: [(0, '19.740'), (1, '19.470')] [2023-10-08 01:21:16,385][52059] Updated weights for policy 1, policy_version 36582 (0.0007) [2023-10-08 01:21:16,743][52059] Updated weights for policy 1, policy_version 36592 (0.0007) [2023-10-08 01:21:17,106][52059] Updated weights for policy 1, policy_version 36602 (0.0008) [2023-10-08 01:21:18,809][52060] Updated weights for policy 0, policy_version 36130 (0.0009) [2023-10-08 01:21:19,180][52060] Updated weights for policy 0, policy_version 36140 (0.0009) [2023-10-08 01:21:19,555][52060] Updated weights for policy 0, policy_version 36150 (0.0009) [2023-10-08 01:21:19,922][52060] Updated weights for policy 0, policy_version 36160 (0.0007) [2023-10-08 01:21:21,070][52059] Updated weights for policy 1, policy_version 36612 (0.0007) [2023-10-08 01:21:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 74514432. Throughput: 0: 1723.7, 1: 1728.5. Samples: 18634772. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:21:21,211][50642] Avg episode reward: [(0, '19.880'), (1, '18.580')] [2023-10-08 01:21:21,443][52059] Updated weights for policy 1, policy_version 36622 (0.0008) [2023-10-08 01:21:21,811][52059] Updated weights for policy 1, policy_version 36632 (0.0008) [2023-10-08 01:21:23,890][52060] Updated weights for policy 0, policy_version 36170 (0.0009) [2023-10-08 01:21:24,270][52060] Updated weights for policy 0, policy_version 36180 (0.0011) [2023-10-08 01:21:24,633][52060] Updated weights for policy 0, policy_version 36190 (0.0008) [2023-10-08 01:21:25,632][52059] Updated weights for policy 1, policy_version 36642 (0.0009) [2023-10-08 01:21:26,005][52059] Updated weights for policy 1, policy_version 36652 (0.0009) [2023-10-08 01:21:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 74579968. Throughput: 0: 1700.4, 1: 1758.8. Samples: 18655000. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:21:26,211][50642] Avg episode reward: [(0, '20.220'), (1, '19.390')] [2023-10-08 01:21:26,377][52059] Updated weights for policy 1, policy_version 36662 (0.0008) [2023-10-08 01:21:26,742][52059] Updated weights for policy 1, policy_version 36672 (0.0008) [2023-10-08 01:21:28,796][52060] Updated weights for policy 0, policy_version 36200 (0.0008) [2023-10-08 01:21:29,157][52060] Updated weights for policy 0, policy_version 36210 (0.0008) [2023-10-08 01:21:29,536][52060] Updated weights for policy 0, policy_version 36220 (0.0009) [2023-10-08 01:21:30,629][52059] Updated weights for policy 1, policy_version 36682 (0.0008) [2023-10-08 01:21:30,997][52059] Updated weights for policy 1, policy_version 36692 (0.0009) [2023-10-08 01:21:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 74645504. Throughput: 0: 1710.4, 1: 1749.2. Samples: 18675626. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:21:31,211][50642] Avg episode reward: [(0, '19.890'), (1, '22.930')] [2023-10-08 01:21:31,366][52059] Updated weights for policy 1, policy_version 36702 (0.0009) [2023-10-08 01:21:33,481][52060] Updated weights for policy 0, policy_version 36230 (0.0007) [2023-10-08 01:21:33,850][52060] Updated weights for policy 0, policy_version 36240 (0.0007) [2023-10-08 01:21:34,217][52060] Updated weights for policy 0, policy_version 36250 (0.0009) [2023-10-08 01:21:35,319][52059] Updated weights for policy 1, policy_version 36712 (0.0007) [2023-10-08 01:21:35,700][52059] Updated weights for policy 1, policy_version 36722 (0.0008) [2023-10-08 01:21:36,067][52059] Updated weights for policy 1, policy_version 36732 (0.0009) [2023-10-08 01:21:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 74711040. Throughput: 0: 1710.0, 1: 1754.7. Samples: 18686224. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:21:36,211][50642] Avg episode reward: [(0, '19.280'), (1, '18.440')] [2023-10-08 01:21:38,152][52060] Updated weights for policy 0, policy_version 36260 (0.0007) [2023-10-08 01:21:38,521][52060] Updated weights for policy 0, policy_version 36270 (0.0008) [2023-10-08 01:21:38,888][52060] Updated weights for policy 0, policy_version 36280 (0.0009) [2023-10-08 01:21:40,010][52059] Updated weights for policy 1, policy_version 36742 (0.0009) [2023-10-08 01:21:40,367][52059] Updated weights for policy 1, policy_version 36752 (0.0009) [2023-10-08 01:21:40,729][52059] Updated weights for policy 1, policy_version 36762 (0.0009) [2023-10-08 01:21:41,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 74809344. Throughput: 0: 1702.7, 1: 1760.5. Samples: 18706910. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:21:41,211][50642] Avg episode reward: [(0, '20.810'), (1, '17.720')] [2023-10-08 01:21:42,857][52060] Updated weights for policy 0, policy_version 36290 (0.0009) [2023-10-08 01:21:43,224][52060] Updated weights for policy 0, policy_version 36300 (0.0010) [2023-10-08 01:21:43,585][52060] Updated weights for policy 0, policy_version 36310 (0.0010) [2023-10-08 01:21:43,955][52060] Updated weights for policy 0, policy_version 36320 (0.0010) [2023-10-08 01:21:44,735][52059] Updated weights for policy 1, policy_version 36772 (0.0008) [2023-10-08 01:21:45,109][52059] Updated weights for policy 1, policy_version 36782 (0.0008) [2023-10-08 01:21:45,479][52059] Updated weights for policy 1, policy_version 36792 (0.0008) [2023-10-08 01:21:46,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.4, 300 sec: 13884.8). Total num frames: 74874880. Throughput: 0: 1726.3, 1: 1732.6. Samples: 18727142. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:21:46,211][50642] Avg episode reward: [(0, '18.580'), (1, '19.870')] [2023-10-08 01:21:47,907][52060] Updated weights for policy 0, policy_version 36330 (0.0010) [2023-10-08 01:21:48,272][52060] Updated weights for policy 0, policy_version 36340 (0.0010) [2023-10-08 01:21:48,646][52060] Updated weights for policy 0, policy_version 36350 (0.0011) [2023-10-08 01:21:49,437][52059] Updated weights for policy 1, policy_version 36802 (0.0009) [2023-10-08 01:21:49,808][52059] Updated weights for policy 1, policy_version 36812 (0.0009) [2023-10-08 01:21:50,175][52059] Updated weights for policy 1, policy_version 36822 (0.0008) [2023-10-08 01:21:50,545][52059] Updated weights for policy 1, policy_version 36832 (0.0009) [2023-10-08 01:21:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 74940416. Throughput: 0: 1693.8, 1: 1762.4. Samples: 18737658. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:21:51,211][50642] Avg episode reward: [(0, '18.790'), (1, '18.450')] [2023-10-08 01:21:52,580][52060] Updated weights for policy 0, policy_version 36360 (0.0008) [2023-10-08 01:21:52,946][52060] Updated weights for policy 0, policy_version 36370 (0.0008) [2023-10-08 01:21:53,316][52060] Updated weights for policy 0, policy_version 36380 (0.0007) [2023-10-08 01:21:54,444][52059] Updated weights for policy 1, policy_version 36842 (0.0007) [2023-10-08 01:21:54,801][52059] Updated weights for policy 1, policy_version 36852 (0.0007) [2023-10-08 01:21:55,176][52059] Updated weights for policy 1, policy_version 36862 (0.0008) [2023-10-08 01:21:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 75005952. Throughput: 0: 1714.6, 1: 1739.0. Samples: 18758380. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:21:56,211][50642] Avg episode reward: [(0, '19.520'), (1, '18.350')] [2023-10-08 01:21:57,191][52060] Updated weights for policy 0, policy_version 36390 (0.0007) [2023-10-08 01:21:57,564][52060] Updated weights for policy 0, policy_version 36400 (0.0008) [2023-10-08 01:21:57,928][52060] Updated weights for policy 0, policy_version 36410 (0.0010) [2023-10-08 01:21:59,039][52059] Updated weights for policy 1, policy_version 36872 (0.0010) [2023-10-08 01:21:59,404][52059] Updated weights for policy 1, policy_version 36882 (0.0008) [2023-10-08 01:21:59,774][52059] Updated weights for policy 1, policy_version 36892 (0.0007) [2023-10-08 01:22:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13884.7). Total num frames: 75071488. Throughput: 0: 1723.4, 1: 1725.6. Samples: 18779160. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:22:01,211][50642] Avg episode reward: [(0, '18.550'), (1, '20.480')] [2023-10-08 01:22:02,008][52060] Updated weights for policy 0, policy_version 36420 (0.0008) [2023-10-08 01:22:02,376][52060] Updated weights for policy 0, policy_version 36430 (0.0007) [2023-10-08 01:22:02,747][52060] Updated weights for policy 0, policy_version 36440 (0.0008) [2023-10-08 01:22:03,720][52059] Updated weights for policy 1, policy_version 36902 (0.0008) [2023-10-08 01:22:04,083][52059] Updated weights for policy 1, policy_version 36912 (0.0007) [2023-10-08 01:22:04,443][52059] Updated weights for policy 1, policy_version 36922 (0.0007) [2023-10-08 01:22:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13884.7). Total num frames: 75137024. Throughput: 0: 1691.2, 1: 1745.9. Samples: 18789440. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:22:06,211][50642] Avg episode reward: [(0, '18.750'), (1, '20.770')] [2023-10-08 01:22:06,597][52060] Updated weights for policy 0, policy_version 36450 (0.0010) [2023-10-08 01:22:06,969][52060] Updated weights for policy 0, policy_version 36460 (0.0009) [2023-10-08 01:22:07,346][52060] Updated weights for policy 0, policy_version 36470 (0.0009) [2023-10-08 01:22:07,724][52060] Updated weights for policy 0, policy_version 36480 (0.0007) [2023-10-08 01:22:08,355][52059] Updated weights for policy 1, policy_version 36932 (0.0007) [2023-10-08 01:22:08,713][52059] Updated weights for policy 1, policy_version 36942 (0.0008) [2023-10-08 01:22:09,082][52059] Updated weights for policy 1, policy_version 36952 (0.0008) [2023-10-08 01:22:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13884.8). Total num frames: 75202560. Throughput: 0: 1723.0, 1: 1725.8. Samples: 18810194. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:22:11,211][50642] Avg episode reward: [(0, '21.170'), (1, '17.570')] [2023-10-08 01:22:11,650][52060] Updated weights for policy 0, policy_version 36490 (0.0007) [2023-10-08 01:22:12,015][52060] Updated weights for policy 0, policy_version 36500 (0.0007) [2023-10-08 01:22:12,392][52060] Updated weights for policy 0, policy_version 36510 (0.0011) [2023-10-08 01:22:13,161][52059] Updated weights for policy 1, policy_version 36962 (0.0009) [2023-10-08 01:22:13,536][52059] Updated weights for policy 1, policy_version 36972 (0.0009) [2023-10-08 01:22:13,894][52059] Updated weights for policy 1, policy_version 36982 (0.0008) [2023-10-08 01:22:14,253][52059] Updated weights for policy 1, policy_version 36992 (0.0010) [2023-10-08 01:22:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 75268096. Throughput: 0: 1725.4, 1: 1735.6. Samples: 18831370. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:22:16,211][50642] Avg episode reward: [(0, '17.340'), (1, '19.710')] [2023-10-08 01:22:16,476][52060] Updated weights for policy 0, policy_version 36520 (0.0007) [2023-10-08 01:22:16,849][52060] Updated weights for policy 0, policy_version 36530 (0.0008) [2023-10-08 01:22:17,218][52060] Updated weights for policy 0, policy_version 36540 (0.0009) [2023-10-08 01:22:18,000][52059] Updated weights for policy 1, policy_version 37002 (0.0009) [2023-10-08 01:22:18,376][52059] Updated weights for policy 1, policy_version 37012 (0.0007) [2023-10-08 01:22:18,742][52059] Updated weights for policy 1, policy_version 37022 (0.0007) [2023-10-08 01:22:21,149][52060] Updated weights for policy 0, policy_version 36550 (0.0010) [2023-10-08 01:22:21,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 75333632. Throughput: 0: 1714.5, 1: 1723.9. Samples: 18840954. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) [2023-10-08 01:22:21,211][50642] Avg episode reward: [(0, '17.740'), (1, '17.260')] [2023-10-08 01:22:21,516][52060] Updated weights for policy 0, policy_version 36560 (0.0010) [2023-10-08 01:22:21,888][52060] Updated weights for policy 0, policy_version 36570 (0.0010) [2023-10-08 01:22:22,670][52059] Updated weights for policy 1, policy_version 37032 (0.0008) [2023-10-08 01:22:23,039][52059] Updated weights for policy 1, policy_version 37042 (0.0008) [2023-10-08 01:22:23,405][52059] Updated weights for policy 1, policy_version 37052 (0.0010) [2023-10-08 01:22:25,925][52060] Updated weights for policy 0, policy_version 36580 (0.0008) [2023-10-08 01:22:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 75399168. Throughput: 0: 1725.2, 1: 1728.8. Samples: 18862344. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) [2023-10-08 01:22:26,211][50642] Avg episode reward: [(0, '21.710'), (1, '17.840')] [2023-10-08 01:22:26,285][52060] Updated weights for policy 0, policy_version 36590 (0.0008) [2023-10-08 01:22:26,670][52060] Updated weights for policy 0, policy_version 36600 (0.0009) [2023-10-08 01:22:27,337][52059] Updated weights for policy 1, policy_version 37062 (0.0009) [2023-10-08 01:22:27,722][52059] Updated weights for policy 1, policy_version 37072 (0.0008) [2023-10-08 01:22:28,087][52059] Updated weights for policy 1, policy_version 37082 (0.0008) [2023-10-08 01:22:30,743][52060] Updated weights for policy 0, policy_version 36610 (0.0009) [2023-10-08 01:22:31,107][52060] Updated weights for policy 0, policy_version 36620 (0.0011) [2023-10-08 01:22:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 75464704. Throughput: 0: 1718.3, 1: 1748.8. Samples: 18883158. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) [2023-10-08 01:22:31,211][50642] Avg episode reward: [(0, '17.960'), (1, '18.580')] [2023-10-08 01:22:31,484][52060] Updated weights for policy 0, policy_version 36630 (0.0007) [2023-10-08 01:22:31,863][52060] Updated weights for policy 0, policy_version 36640 (0.0009) [2023-10-08 01:22:31,929][52059] Updated weights for policy 1, policy_version 37092 (0.0010) [2023-10-08 01:22:32,290][52059] Updated weights for policy 1, policy_version 37102 (0.0009) [2023-10-08 01:22:32,658][52059] Updated weights for policy 1, policy_version 37112 (0.0010) [2023-10-08 01:22:35,814][52060] Updated weights for policy 0, policy_version 36650 (0.0010) [2023-10-08 01:22:36,180][52060] Updated weights for policy 0, policy_version 36660 (0.0009) [2023-10-08 01:22:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 75530240. Throughput: 0: 1727.7, 1: 1721.4. Samples: 18892866. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) [2023-10-08 01:22:36,211][50642] Avg episode reward: [(0, '17.370'), (1, '19.250')] [2023-10-08 01:22:36,555][52060] Updated weights for policy 0, policy_version 36670 (0.0009) [2023-10-08 01:22:36,637][52059] Updated weights for policy 1, policy_version 37122 (0.0009) [2023-10-08 01:22:37,002][52059] Updated weights for policy 1, policy_version 37132 (0.0007) [2023-10-08 01:22:37,362][52059] Updated weights for policy 1, policy_version 37142 (0.0007) [2023-10-08 01:22:37,737][52059] Updated weights for policy 1, policy_version 37152 (0.0009) [2023-10-08 01:22:40,435][52060] Updated weights for policy 0, policy_version 36680 (0.0007) [2023-10-08 01:22:40,798][52060] Updated weights for policy 0, policy_version 36690 (0.0008) [2023-10-08 01:22:41,179][52060] Updated weights for policy 0, policy_version 36700 (0.0007) [2023-10-08 01:22:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 75595776. Throughput: 0: 1722.2, 1: 1742.2. Samples: 18914278. Policy #0 lag: (min: 15.0, avg: 15.0, max: 15.0) [2023-10-08 01:22:41,211][50642] Avg episode reward: [(0, '21.160'), (1, '19.290')] [2023-10-08 01:22:41,548][52059] Updated weights for policy 1, policy_version 37162 (0.0009) [2023-10-08 01:22:41,906][52059] Updated weights for policy 1, policy_version 37172 (0.0009) [2023-10-08 01:22:42,276][52059] Updated weights for policy 1, policy_version 37182 (0.0008) [2023-10-08 01:22:44,996][52060] Updated weights for policy 0, policy_version 36710 (0.0008) [2023-10-08 01:22:45,363][52060] Updated weights for policy 0, policy_version 36720 (0.0008) [2023-10-08 01:22:45,727][52060] Updated weights for policy 0, policy_version 36730 (0.0009) [2023-10-08 01:22:46,160][52059] Updated weights for policy 1, policy_version 37192 (0.0009) [2023-10-08 01:22:46,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 75694080. Throughput: 0: 1702.0, 1: 1748.3. Samples: 18934424. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:22:46,211][50642] Avg episode reward: [(0, '20.550'), (1, '21.700')] [2023-10-08 01:22:46,221][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000036736_37617664.pth... [2023-10-08 01:22:46,253][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000035136_35979264.pth [2023-10-08 01:22:46,531][52059] Updated weights for policy 1, policy_version 37202 (0.0012) [2023-10-08 01:22:46,903][52059] Updated weights for policy 1, policy_version 37212 (0.0010) [2023-10-08 01:22:47,039][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000037216_38109184.pth... [2023-10-08 01:22:47,069][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000035584_36438016.pth [2023-10-08 01:22:49,588][52060] Updated weights for policy 0, policy_version 36740 (0.0008) [2023-10-08 01:22:49,961][52060] Updated weights for policy 0, policy_version 36750 (0.0007) [2023-10-08 01:22:50,333][52060] Updated weights for policy 0, policy_version 36760 (0.0008) [2023-10-08 01:22:50,912][52059] Updated weights for policy 1, policy_version 37222 (0.0010) [2023-10-08 01:22:51,210][50642] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 75759616. Throughput: 0: 1728.8, 1: 1724.8. Samples: 18944852. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:22:51,211][50642] Avg episode reward: [(0, '17.540'), (1, '19.700')] [2023-10-08 01:22:51,287][52059] Updated weights for policy 1, policy_version 37232 (0.0010) [2023-10-08 01:22:51,655][52059] Updated weights for policy 1, policy_version 37242 (0.0010) [2023-10-08 01:22:54,437][52060] Updated weights for policy 0, policy_version 36770 (0.0007) [2023-10-08 01:22:54,800][52060] Updated weights for policy 0, policy_version 36780 (0.0007) [2023-10-08 01:22:55,176][52060] Updated weights for policy 0, policy_version 36790 (0.0010) [2023-10-08 01:22:55,540][52060] Updated weights for policy 0, policy_version 36800 (0.0009) [2023-10-08 01:22:55,644][52059] Updated weights for policy 1, policy_version 37252 (0.0010) [2023-10-08 01:22:56,010][52059] Updated weights for policy 1, policy_version 37262 (0.0007) [2023-10-08 01:22:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 75825152. Throughput: 0: 1711.5, 1: 1740.7. Samples: 18965542. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:22:56,211][50642] Avg episode reward: [(0, '20.130'), (1, '18.980')] [2023-10-08 01:22:56,372][52059] Updated weights for policy 1, policy_version 37272 (0.0008) [2023-10-08 01:22:59,392][52060] Updated weights for policy 0, policy_version 36810 (0.0009) [2023-10-08 01:22:59,754][52060] Updated weights for policy 0, policy_version 36820 (0.0009) [2023-10-08 01:23:00,125][52060] Updated weights for policy 0, policy_version 36830 (0.0007) [2023-10-08 01:23:00,358][52059] Updated weights for policy 1, policy_version 37282 (0.0009) [2023-10-08 01:23:00,729][52059] Updated weights for policy 1, policy_version 37292 (0.0010) [2023-10-08 01:23:01,096][52059] Updated weights for policy 1, policy_version 37302 (0.0010) [2023-10-08 01:23:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 75890688. Throughput: 0: 1695.5, 1: 1731.3. Samples: 18985574. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:23:01,211][50642] Avg episode reward: [(0, '21.690'), (1, '20.380')] [2023-10-08 01:23:01,471][52059] Updated weights for policy 1, policy_version 37312 (0.0009) [2023-10-08 01:23:04,275][52060] Updated weights for policy 0, policy_version 36840 (0.0008) [2023-10-08 01:23:04,649][52060] Updated weights for policy 0, policy_version 36850 (0.0007) [2023-10-08 01:23:05,022][52060] Updated weights for policy 0, policy_version 36860 (0.0008) [2023-10-08 01:23:05,405][52059] Updated weights for policy 1, policy_version 37322 (0.0008) [2023-10-08 01:23:05,766][52059] Updated weights for policy 1, policy_version 37332 (0.0010) [2023-10-08 01:23:06,130][52059] Updated weights for policy 1, policy_version 37342 (0.0010) [2023-10-08 01:23:06,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 75988992. Throughput: 0: 1723.8, 1: 1739.9. Samples: 18996820. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:23:06,211][50642] Avg episode reward: [(0, '17.660'), (1, '18.670')] [2023-10-08 01:23:09,064][52060] Updated weights for policy 0, policy_version 36870 (0.0010) [2023-10-08 01:23:09,441][52060] Updated weights for policy 0, policy_version 36880 (0.0009) [2023-10-08 01:23:09,811][52060] Updated weights for policy 0, policy_version 36890 (0.0007) [2023-10-08 01:23:09,885][52059] Updated weights for policy 1, policy_version 37352 (0.0007) [2023-10-08 01:23:10,242][52059] Updated weights for policy 1, policy_version 37362 (0.0008) [2023-10-08 01:23:10,605][52059] Updated weights for policy 1, policy_version 37372 (0.0007) [2023-10-08 01:23:11,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 76054528. Throughput: 0: 1696.2, 1: 1734.3. Samples: 19016718. Policy #0 lag: (min: 17.0, avg: 28.4, max: 49.0) [2023-10-08 01:23:11,211][50642] Avg episode reward: [(0, '19.820'), (1, '19.640')] [2023-10-08 01:23:13,951][52060] Updated weights for policy 0, policy_version 36900 (0.0009) [2023-10-08 01:23:14,329][52060] Updated weights for policy 0, policy_version 36910 (0.0009) [2023-10-08 01:23:14,520][52059] Updated weights for policy 1, policy_version 37382 (0.0008) [2023-10-08 01:23:14,697][52060] Updated weights for policy 0, policy_version 36920 (0.0007) [2023-10-08 01:23:14,910][52059] Updated weights for policy 1, policy_version 37392 (0.0007) [2023-10-08 01:23:15,265][52059] Updated weights for policy 1, policy_version 37402 (0.0009) [2023-10-08 01:23:16,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 76120064. Throughput: 0: 1691.5, 1: 1717.4. Samples: 19036562. Policy #0 lag: (min: 17.0, avg: 28.4, max: 49.0) [2023-10-08 01:23:16,211][50642] Avg episode reward: [(0, '20.990'), (1, '19.530')] [2023-10-08 01:23:18,751][52060] Updated weights for policy 0, policy_version 36930 (0.0007) [2023-10-08 01:23:19,118][52060] Updated weights for policy 0, policy_version 36940 (0.0010) [2023-10-08 01:23:19,212][52059] Updated weights for policy 1, policy_version 37412 (0.0009) [2023-10-08 01:23:19,485][52060] Updated weights for policy 0, policy_version 36950 (0.0008) [2023-10-08 01:23:19,579][52059] Updated weights for policy 1, policy_version 37422 (0.0007) [2023-10-08 01:23:19,851][52060] Updated weights for policy 0, policy_version 36960 (0.0008) [2023-10-08 01:23:19,934][52059] Updated weights for policy 1, policy_version 37432 (0.0009) [2023-10-08 01:23:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 76185600. Throughput: 0: 1707.6, 1: 1746.2. Samples: 19048288. Policy #0 lag: (min: 17.0, avg: 28.4, max: 49.0) [2023-10-08 01:23:21,211][50642] Avg episode reward: [(0, '17.340'), (1, '20.170')] [2023-10-08 01:23:23,813][52059] Updated weights for policy 1, policy_version 37442 (0.0010) [2023-10-08 01:23:23,885][52060] Updated weights for policy 0, policy_version 36970 (0.0009) [2023-10-08 01:23:24,174][52059] Updated weights for policy 1, policy_version 37452 (0.0007) [2023-10-08 01:23:24,260][52060] Updated weights for policy 0, policy_version 36980 (0.0008) [2023-10-08 01:23:24,534][52059] Updated weights for policy 1, policy_version 37462 (0.0008) [2023-10-08 01:23:24,627][52060] Updated weights for policy 0, policy_version 36990 (0.0008) [2023-10-08 01:23:24,902][52059] Updated weights for policy 1, policy_version 37472 (0.0008) [2023-10-08 01:23:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13884.8). Total num frames: 76251136. Throughput: 0: 1679.6, 1: 1715.4. Samples: 19067056. Policy #0 lag: (min: 17.0, avg: 28.4, max: 49.0) [2023-10-08 01:23:26,211][50642] Avg episode reward: [(0, '19.640'), (1, '18.710')] [2023-10-08 01:23:28,548][52060] Updated weights for policy 0, policy_version 37000 (0.0008) [2023-10-08 01:23:28,855][52059] Updated weights for policy 1, policy_version 37482 (0.0008) [2023-10-08 01:23:28,915][52060] Updated weights for policy 0, policy_version 37010 (0.0008) [2023-10-08 01:23:29,226][52059] Updated weights for policy 1, policy_version 37492 (0.0007) [2023-10-08 01:23:29,273][52060] Updated weights for policy 0, policy_version 37020 (0.0007) [2023-10-08 01:23:29,580][52059] Updated weights for policy 1, policy_version 37502 (0.0007) [2023-10-08 01:23:31,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 76316672. Throughput: 0: 1704.4, 1: 1717.5. Samples: 19088410. Policy #0 lag: (min: 17.0, avg: 28.4, max: 49.0) [2023-10-08 01:23:31,211][50642] Avg episode reward: [(0, '22.310'), (1, '18.650')] [2023-10-08 01:23:33,197][52060] Updated weights for policy 0, policy_version 37030 (0.0008) [2023-10-08 01:23:33,552][52060] Updated weights for policy 0, policy_version 37040 (0.0008) [2023-10-08 01:23:33,602][52059] Updated weights for policy 1, policy_version 37512 (0.0011) [2023-10-08 01:23:33,930][52060] Updated weights for policy 0, policy_version 37050 (0.0008) [2023-10-08 01:23:33,959][52059] Updated weights for policy 1, policy_version 37522 (0.0008) [2023-10-08 01:23:34,328][52059] Updated weights for policy 1, policy_version 37532 (0.0010) [2023-10-08 01:23:36,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 76382208. Throughput: 0: 1688.5, 1: 1733.1. Samples: 19098824. Policy #0 lag: (min: 24.0, avg: 50.6, max: 56.0) [2023-10-08 01:23:36,211][50642] Avg episode reward: [(0, '18.130'), (1, '20.890')] [2023-10-08 01:23:37,835][52060] Updated weights for policy 0, policy_version 37060 (0.0007) [2023-10-08 01:23:38,199][52060] Updated weights for policy 0, policy_version 37070 (0.0009) [2023-10-08 01:23:38,261][52059] Updated weights for policy 1, policy_version 37542 (0.0008) [2023-10-08 01:23:38,559][52060] Updated weights for policy 0, policy_version 37080 (0.0008) [2023-10-08 01:23:38,626][52059] Updated weights for policy 1, policy_version 37552 (0.0008) [2023-10-08 01:23:38,990][52059] Updated weights for policy 1, policy_version 37562 (0.0008) [2023-10-08 01:23:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 76447744. Throughput: 0: 1691.6, 1: 1718.9. Samples: 19119012. Policy #0 lag: (min: 24.0, avg: 50.6, max: 56.0) [2023-10-08 01:23:41,211][50642] Avg episode reward: [(0, '19.400'), (1, '18.720')] [2023-10-08 01:23:42,538][52060] Updated weights for policy 0, policy_version 37090 (0.0009) [2023-10-08 01:23:42,832][52059] Updated weights for policy 1, policy_version 37572 (0.0010) [2023-10-08 01:23:42,917][52060] Updated weights for policy 0, policy_version 37100 (0.0007) [2023-10-08 01:23:43,190][52059] Updated weights for policy 1, policy_version 37582 (0.0008) [2023-10-08 01:23:43,280][52060] Updated weights for policy 0, policy_version 37110 (0.0007) [2023-10-08 01:23:43,558][52059] Updated weights for policy 1, policy_version 37592 (0.0007) [2023-10-08 01:23:43,641][52060] Updated weights for policy 0, policy_version 37120 (0.0007) [2023-10-08 01:23:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 76513280. Throughput: 0: 1706.6, 1: 1737.1. Samples: 19140542. Policy #0 lag: (min: 24.0, avg: 50.6, max: 56.0) [2023-10-08 01:23:46,211][50642] Avg episode reward: [(0, '21.510'), (1, '17.960')] [2023-10-08 01:23:47,500][52059] Updated weights for policy 1, policy_version 37602 (0.0007) [2023-10-08 01:23:47,647][52060] Updated weights for policy 0, policy_version 37130 (0.0009) [2023-10-08 01:23:47,865][52059] Updated weights for policy 1, policy_version 37612 (0.0009) [2023-10-08 01:23:48,010][52060] Updated weights for policy 0, policy_version 37140 (0.0009) [2023-10-08 01:23:48,225][52059] Updated weights for policy 1, policy_version 37622 (0.0009) [2023-10-08 01:23:48,380][52060] Updated weights for policy 0, policy_version 37150 (0.0008) [2023-10-08 01:23:48,586][52059] Updated weights for policy 1, policy_version 37632 (0.0007) [2023-10-08 01:23:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 76578816. Throughput: 0: 1678.0, 1: 1724.3. Samples: 19149920. Policy #0 lag: (min: 24.0, avg: 50.6, max: 56.0) [2023-10-08 01:23:51,211][50642] Avg episode reward: [(0, '17.960'), (1, '22.510')] [2023-10-08 01:23:52,479][52060] Updated weights for policy 0, policy_version 37160 (0.0008) [2023-10-08 01:23:52,585][52059] Updated weights for policy 1, policy_version 37642 (0.0007) [2023-10-08 01:23:52,850][52060] Updated weights for policy 0, policy_version 37170 (0.0008) [2023-10-08 01:23:52,961][52059] Updated weights for policy 1, policy_version 37652 (0.0009) [2023-10-08 01:23:53,228][52060] Updated weights for policy 0, policy_version 37180 (0.0008) [2023-10-08 01:23:53,331][52059] Updated weights for policy 1, policy_version 37662 (0.0009) [2023-10-08 01:23:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 76644352. Throughput: 0: 1699.4, 1: 1723.0. Samples: 19170726. Policy #0 lag: (min: 24.0, avg: 50.6, max: 56.0) [2023-10-08 01:23:56,211][50642] Avg episode reward: [(0, '18.860'), (1, '20.540')] [2023-10-08 01:23:57,235][52059] Updated weights for policy 1, policy_version 37672 (0.0008) [2023-10-08 01:23:57,418][52060] Updated weights for policy 0, policy_version 37190 (0.0008) [2023-10-08 01:23:57,602][52059] Updated weights for policy 1, policy_version 37682 (0.0008) [2023-10-08 01:23:57,800][52060] Updated weights for policy 0, policy_version 37200 (0.0010) [2023-10-08 01:23:57,954][52059] Updated weights for policy 1, policy_version 37692 (0.0009) [2023-10-08 01:23:58,159][52060] Updated weights for policy 0, policy_version 37210 (0.0009) [2023-10-08 01:24:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 76709888. Throughput: 0: 1700.7, 1: 1743.3. Samples: 19191540. Policy #0 lag: (min: 24.0, avg: 50.6, max: 56.0) [2023-10-08 01:24:01,211][50642] Avg episode reward: [(0, '21.410'), (1, '18.210')] [2023-10-08 01:24:02,091][52059] Updated weights for policy 1, policy_version 37702 (0.0009) [2023-10-08 01:24:02,161][52060] Updated weights for policy 0, policy_version 37220 (0.0010) [2023-10-08 01:24:02,464][52059] Updated weights for policy 1, policy_version 37712 (0.0009) [2023-10-08 01:24:02,527][52060] Updated weights for policy 0, policy_version 37230 (0.0009) [2023-10-08 01:24:02,824][52059] Updated weights for policy 1, policy_version 37722 (0.0009) [2023-10-08 01:24:02,895][52060] Updated weights for policy 0, policy_version 37240 (0.0010) [2023-10-08 01:24:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 76775424. Throughput: 0: 1675.7, 1: 1708.7. Samples: 19200586. Policy #0 lag: (min: 1.0, avg: 13.9, max: 33.0) [2023-10-08 01:24:06,211][50642] Avg episode reward: [(0, '18.440'), (1, '21.210')] [2023-10-08 01:24:06,604][52059] Updated weights for policy 1, policy_version 37732 (0.0008) [2023-10-08 01:24:06,925][52060] Updated weights for policy 0, policy_version 37250 (0.0009) [2023-10-08 01:24:06,977][52059] Updated weights for policy 1, policy_version 37742 (0.0007) [2023-10-08 01:24:07,295][52060] Updated weights for policy 0, policy_version 37260 (0.0009) [2023-10-08 01:24:07,336][52059] Updated weights for policy 1, policy_version 37752 (0.0009) [2023-10-08 01:24:07,674][52060] Updated weights for policy 0, policy_version 37270 (0.0009) [2023-10-08 01:24:08,040][52060] Updated weights for policy 0, policy_version 37280 (0.0007) [2023-10-08 01:24:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 76840960. Throughput: 0: 1706.0, 1: 1743.9. Samples: 19222298. Policy #0 lag: (min: 1.0, avg: 13.9, max: 33.0) [2023-10-08 01:24:11,211][50642] Avg episode reward: [(0, '19.210'), (1, '21.440')] [2023-10-08 01:24:11,236][52059] Updated weights for policy 1, policy_version 37762 (0.0008) [2023-10-08 01:24:11,599][52059] Updated weights for policy 1, policy_version 37772 (0.0008) [2023-10-08 01:24:11,957][52059] Updated weights for policy 1, policy_version 37782 (0.0008) [2023-10-08 01:24:12,048][52060] Updated weights for policy 0, policy_version 37290 (0.0008) [2023-10-08 01:24:12,316][52059] Updated weights for policy 1, policy_version 37792 (0.0008) [2023-10-08 01:24:12,410][52060] Updated weights for policy 0, policy_version 37300 (0.0009) [2023-10-08 01:24:12,787][52060] Updated weights for policy 0, policy_version 37310 (0.0009) [2023-10-08 01:24:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 76906496. Throughput: 0: 1700.7, 1: 1745.3. Samples: 19243482. Policy #0 lag: (min: 1.0, avg: 13.9, max: 33.0) [2023-10-08 01:24:16,211][50642] Avg episode reward: [(0, '21.010'), (1, '17.850')] [2023-10-08 01:24:16,286][52059] Updated weights for policy 1, policy_version 37802 (0.0007) [2023-10-08 01:24:16,651][52059] Updated weights for policy 1, policy_version 37812 (0.0009) [2023-10-08 01:24:16,976][52060] Updated weights for policy 0, policy_version 37320 (0.0009) [2023-10-08 01:24:17,021][52059] Updated weights for policy 1, policy_version 37822 (0.0008) [2023-10-08 01:24:17,343][52060] Updated weights for policy 0, policy_version 37330 (0.0010) [2023-10-08 01:24:17,721][52060] Updated weights for policy 0, policy_version 37340 (0.0008) [2023-10-08 01:24:20,926][52059] Updated weights for policy 1, policy_version 37832 (0.0010) [2023-10-08 01:24:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 76972032. Throughput: 0: 1691.4, 1: 1731.2. Samples: 19252844. Policy #0 lag: (min: 1.0, avg: 13.9, max: 33.0) [2023-10-08 01:24:21,211][50642] Avg episode reward: [(0, '19.350'), (1, '20.020')] [2023-10-08 01:24:21,292][52059] Updated weights for policy 1, policy_version 37842 (0.0010) [2023-10-08 01:24:21,653][52059] Updated weights for policy 1, policy_version 37852 (0.0008) [2023-10-08 01:24:21,735][52060] Updated weights for policy 0, policy_version 37350 (0.0008) [2023-10-08 01:24:22,113][52060] Updated weights for policy 0, policy_version 37360 (0.0009) [2023-10-08 01:24:22,472][52060] Updated weights for policy 0, policy_version 37370 (0.0008) [2023-10-08 01:24:25,679][52059] Updated weights for policy 1, policy_version 37862 (0.0010) [2023-10-08 01:24:26,055][52059] Updated weights for policy 1, policy_version 37872 (0.0008) [2023-10-08 01:24:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 77037568. Throughput: 0: 1702.9, 1: 1747.6. Samples: 19274282. Policy #0 lag: (min: 1.0, avg: 13.9, max: 33.0) [2023-10-08 01:24:26,211][50642] Avg episode reward: [(0, '19.330'), (1, '22.280')] [2023-10-08 01:24:26,417][52060] Updated weights for policy 0, policy_version 37380 (0.0010) [2023-10-08 01:24:26,417][52059] Updated weights for policy 1, policy_version 37882 (0.0007) [2023-10-08 01:24:26,779][52060] Updated weights for policy 0, policy_version 37390 (0.0008) [2023-10-08 01:24:27,142][52060] Updated weights for policy 0, policy_version 37400 (0.0008) [2023-10-08 01:24:30,280][52059] Updated weights for policy 1, policy_version 37892 (0.0008) [2023-10-08 01:24:30,639][52059] Updated weights for policy 1, policy_version 37902 (0.0007) [2023-10-08 01:24:31,001][52059] Updated weights for policy 1, policy_version 37912 (0.0009) [2023-10-08 01:24:31,049][52060] Updated weights for policy 0, policy_version 37410 (0.0009) [2023-10-08 01:24:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 77103104. Throughput: 0: 1706.8, 1: 1723.4. Samples: 19294900. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:24:31,211][50642] Avg episode reward: [(0, '20.930'), (1, '20.890')] [2023-10-08 01:24:31,413][52060] Updated weights for policy 0, policy_version 37420 (0.0009) [2023-10-08 01:24:31,786][52060] Updated weights for policy 0, policy_version 37430 (0.0008) [2023-10-08 01:24:32,151][52060] Updated weights for policy 0, policy_version 37440 (0.0008) [2023-10-08 01:24:34,946][52059] Updated weights for policy 1, policy_version 37922 (0.0008) [2023-10-08 01:24:35,315][52059] Updated weights for policy 1, policy_version 37932 (0.0007) [2023-10-08 01:24:35,672][52059] Updated weights for policy 1, policy_version 37942 (0.0011) [2023-10-08 01:24:36,037][52059] Updated weights for policy 1, policy_version 37952 (0.0008) [2023-10-08 01:24:36,061][52060] Updated weights for policy 0, policy_version 37450 (0.0008) [2023-10-08 01:24:36,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 77201408. Throughput: 0: 1703.5, 1: 1741.0. Samples: 19304922. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:24:36,211][50642] Avg episode reward: [(0, '18.910'), (1, '19.210')] [2023-10-08 01:24:36,429][52060] Updated weights for policy 0, policy_version 37460 (0.0010) [2023-10-08 01:24:36,795][52060] Updated weights for policy 0, policy_version 37470 (0.0011) [2023-10-08 01:24:39,773][52059] Updated weights for policy 1, policy_version 37962 (0.0009) [2023-10-08 01:24:40,136][52059] Updated weights for policy 1, policy_version 37972 (0.0008) [2023-10-08 01:24:40,501][52059] Updated weights for policy 1, policy_version 37982 (0.0010) [2023-10-08 01:24:40,660][52060] Updated weights for policy 0, policy_version 37480 (0.0008) [2023-10-08 01:24:41,025][52060] Updated weights for policy 0, policy_version 37490 (0.0010) [2023-10-08 01:24:41,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 77266944. Throughput: 0: 1712.7, 1: 1743.0. Samples: 19326230. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:24:41,211][50642] Avg episode reward: [(0, '18.770'), (1, '22.300')] [2023-10-08 01:24:41,389][52060] Updated weights for policy 0, policy_version 37500 (0.0010) [2023-10-08 01:24:44,472][52059] Updated weights for policy 1, policy_version 37992 (0.0008) [2023-10-08 01:24:44,842][52059] Updated weights for policy 1, policy_version 38002 (0.0009) [2023-10-08 01:24:45,217][52059] Updated weights for policy 1, policy_version 38012 (0.0007) [2023-10-08 01:24:45,535][52060] Updated weights for policy 0, policy_version 37510 (0.0010) [2023-10-08 01:24:45,916][52060] Updated weights for policy 0, policy_version 37520 (0.0007) [2023-10-08 01:24:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 77332480. Throughput: 0: 1706.6, 1: 1726.6. Samples: 19346034. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:24:46,211][50642] Avg episode reward: [(0, '20.780'), (1, '20.790')] [2023-10-08 01:24:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000038016_38928384.pth... [2023-10-08 01:24:46,248][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000036384_37257216.pth [2023-10-08 01:24:46,287][52060] Updated weights for policy 0, policy_version 37530 (0.0009) [2023-10-08 01:24:46,508][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000037536_38436864.pth... [2023-10-08 01:24:46,548][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000035936_36798464.pth [2023-10-08 01:24:48,962][52059] Updated weights for policy 1, policy_version 38022 (0.0009) [2023-10-08 01:24:49,354][52059] Updated weights for policy 1, policy_version 38032 (0.0009) [2023-10-08 01:24:49,734][52059] Updated weights for policy 1, policy_version 38042 (0.0010) [2023-10-08 01:24:50,371][52060] Updated weights for policy 0, policy_version 37540 (0.0007) [2023-10-08 01:24:50,742][52060] Updated weights for policy 0, policy_version 37550 (0.0008) [2023-10-08 01:24:51,101][52060] Updated weights for policy 0, policy_version 37560 (0.0011) [2023-10-08 01:24:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 77398016. Throughput: 0: 1717.0, 1: 1757.8. Samples: 19356952. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:24:51,211][50642] Avg episode reward: [(0, '19.650'), (1, '19.850')] [2023-10-08 01:24:53,743][52059] Updated weights for policy 1, policy_version 38052 (0.0008) [2023-10-08 01:24:54,112][52059] Updated weights for policy 1, policy_version 38062 (0.0011) [2023-10-08 01:24:54,475][52059] Updated weights for policy 1, policy_version 38072 (0.0009) [2023-10-08 01:24:55,080][52060] Updated weights for policy 0, policy_version 37570 (0.0010) [2023-10-08 01:24:55,439][52060] Updated weights for policy 0, policy_version 37580 (0.0008) [2023-10-08 01:24:55,809][52060] Updated weights for policy 0, policy_version 37590 (0.0009) [2023-10-08 01:24:56,183][52060] Updated weights for policy 0, policy_version 37600 (0.0008) [2023-10-08 01:24:56,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 77496320. Throughput: 0: 1714.1, 1: 1718.7. Samples: 19376774. Policy #0 lag: (min: 31.0, avg: 35.8, max: 63.0) [2023-10-08 01:24:56,211][50642] Avg episode reward: [(0, '17.260'), (1, '19.140')] [2023-10-08 01:24:58,240][52059] Updated weights for policy 1, policy_version 38082 (0.0008) [2023-10-08 01:24:58,607][52059] Updated weights for policy 1, policy_version 38092 (0.0011) [2023-10-08 01:24:58,972][52059] Updated weights for policy 1, policy_version 38102 (0.0008) [2023-10-08 01:24:59,334][52059] Updated weights for policy 1, policy_version 38112 (0.0008) [2023-10-08 01:25:00,158][52060] Updated weights for policy 0, policy_version 37610 (0.0008) [2023-10-08 01:25:00,523][52060] Updated weights for policy 0, policy_version 37620 (0.0008) [2023-10-08 01:25:00,883][52060] Updated weights for policy 0, policy_version 37630 (0.0007) [2023-10-08 01:25:01,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 77561856. Throughput: 0: 1693.2, 1: 1722.4. Samples: 19397186. Policy #0 lag: (min: 31.0, avg: 35.8, max: 63.0) [2023-10-08 01:25:01,211][50642] Avg episode reward: [(0, '21.340'), (1, '22.840')] [2023-10-08 01:25:03,274][52059] Updated weights for policy 1, policy_version 38122 (0.0009) [2023-10-08 01:25:03,645][52059] Updated weights for policy 1, policy_version 38132 (0.0007) [2023-10-08 01:25:04,009][52059] Updated weights for policy 1, policy_version 38142 (0.0009) [2023-10-08 01:25:04,885][52060] Updated weights for policy 0, policy_version 37640 (0.0008) [2023-10-08 01:25:05,246][52060] Updated weights for policy 0, policy_version 37650 (0.0008) [2023-10-08 01:25:05,616][52060] Updated weights for policy 0, policy_version 37660 (0.0008) [2023-10-08 01:25:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 77627392. Throughput: 0: 1719.2, 1: 1730.1. Samples: 19408060. Policy #0 lag: (min: 31.0, avg: 35.8, max: 63.0) [2023-10-08 01:25:06,211][50642] Avg episode reward: [(0, '21.630'), (1, '18.390')] [2023-10-08 01:25:07,860][52059] Updated weights for policy 1, policy_version 38152 (0.0010) [2023-10-08 01:25:08,232][52059] Updated weights for policy 1, policy_version 38162 (0.0010) [2023-10-08 01:25:08,595][52059] Updated weights for policy 1, policy_version 38172 (0.0010) [2023-10-08 01:25:09,669][52060] Updated weights for policy 0, policy_version 37670 (0.0009) [2023-10-08 01:25:10,028][52060] Updated weights for policy 0, policy_version 37680 (0.0009) [2023-10-08 01:25:10,405][52060] Updated weights for policy 0, policy_version 37690 (0.0008) [2023-10-08 01:25:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 77692928. Throughput: 0: 1708.0, 1: 1724.0. Samples: 19428720. Policy #0 lag: (min: 31.0, avg: 35.8, max: 63.0) [2023-10-08 01:25:11,211][50642] Avg episode reward: [(0, '16.570'), (1, '19.100')] [2023-10-08 01:25:12,538][52059] Updated weights for policy 1, policy_version 38182 (0.0008) [2023-10-08 01:25:12,899][52059] Updated weights for policy 1, policy_version 38192 (0.0008) [2023-10-08 01:25:13,272][52059] Updated weights for policy 1, policy_version 38202 (0.0007) [2023-10-08 01:25:14,337][52060] Updated weights for policy 0, policy_version 37700 (0.0007) [2023-10-08 01:25:14,713][52060] Updated weights for policy 0, policy_version 37710 (0.0010) [2023-10-08 01:25:15,099][52060] Updated weights for policy 0, policy_version 37720 (0.0009) [2023-10-08 01:25:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 77758464. Throughput: 0: 1684.9, 1: 1740.1. Samples: 19449024. Policy #0 lag: (min: 31.0, avg: 35.8, max: 63.0) [2023-10-08 01:25:16,211][50642] Avg episode reward: [(0, '18.460'), (1, '20.020')] [2023-10-08 01:25:17,243][52059] Updated weights for policy 1, policy_version 38212 (0.0009) [2023-10-08 01:25:17,603][52059] Updated weights for policy 1, policy_version 38222 (0.0008) [2023-10-08 01:25:17,967][52059] Updated weights for policy 1, policy_version 38232 (0.0009) [2023-10-08 01:25:18,915][52060] Updated weights for policy 0, policy_version 37730 (0.0008) [2023-10-08 01:25:19,280][52060] Updated weights for policy 0, policy_version 37740 (0.0008) [2023-10-08 01:25:19,649][52060] Updated weights for policy 0, policy_version 37750 (0.0010) [2023-10-08 01:25:20,019][52060] Updated weights for policy 0, policy_version 37760 (0.0010) [2023-10-08 01:25:21,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 77824000. Throughput: 0: 1722.6, 1: 1720.0. Samples: 19459840. Policy #0 lag: (min: 31.0, avg: 35.8, max: 63.0) [2023-10-08 01:25:21,212][50642] Avg episode reward: [(0, '23.680'), (1, '18.470')] [2023-10-08 01:25:22,141][52059] Updated weights for policy 1, policy_version 38242 (0.0010) [2023-10-08 01:25:22,498][52059] Updated weights for policy 1, policy_version 38252 (0.0007) [2023-10-08 01:25:22,858][52059] Updated weights for policy 1, policy_version 38262 (0.0007) [2023-10-08 01:25:23,228][52059] Updated weights for policy 1, policy_version 38272 (0.0008) [2023-10-08 01:25:23,977][52060] Updated weights for policy 0, policy_version 37770 (0.0008) [2023-10-08 01:25:24,336][52060] Updated weights for policy 0, policy_version 37780 (0.0008) [2023-10-08 01:25:24,715][52060] Updated weights for policy 0, policy_version 37790 (0.0010) [2023-10-08 01:25:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 77889536. Throughput: 0: 1689.0, 1: 1721.4. Samples: 19479698. Policy #0 lag: (min: 26.0, avg: 26.5, max: 41.0) [2023-10-08 01:25:26,211][50642] Avg episode reward: [(0, '17.670'), (1, '18.350')] [2023-10-08 01:25:27,234][52059] Updated weights for policy 1, policy_version 38282 (0.0010) [2023-10-08 01:25:27,604][52059] Updated weights for policy 1, policy_version 38292 (0.0010) [2023-10-08 01:25:27,978][52059] Updated weights for policy 1, policy_version 38302 (0.0011) [2023-10-08 01:25:28,726][52060] Updated weights for policy 0, policy_version 37800 (0.0011) [2023-10-08 01:25:29,102][52060] Updated weights for policy 0, policy_version 37810 (0.0008) [2023-10-08 01:25:29,460][52060] Updated weights for policy 0, policy_version 37820 (0.0010) [2023-10-08 01:25:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 77955072. Throughput: 0: 1705.6, 1: 1737.8. Samples: 19500986. Policy #0 lag: (min: 26.0, avg: 26.5, max: 41.0) [2023-10-08 01:25:31,211][50642] Avg episode reward: [(0, '17.160'), (1, '19.850')] [2023-10-08 01:25:31,987][52059] Updated weights for policy 1, policy_version 38312 (0.0010) [2023-10-08 01:25:32,353][52059] Updated weights for policy 1, policy_version 38322 (0.0007) [2023-10-08 01:25:32,724][52059] Updated weights for policy 1, policy_version 38332 (0.0007) [2023-10-08 01:25:33,477][52060] Updated weights for policy 0, policy_version 37830 (0.0008) [2023-10-08 01:25:33,859][52060] Updated weights for policy 0, policy_version 37840 (0.0007) [2023-10-08 01:25:34,222][52060] Updated weights for policy 0, policy_version 37850 (0.0008) [2023-10-08 01:25:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 78020608. Throughput: 0: 1713.5, 1: 1709.5. Samples: 19510986. Policy #0 lag: (min: 26.0, avg: 26.5, max: 41.0) [2023-10-08 01:25:36,211][50642] Avg episode reward: [(0, '21.330'), (1, '21.090')] [2023-10-08 01:25:36,864][52059] Updated weights for policy 1, policy_version 38342 (0.0009) [2023-10-08 01:25:37,258][52059] Updated weights for policy 1, policy_version 38352 (0.0009) [2023-10-08 01:25:37,616][52059] Updated weights for policy 1, policy_version 38362 (0.0008) [2023-10-08 01:25:38,100][52060] Updated weights for policy 0, policy_version 37860 (0.0009) [2023-10-08 01:25:38,475][52060] Updated weights for policy 0, policy_version 37870 (0.0009) [2023-10-08 01:25:38,839][52060] Updated weights for policy 0, policy_version 37880 (0.0008) [2023-10-08 01:25:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 78086144. Throughput: 0: 1704.2, 1: 1735.7. Samples: 19531570. Policy #0 lag: (min: 26.0, avg: 26.5, max: 41.0) [2023-10-08 01:25:41,211][50642] Avg episode reward: [(0, '17.540'), (1, '19.530')] [2023-10-08 01:25:41,548][52059] Updated weights for policy 1, policy_version 38372 (0.0008) [2023-10-08 01:25:41,910][52059] Updated weights for policy 1, policy_version 38382 (0.0008) [2023-10-08 01:25:42,272][52059] Updated weights for policy 1, policy_version 38392 (0.0008) [2023-10-08 01:25:42,722][52060] Updated weights for policy 0, policy_version 37890 (0.0007) [2023-10-08 01:25:43,096][52060] Updated weights for policy 0, policy_version 37900 (0.0008) [2023-10-08 01:25:43,462][52060] Updated weights for policy 0, policy_version 37910 (0.0007) [2023-10-08 01:25:43,824][52060] Updated weights for policy 0, policy_version 37920 (0.0009) [2023-10-08 01:25:46,040][52059] Updated weights for policy 1, policy_version 38402 (0.0008) [2023-10-08 01:25:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 78151680. Throughput: 0: 1732.0, 1: 1736.9. Samples: 19553288. Policy #0 lag: (min: 26.0, avg: 26.5, max: 41.0) [2023-10-08 01:25:46,211][50642] Avg episode reward: [(0, '17.020'), (1, '20.300')] [2023-10-08 01:25:46,408][52059] Updated weights for policy 1, policy_version 38412 (0.0011) [2023-10-08 01:25:46,779][52059] Updated weights for policy 1, policy_version 38422 (0.0010) [2023-10-08 01:25:47,141][52059] Updated weights for policy 1, policy_version 38432 (0.0010) [2023-10-08 01:25:47,616][52060] Updated weights for policy 0, policy_version 37930 (0.0010) [2023-10-08 01:25:47,995][52060] Updated weights for policy 0, policy_version 37940 (0.0009) [2023-10-08 01:25:48,362][52060] Updated weights for policy 0, policy_version 37950 (0.0009) [2023-10-08 01:25:51,072][52059] Updated weights for policy 1, policy_version 38442 (0.0007) [2023-10-08 01:25:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 78217216. Throughput: 0: 1708.8, 1: 1730.3. Samples: 19562816. Policy #0 lag: (min: 26.0, avg: 26.5, max: 41.0) [2023-10-08 01:25:51,211][50642] Avg episode reward: [(0, '20.940'), (1, '22.740')] [2023-10-08 01:25:51,429][52059] Updated weights for policy 1, policy_version 38452 (0.0007) [2023-10-08 01:25:51,797][52059] Updated weights for policy 1, policy_version 38462 (0.0008) [2023-10-08 01:25:52,420][52060] Updated weights for policy 0, policy_version 37960 (0.0007) [2023-10-08 01:25:52,785][52060] Updated weights for policy 0, policy_version 37970 (0.0007) [2023-10-08 01:25:53,150][52060] Updated weights for policy 0, policy_version 37980 (0.0007) [2023-10-08 01:25:55,765][52059] Updated weights for policy 1, policy_version 38472 (0.0008) [2023-10-08 01:25:56,124][52059] Updated weights for policy 1, policy_version 38482 (0.0009) [2023-10-08 01:25:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 78282752. Throughput: 0: 1715.6, 1: 1737.6. Samples: 19584116. Policy #0 lag: (min: 18.0, avg: 21.9, max: 50.0) [2023-10-08 01:25:56,211][50642] Avg episode reward: [(0, '19.140'), (1, '19.670')] [2023-10-08 01:25:56,483][52059] Updated weights for policy 1, policy_version 38492 (0.0008) [2023-10-08 01:25:57,080][52060] Updated weights for policy 0, policy_version 37990 (0.0007) [2023-10-08 01:25:57,449][52060] Updated weights for policy 0, policy_version 38000 (0.0007) [2023-10-08 01:25:57,827][52060] Updated weights for policy 0, policy_version 38010 (0.0007) [2023-10-08 01:26:00,274][52059] Updated weights for policy 1, policy_version 38502 (0.0007) [2023-10-08 01:26:00,638][52059] Updated weights for policy 1, policy_version 38512 (0.0008) [2023-10-08 01:26:00,998][52059] Updated weights for policy 1, policy_version 38522 (0.0008) [2023-10-08 01:26:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 78348288. Throughput: 0: 1737.5, 1: 1723.0. Samples: 19604744. Policy #0 lag: (min: 18.0, avg: 21.9, max: 50.0) [2023-10-08 01:26:01,211][50642] Avg episode reward: [(0, '17.630'), (1, '19.880')] [2023-10-08 01:26:01,845][52060] Updated weights for policy 0, policy_version 38020 (0.0009) [2023-10-08 01:26:02,203][52060] Updated weights for policy 0, policy_version 38030 (0.0010) [2023-10-08 01:26:02,574][52060] Updated weights for policy 0, policy_version 38040 (0.0010) [2023-10-08 01:26:04,720][52059] Updated weights for policy 1, policy_version 38532 (0.0008) [2023-10-08 01:26:05,085][52059] Updated weights for policy 1, policy_version 38542 (0.0008) [2023-10-08 01:26:05,450][52059] Updated weights for policy 1, policy_version 38552 (0.0008) [2023-10-08 01:26:06,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 78446592. Throughput: 0: 1701.5, 1: 1747.4. Samples: 19615038. Policy #0 lag: (min: 18.0, avg: 21.9, max: 50.0) [2023-10-08 01:26:06,211][50642] Avg episode reward: [(0, '20.560'), (1, '22.870')] [2023-10-08 01:26:06,604][52060] Updated weights for policy 0, policy_version 38050 (0.0008) [2023-10-08 01:26:06,973][52060] Updated weights for policy 0, policy_version 38060 (0.0009) [2023-10-08 01:26:07,338][52060] Updated weights for policy 0, policy_version 38070 (0.0008) [2023-10-08 01:26:07,709][52060] Updated weights for policy 0, policy_version 38080 (0.0008) [2023-10-08 01:26:09,343][52059] Updated weights for policy 1, policy_version 38562 (0.0010) [2023-10-08 01:26:09,709][52059] Updated weights for policy 1, policy_version 38572 (0.0007) [2023-10-08 01:26:10,072][52059] Updated weights for policy 1, policy_version 38582 (0.0007) [2023-10-08 01:26:10,436][52059] Updated weights for policy 1, policy_version 38592 (0.0007) [2023-10-08 01:26:11,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 78512128. Throughput: 0: 1734.4, 1: 1738.3. Samples: 19635966. Policy #0 lag: (min: 18.0, avg: 21.9, max: 50.0) [2023-10-08 01:26:11,211][50642] Avg episode reward: [(0, '20.860'), (1, '19.180')] [2023-10-08 01:26:11,775][52060] Updated weights for policy 0, policy_version 38090 (0.0010) [2023-10-08 01:26:12,147][52060] Updated weights for policy 0, policy_version 38100 (0.0008) [2023-10-08 01:26:12,517][52060] Updated weights for policy 0, policy_version 38110 (0.0008) [2023-10-08 01:26:14,452][52059] Updated weights for policy 1, policy_version 38602 (0.0008) [2023-10-08 01:26:14,824][52059] Updated weights for policy 1, policy_version 38612 (0.0010) [2023-10-08 01:26:15,179][52059] Updated weights for policy 1, policy_version 38622 (0.0008) [2023-10-08 01:26:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 78577664. Throughput: 0: 1726.4, 1: 1725.0. Samples: 19656300. Policy #0 lag: (min: 18.0, avg: 21.9, max: 50.0) [2023-10-08 01:26:16,211][50642] Avg episode reward: [(0, '17.480'), (1, '18.340')] [2023-10-08 01:26:16,565][52060] Updated weights for policy 0, policy_version 38120 (0.0008) [2023-10-08 01:26:16,941][52060] Updated weights for policy 0, policy_version 38130 (0.0008) [2023-10-08 01:26:17,303][52060] Updated weights for policy 0, policy_version 38140 (0.0008) [2023-10-08 01:26:19,066][52059] Updated weights for policy 1, policy_version 38632 (0.0008) [2023-10-08 01:26:19,427][52059] Updated weights for policy 1, policy_version 38642 (0.0008) [2023-10-08 01:26:19,790][52059] Updated weights for policy 1, policy_version 38652 (0.0007) [2023-10-08 01:26:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 78643200. Throughput: 0: 1707.3, 1: 1756.2. Samples: 19666844. Policy #0 lag: (min: 18.0, avg: 21.9, max: 50.0) [2023-10-08 01:26:21,211][50642] Avg episode reward: [(0, '19.440'), (1, '22.770')] [2023-10-08 01:26:21,328][52060] Updated weights for policy 0, policy_version 38150 (0.0007) [2023-10-08 01:26:21,702][52060] Updated weights for policy 0, policy_version 38160 (0.0008) [2023-10-08 01:26:22,073][52060] Updated weights for policy 0, policy_version 38170 (0.0008) [2023-10-08 01:26:23,624][52059] Updated weights for policy 1, policy_version 38662 (0.0008) [2023-10-08 01:26:23,985][52059] Updated weights for policy 1, policy_version 38672 (0.0009) [2023-10-08 01:26:24,354][52059] Updated weights for policy 1, policy_version 38682 (0.0009) [2023-10-08 01:26:25,984][52060] Updated weights for policy 0, policy_version 38180 (0.0009) [2023-10-08 01:26:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 78708736. Throughput: 0: 1723.4, 1: 1734.0. Samples: 19687152. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:26:26,211][50642] Avg episode reward: [(0, '21.960'), (1, '20.510')] [2023-10-08 01:26:26,350][52060] Updated weights for policy 0, policy_version 38190 (0.0007) [2023-10-08 01:26:26,724][52060] Updated weights for policy 0, policy_version 38200 (0.0009) [2023-10-08 01:26:28,354][52059] Updated weights for policy 1, policy_version 38692 (0.0008) [2023-10-08 01:26:28,741][52059] Updated weights for policy 1, policy_version 38702 (0.0010) [2023-10-08 01:26:29,104][52059] Updated weights for policy 1, policy_version 38712 (0.0010) [2023-10-08 01:26:30,597][52060] Updated weights for policy 0, policy_version 38210 (0.0010) [2023-10-08 01:26:30,968][52060] Updated weights for policy 0, policy_version 38220 (0.0008) [2023-10-08 01:26:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 78774272. Throughput: 0: 1712.1, 1: 1730.6. Samples: 19708212. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:26:31,211][50642] Avg episode reward: [(0, '19.540'), (1, '18.340')] [2023-10-08 01:26:31,344][52060] Updated weights for policy 0, policy_version 38230 (0.0011) [2023-10-08 01:26:31,710][52060] Updated weights for policy 0, policy_version 38240 (0.0007) [2023-10-08 01:26:32,852][52059] Updated weights for policy 1, policy_version 38722 (0.0010) [2023-10-08 01:26:33,228][52059] Updated weights for policy 1, policy_version 38732 (0.0008) [2023-10-08 01:26:33,584][52059] Updated weights for policy 1, policy_version 38742 (0.0007) [2023-10-08 01:26:33,951][52059] Updated weights for policy 1, policy_version 38752 (0.0008) [2023-10-08 01:26:35,722][52060] Updated weights for policy 0, policy_version 38250 (0.0009) [2023-10-08 01:26:36,092][52060] Updated weights for policy 0, policy_version 38260 (0.0008) [2023-10-08 01:26:36,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 78839808. Throughput: 0: 1716.4, 1: 1736.7. Samples: 19718206. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:26:36,211][50642] Avg episode reward: [(0, '17.840'), (1, '21.990')] [2023-10-08 01:26:36,460][52060] Updated weights for policy 0, policy_version 38270 (0.0007) [2023-10-08 01:26:37,889][52059] Updated weights for policy 1, policy_version 38762 (0.0009) [2023-10-08 01:26:38,254][52059] Updated weights for policy 1, policy_version 38772 (0.0008) [2023-10-08 01:26:38,627][52059] Updated weights for policy 1, policy_version 38782 (0.0009) [2023-10-08 01:26:40,506][52060] Updated weights for policy 0, policy_version 38280 (0.0008) [2023-10-08 01:26:40,871][52060] Updated weights for policy 0, policy_version 38290 (0.0007) [2023-10-08 01:26:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 78905344. Throughput: 0: 1714.6, 1: 1733.0. Samples: 19739258. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:26:41,211][50642] Avg episode reward: [(0, '20.100'), (1, '19.950')] [2023-10-08 01:26:41,242][52060] Updated weights for policy 0, policy_version 38300 (0.0008) [2023-10-08 01:26:42,633][52059] Updated weights for policy 1, policy_version 38792 (0.0010) [2023-10-08 01:26:42,998][52059] Updated weights for policy 1, policy_version 38802 (0.0009) [2023-10-08 01:26:43,365][52059] Updated weights for policy 1, policy_version 38812 (0.0008) [2023-10-08 01:26:45,136][52060] Updated weights for policy 0, policy_version 38310 (0.0008) [2023-10-08 01:26:45,502][52060] Updated weights for policy 0, policy_version 38320 (0.0010) [2023-10-08 01:26:45,882][52060] Updated weights for policy 0, policy_version 38330 (0.0008) [2023-10-08 01:26:46,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 79003648. Throughput: 0: 1692.8, 1: 1741.6. Samples: 19759294. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:26:46,211][50642] Avg episode reward: [(0, '20.530'), (1, '19.200')] [2023-10-08 01:26:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000038336_39256064.pth... [2023-10-08 01:26:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000038816_39747584.pth... [2023-10-08 01:26:46,259][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000037216_38109184.pth [2023-10-08 01:26:46,259][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000036736_37617664.pth [2023-10-08 01:26:47,427][52059] Updated weights for policy 1, policy_version 38822 (0.0009) [2023-10-08 01:26:47,794][52059] Updated weights for policy 1, policy_version 38832 (0.0007) [2023-10-08 01:26:48,172][52059] Updated weights for policy 1, policy_version 38842 (0.0009) [2023-10-08 01:26:49,835][52060] Updated weights for policy 0, policy_version 38340 (0.0007) [2023-10-08 01:26:50,201][52060] Updated weights for policy 0, policy_version 38350 (0.0008) [2023-10-08 01:26:50,574][52060] Updated weights for policy 0, policy_version 38360 (0.0011) [2023-10-08 01:26:51,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 79069184. Throughput: 0: 1712.9, 1: 1719.1. Samples: 19769478. Policy #0 lag: (min: 16.0, avg: 41.6, max: 48.0) [2023-10-08 01:26:51,211][50642] Avg episode reward: [(0, '18.710'), (1, '20.050')] [2023-10-08 01:26:52,124][52059] Updated weights for policy 1, policy_version 38852 (0.0008) [2023-10-08 01:26:52,487][52059] Updated weights for policy 1, policy_version 38862 (0.0007) [2023-10-08 01:26:52,851][52059] Updated weights for policy 1, policy_version 38872 (0.0007) [2023-10-08 01:26:54,603][52060] Updated weights for policy 0, policy_version 38370 (0.0008) [2023-10-08 01:26:54,965][52060] Updated weights for policy 0, policy_version 38380 (0.0009) [2023-10-08 01:26:55,341][52060] Updated weights for policy 0, policy_version 38390 (0.0009) [2023-10-08 01:26:55,707][52060] Updated weights for policy 0, policy_version 38400 (0.0010) [2023-10-08 01:26:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 79134720. Throughput: 0: 1704.7, 1: 1723.6. Samples: 19790236. Policy #0 lag: (min: 16.0, avg: 41.6, max: 48.0) [2023-10-08 01:26:56,211][50642] Avg episode reward: [(0, '19.740'), (1, '22.480')] [2023-10-08 01:26:56,822][52059] Updated weights for policy 1, policy_version 38882 (0.0007) [2023-10-08 01:26:57,183][52059] Updated weights for policy 1, policy_version 38892 (0.0008) [2023-10-08 01:26:57,548][52059] Updated weights for policy 1, policy_version 38902 (0.0009) [2023-10-08 01:26:57,914][52059] Updated weights for policy 1, policy_version 38912 (0.0009) [2023-10-08 01:26:59,667][52060] Updated weights for policy 0, policy_version 38410 (0.0007) [2023-10-08 01:27:00,040][52060] Updated weights for policy 0, policy_version 38420 (0.0007) [2023-10-08 01:27:00,405][52060] Updated weights for policy 0, policy_version 38430 (0.0007) [2023-10-08 01:27:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 79200256. Throughput: 0: 1688.7, 1: 1737.0. Samples: 19810456. Policy #0 lag: (min: 16.0, avg: 41.6, max: 48.0) [2023-10-08 01:27:01,211][50642] Avg episode reward: [(0, '21.000'), (1, '19.390')] [2023-10-08 01:27:01,888][52059] Updated weights for policy 1, policy_version 38922 (0.0010) [2023-10-08 01:27:02,248][52059] Updated weights for policy 1, policy_version 38932 (0.0011) [2023-10-08 01:27:02,618][52059] Updated weights for policy 1, policy_version 38942 (0.0010) [2023-10-08 01:27:04,404][52060] Updated weights for policy 0, policy_version 38440 (0.0008) [2023-10-08 01:27:04,770][52060] Updated weights for policy 0, policy_version 38450 (0.0008) [2023-10-08 01:27:05,134][52060] Updated weights for policy 0, policy_version 38460 (0.0008) [2023-10-08 01:27:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 79265792. Throughput: 0: 1725.9, 1: 1704.9. Samples: 19821230. Policy #0 lag: (min: 16.0, avg: 41.6, max: 48.0) [2023-10-08 01:27:06,211][50642] Avg episode reward: [(0, '17.840'), (1, '20.040')] [2023-10-08 01:27:06,739][52059] Updated weights for policy 1, policy_version 38952 (0.0008) [2023-10-08 01:27:07,110][52059] Updated weights for policy 1, policy_version 38962 (0.0008) [2023-10-08 01:27:07,472][52059] Updated weights for policy 1, policy_version 38972 (0.0007) [2023-10-08 01:27:09,038][52060] Updated weights for policy 0, policy_version 38470 (0.0008) [2023-10-08 01:27:09,423][52060] Updated weights for policy 0, policy_version 38480 (0.0009) [2023-10-08 01:27:09,788][52060] Updated weights for policy 0, policy_version 38490 (0.0009) [2023-10-08 01:27:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 79331328. Throughput: 0: 1695.5, 1: 1732.1. Samples: 19841392. Policy #0 lag: (min: 16.0, avg: 41.6, max: 48.0) [2023-10-08 01:27:11,211][50642] Avg episode reward: [(0, '19.220'), (1, '23.870')] [2023-10-08 01:27:11,346][52059] Updated weights for policy 1, policy_version 38982 (0.0008) [2023-10-08 01:27:11,709][52059] Updated weights for policy 1, policy_version 38992 (0.0009) [2023-10-08 01:27:12,079][52059] Updated weights for policy 1, policy_version 39002 (0.0009) [2023-10-08 01:27:12,288][51710] Saving new best policy, reward=23.870! [2023-10-08 01:27:13,789][52060] Updated weights for policy 0, policy_version 38500 (0.0010) [2023-10-08 01:27:14,154][52060] Updated weights for policy 0, policy_version 38510 (0.0009) [2023-10-08 01:27:14,534][52060] Updated weights for policy 0, policy_version 38520 (0.0008) [2023-10-08 01:27:15,811][52059] Updated weights for policy 1, policy_version 39012 (0.0008) [2023-10-08 01:27:16,203][52059] Updated weights for policy 1, policy_version 39022 (0.0009) [2023-10-08 01:27:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 79396864. Throughput: 0: 1693.4, 1: 1727.2. Samples: 19862140. Policy #0 lag: (min: 16.0, avg: 41.6, max: 48.0) [2023-10-08 01:27:16,211][50642] Avg episode reward: [(0, '18.480'), (1, '18.710')] [2023-10-08 01:27:16,574][52059] Updated weights for policy 1, policy_version 39032 (0.0007) [2023-10-08 01:27:18,513][52060] Updated weights for policy 0, policy_version 38530 (0.0009) [2023-10-08 01:27:18,876][52060] Updated weights for policy 0, policy_version 38540 (0.0007) [2023-10-08 01:27:19,245][52060] Updated weights for policy 0, policy_version 38550 (0.0009) [2023-10-08 01:27:19,609][52060] Updated weights for policy 0, policy_version 38560 (0.0009) [2023-10-08 01:27:20,457][52059] Updated weights for policy 1, policy_version 39042 (0.0008) [2023-10-08 01:27:20,816][52059] Updated weights for policy 1, policy_version 39052 (0.0009) [2023-10-08 01:27:21,182][52059] Updated weights for policy 1, policy_version 39062 (0.0010) [2023-10-08 01:27:21,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 79462400. Throughput: 0: 1711.3, 1: 1724.3. Samples: 19872806. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:27:21,211][50642] Avg episode reward: [(0, '16.570'), (1, '18.100')] [2023-10-08 01:27:21,549][52059] Updated weights for policy 1, policy_version 39072 (0.0009) [2023-10-08 01:27:23,528][52060] Updated weights for policy 0, policy_version 38570 (0.0007) [2023-10-08 01:27:23,888][52060] Updated weights for policy 0, policy_version 38580 (0.0009) [2023-10-08 01:27:24,260][52060] Updated weights for policy 0, policy_version 38590 (0.0010) [2023-10-08 01:27:25,494][52059] Updated weights for policy 1, policy_version 39082 (0.0010) [2023-10-08 01:27:25,851][52059] Updated weights for policy 1, policy_version 39092 (0.0010) [2023-10-08 01:27:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 79527936. Throughput: 0: 1699.8, 1: 1725.1. Samples: 19893378. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:27:26,211][50642] Avg episode reward: [(0, '19.580'), (1, '22.310')] [2023-10-08 01:27:26,224][52059] Updated weights for policy 1, policy_version 39102 (0.0010) [2023-10-08 01:27:28,241][52060] Updated weights for policy 0, policy_version 38600 (0.0007) [2023-10-08 01:27:28,604][52060] Updated weights for policy 0, policy_version 38610 (0.0008) [2023-10-08 01:27:28,973][52060] Updated weights for policy 0, policy_version 38620 (0.0008) [2023-10-08 01:27:30,350][52059] Updated weights for policy 1, policy_version 39112 (0.0009) [2023-10-08 01:27:30,719][52059] Updated weights for policy 1, policy_version 39122 (0.0010) [2023-10-08 01:27:31,091][52059] Updated weights for policy 1, policy_version 39132 (0.0010) [2023-10-08 01:27:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 79593472. Throughput: 0: 1724.2, 1: 1710.1. Samples: 19913838. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:27:31,211][50642] Avg episode reward: [(0, '17.750'), (1, '19.650')] [2023-10-08 01:27:32,862][52060] Updated weights for policy 0, policy_version 38630 (0.0007) [2023-10-08 01:27:33,230][52060] Updated weights for policy 0, policy_version 38640 (0.0007) [2023-10-08 01:27:33,596][52060] Updated weights for policy 0, policy_version 38650 (0.0010) [2023-10-08 01:27:35,095][52059] Updated weights for policy 1, policy_version 39142 (0.0009) [2023-10-08 01:27:35,459][52059] Updated weights for policy 1, policy_version 39152 (0.0009) [2023-10-08 01:27:35,826][52059] Updated weights for policy 1, policy_version 39162 (0.0010) [2023-10-08 01:27:36,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 79691776. Throughput: 0: 1706.4, 1: 1730.3. Samples: 19924128. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:27:36,211][50642] Avg episode reward: [(0, '15.950'), (1, '16.580')] [2023-10-08 01:27:37,605][52060] Updated weights for policy 0, policy_version 38660 (0.0010) [2023-10-08 01:27:37,979][52060] Updated weights for policy 0, policy_version 38670 (0.0008) [2023-10-08 01:27:38,346][52060] Updated weights for policy 0, policy_version 38680 (0.0009) [2023-10-08 01:27:39,890][52059] Updated weights for policy 1, policy_version 39172 (0.0009) [2023-10-08 01:27:40,264][52059] Updated weights for policy 1, policy_version 39182 (0.0009) [2023-10-08 01:27:40,637][52059] Updated weights for policy 1, policy_version 39192 (0.0007) [2023-10-08 01:27:41,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 79757312. Throughput: 0: 1712.1, 1: 1730.3. Samples: 19945144. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:27:41,211][50642] Avg episode reward: [(0, '18.100'), (1, '20.690')] [2023-10-08 01:27:42,341][52060] Updated weights for policy 0, policy_version 38690 (0.0009) [2023-10-08 01:27:42,710][52060] Updated weights for policy 0, policy_version 38700 (0.0008) [2023-10-08 01:27:43,082][52060] Updated weights for policy 0, policy_version 38710 (0.0011) [2023-10-08 01:27:43,444][52060] Updated weights for policy 0, policy_version 38720 (0.0010) [2023-10-08 01:27:44,416][52059] Updated weights for policy 1, policy_version 39202 (0.0007) [2023-10-08 01:27:44,781][52059] Updated weights for policy 1, policy_version 39212 (0.0007) [2023-10-08 01:27:45,139][52059] Updated weights for policy 1, policy_version 39222 (0.0012) [2023-10-08 01:27:45,512][52059] Updated weights for policy 1, policy_version 39232 (0.0009) [2023-10-08 01:27:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 79822848. Throughput: 0: 1734.1, 1: 1710.0. Samples: 19965442. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:27:46,211][50642] Avg episode reward: [(0, '18.320'), (1, '21.710')] [2023-10-08 01:27:47,371][52060] Updated weights for policy 0, policy_version 38730 (0.0008) [2023-10-08 01:27:47,738][52060] Updated weights for policy 0, policy_version 38740 (0.0008) [2023-10-08 01:27:48,102][52060] Updated weights for policy 0, policy_version 38750 (0.0010) [2023-10-08 01:27:49,427][52059] Updated weights for policy 1, policy_version 39242 (0.0011) [2023-10-08 01:27:49,780][52059] Updated weights for policy 1, policy_version 39252 (0.0011) [2023-10-08 01:27:50,155][52059] Updated weights for policy 1, policy_version 39262 (0.0010) [2023-10-08 01:27:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 79888384. Throughput: 0: 1701.6, 1: 1741.8. Samples: 19976182. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:27:51,211][50642] Avg episode reward: [(0, '18.840'), (1, '16.120')] [2023-10-08 01:27:52,115][52060] Updated weights for policy 0, policy_version 38760 (0.0009) [2023-10-08 01:27:52,489][52060] Updated weights for policy 0, policy_version 38770 (0.0009) [2023-10-08 01:27:52,859][52060] Updated weights for policy 0, policy_version 38780 (0.0007) [2023-10-08 01:27:54,229][52059] Updated weights for policy 1, policy_version 39272 (0.0009) [2023-10-08 01:27:54,600][52059] Updated weights for policy 1, policy_version 39282 (0.0007) [2023-10-08 01:27:54,967][52059] Updated weights for policy 1, policy_version 39292 (0.0007) [2023-10-08 01:27:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 79953920. Throughput: 0: 1720.7, 1: 1714.7. Samples: 19995984. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:27:56,211][50642] Avg episode reward: [(0, '18.730'), (1, '18.600')] [2023-10-08 01:27:56,942][52060] Updated weights for policy 0, policy_version 38790 (0.0008) [2023-10-08 01:27:57,319][52060] Updated weights for policy 0, policy_version 38800 (0.0007) [2023-10-08 01:27:57,698][52060] Updated weights for policy 0, policy_version 38810 (0.0009) [2023-10-08 01:27:58,836][52059] Updated weights for policy 1, policy_version 39302 (0.0009) [2023-10-08 01:27:59,201][52059] Updated weights for policy 1, policy_version 39312 (0.0008) [2023-10-08 01:27:59,564][52059] Updated weights for policy 1, policy_version 39322 (0.0009) [2023-10-08 01:28:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 80019456. Throughput: 0: 1727.1, 1: 1709.8. Samples: 20016798. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:28:01,211][50642] Avg episode reward: [(0, '17.800'), (1, '21.050')] [2023-10-08 01:28:01,717][52060] Updated weights for policy 0, policy_version 38820 (0.0009) [2023-10-08 01:28:02,089][52060] Updated weights for policy 0, policy_version 38830 (0.0007) [2023-10-08 01:28:02,459][52060] Updated weights for policy 0, policy_version 38840 (0.0009) [2023-10-08 01:28:03,666][52059] Updated weights for policy 1, policy_version 39332 (0.0008) [2023-10-08 01:28:04,028][52059] Updated weights for policy 1, policy_version 39342 (0.0008) [2023-10-08 01:28:04,380][52059] Updated weights for policy 1, policy_version 39352 (0.0007) [2023-10-08 01:28:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 80084992. Throughput: 0: 1702.8, 1: 1724.1. Samples: 20027014. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:28:06,211][50642] Avg episode reward: [(0, '19.600'), (1, '16.400')] [2023-10-08 01:28:06,367][52060] Updated weights for policy 0, policy_version 38850 (0.0008) [2023-10-08 01:28:06,743][52060] Updated weights for policy 0, policy_version 38860 (0.0008) [2023-10-08 01:28:07,112][52060] Updated weights for policy 0, policy_version 38870 (0.0007) [2023-10-08 01:28:07,475][52060] Updated weights for policy 0, policy_version 38880 (0.0008) [2023-10-08 01:28:08,045][52059] Updated weights for policy 1, policy_version 39362 (0.0008) [2023-10-08 01:28:08,408][52059] Updated weights for policy 1, policy_version 39372 (0.0008) [2023-10-08 01:28:08,775][52059] Updated weights for policy 1, policy_version 39382 (0.0008) [2023-10-08 01:28:09,144][52059] Updated weights for policy 1, policy_version 39392 (0.0009) [2023-10-08 01:28:11,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 80150528. Throughput: 0: 1720.7, 1: 1711.1. Samples: 20047808. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:28:11,211][50642] Avg episode reward: [(0, '20.200'), (1, '16.250')] [2023-10-08 01:28:11,601][52060] Updated weights for policy 0, policy_version 38890 (0.0009) [2023-10-08 01:28:11,973][52060] Updated weights for policy 0, policy_version 38900 (0.0009) [2023-10-08 01:28:12,347][52060] Updated weights for policy 0, policy_version 38910 (0.0008) [2023-10-08 01:28:13,116][52059] Updated weights for policy 1, policy_version 39402 (0.0007) [2023-10-08 01:28:13,481][52059] Updated weights for policy 1, policy_version 39412 (0.0008) [2023-10-08 01:28:13,842][52059] Updated weights for policy 1, policy_version 39422 (0.0008) [2023-10-08 01:28:16,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 80216064. Throughput: 0: 1711.0, 1: 1733.1. Samples: 20068820. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:28:16,211][50642] Avg episode reward: [(0, '18.930'), (1, '19.500')] [2023-10-08 01:28:16,414][52060] Updated weights for policy 0, policy_version 38920 (0.0007) [2023-10-08 01:28:16,784][52060] Updated weights for policy 0, policy_version 38930 (0.0008) [2023-10-08 01:28:17,155][52060] Updated weights for policy 0, policy_version 38940 (0.0008) [2023-10-08 01:28:17,780][52059] Updated weights for policy 1, policy_version 39432 (0.0009) [2023-10-08 01:28:18,142][52059] Updated weights for policy 1, policy_version 39442 (0.0008) [2023-10-08 01:28:18,505][52059] Updated weights for policy 1, policy_version 39452 (0.0008) [2023-10-08 01:28:21,181][52060] Updated weights for policy 0, policy_version 38950 (0.0010) [2023-10-08 01:28:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 80281600. Throughput: 0: 1709.0, 1: 1715.4. Samples: 20078224. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 01:28:21,211][50642] Avg episode reward: [(0, '19.890'), (1, '15.300')] [2023-10-08 01:28:21,549][52060] Updated weights for policy 0, policy_version 38960 (0.0009) [2023-10-08 01:28:21,921][52060] Updated weights for policy 0, policy_version 38970 (0.0008) [2023-10-08 01:28:22,470][52059] Updated weights for policy 1, policy_version 39462 (0.0007) [2023-10-08 01:28:22,828][52059] Updated weights for policy 1, policy_version 39472 (0.0009) [2023-10-08 01:28:23,192][52059] Updated weights for policy 1, policy_version 39482 (0.0008) [2023-10-08 01:28:25,899][52060] Updated weights for policy 0, policy_version 38980 (0.0007) [2023-10-08 01:28:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 80347136. Throughput: 0: 1707.5, 1: 1720.2. Samples: 20099388. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 01:28:26,211][50642] Avg episode reward: [(0, '19.290'), (1, '16.410')] [2023-10-08 01:28:26,270][52060] Updated weights for policy 0, policy_version 38990 (0.0008) [2023-10-08 01:28:26,644][52060] Updated weights for policy 0, policy_version 39000 (0.0008) [2023-10-08 01:28:27,181][52059] Updated weights for policy 1, policy_version 39492 (0.0009) [2023-10-08 01:28:27,533][52059] Updated weights for policy 1, policy_version 39502 (0.0007) [2023-10-08 01:28:27,893][52059] Updated weights for policy 1, policy_version 39512 (0.0008) [2023-10-08 01:28:30,569][52060] Updated weights for policy 0, policy_version 39010 (0.0010) [2023-10-08 01:28:30,941][52060] Updated weights for policy 0, policy_version 39020 (0.0011) [2023-10-08 01:28:31,211][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.2, 300 sec: 13662.6). Total num frames: 80412672. Throughput: 0: 1698.8, 1: 1741.3. Samples: 20120246. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 01:28:31,212][50642] Avg episode reward: [(0, '17.910'), (1, '18.350')] [2023-10-08 01:28:31,309][52060] Updated weights for policy 0, policy_version 39030 (0.0009) [2023-10-08 01:28:31,672][52060] Updated weights for policy 0, policy_version 39040 (0.0009) [2023-10-08 01:28:31,776][52059] Updated weights for policy 1, policy_version 39522 (0.0007) [2023-10-08 01:28:32,145][52059] Updated weights for policy 1, policy_version 39532 (0.0007) [2023-10-08 01:28:32,523][52059] Updated weights for policy 1, policy_version 39542 (0.0007) [2023-10-08 01:28:32,882][52059] Updated weights for policy 1, policy_version 39552 (0.0008) [2023-10-08 01:28:35,424][52060] Updated weights for policy 0, policy_version 39050 (0.0008) [2023-10-08 01:28:35,797][52060] Updated weights for policy 0, policy_version 39060 (0.0008) [2023-10-08 01:28:36,160][52060] Updated weights for policy 0, policy_version 39070 (0.0009) [2023-10-08 01:28:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 80478208. Throughput: 0: 1708.6, 1: 1715.2. Samples: 20130250. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 01:28:36,211][50642] Avg episode reward: [(0, '19.600'), (1, '21.430')] [2023-10-08 01:28:36,858][52059] Updated weights for policy 1, policy_version 39562 (0.0008) [2023-10-08 01:28:37,217][52059] Updated weights for policy 1, policy_version 39572 (0.0008) [2023-10-08 01:28:37,584][52059] Updated weights for policy 1, policy_version 39582 (0.0008) [2023-10-08 01:28:40,053][52060] Updated weights for policy 0, policy_version 39080 (0.0008) [2023-10-08 01:28:40,415][52060] Updated weights for policy 0, policy_version 39090 (0.0008) [2023-10-08 01:28:40,784][52060] Updated weights for policy 0, policy_version 39100 (0.0008) [2023-10-08 01:28:41,210][50642] Fps is (10 sec: 16384.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 80576512. Throughput: 0: 1718.6, 1: 1744.6. Samples: 20151828. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 01:28:41,211][50642] Avg episode reward: [(0, '19.250'), (1, '17.660')] [2023-10-08 01:28:41,443][52059] Updated weights for policy 1, policy_version 39592 (0.0008) [2023-10-08 01:28:41,804][52059] Updated weights for policy 1, policy_version 39602 (0.0009) [2023-10-08 01:28:42,158][52059] Updated weights for policy 1, policy_version 39612 (0.0009) [2023-10-08 01:28:44,645][52060] Updated weights for policy 0, policy_version 39110 (0.0009) [2023-10-08 01:28:45,033][52060] Updated weights for policy 0, policy_version 39120 (0.0009) [2023-10-08 01:28:45,402][52060] Updated weights for policy 0, policy_version 39130 (0.0008) [2023-10-08 01:28:46,136][52059] Updated weights for policy 1, policy_version 39622 (0.0009) [2023-10-08 01:28:46,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 80642048. Throughput: 0: 1695.9, 1: 1753.9. Samples: 20172040. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 01:28:46,211][50642] Avg episode reward: [(0, '18.520'), (1, '18.690')] [2023-10-08 01:28:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000039136_40075264.pth... [2023-10-08 01:28:46,257][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000037536_38436864.pth [2023-10-08 01:28:46,503][52059] Updated weights for policy 1, policy_version 39632 (0.0009) [2023-10-08 01:28:46,871][52059] Updated weights for policy 1, policy_version 39642 (0.0009) [2023-10-08 01:28:47,081][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000039648_40599552.pth... [2023-10-08 01:28:47,110][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000038016_38928384.pth [2023-10-08 01:28:49,401][52060] Updated weights for policy 0, policy_version 39140 (0.0008) [2023-10-08 01:28:49,772][52060] Updated weights for policy 0, policy_version 39150 (0.0007) [2023-10-08 01:28:50,139][52060] Updated weights for policy 0, policy_version 39160 (0.0008) [2023-10-08 01:28:50,904][52059] Updated weights for policy 1, policy_version 39652 (0.0009) [2023-10-08 01:28:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 80707584. Throughput: 0: 1726.0, 1: 1732.6. Samples: 20182650. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 01:28:51,211][50642] Avg episode reward: [(0, '21.380'), (1, '22.930')] [2023-10-08 01:28:51,281][52059] Updated weights for policy 1, policy_version 39662 (0.0011) [2023-10-08 01:28:51,647][52059] Updated weights for policy 1, policy_version 39672 (0.0009) [2023-10-08 01:28:54,264][52060] Updated weights for policy 0, policy_version 39170 (0.0010) [2023-10-08 01:28:54,626][52060] Updated weights for policy 0, policy_version 39180 (0.0011) [2023-10-08 01:28:54,995][52060] Updated weights for policy 0, policy_version 39190 (0.0009) [2023-10-08 01:28:55,366][52060] Updated weights for policy 0, policy_version 39200 (0.0007) [2023-10-08 01:28:55,386][52059] Updated weights for policy 1, policy_version 39682 (0.0008) [2023-10-08 01:28:55,754][52059] Updated weights for policy 1, policy_version 39692 (0.0008) [2023-10-08 01:28:56,119][52059] Updated weights for policy 1, policy_version 39702 (0.0007) [2023-10-08 01:28:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 80773120. Throughput: 0: 1705.4, 1: 1744.9. Samples: 20203072. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 01:28:56,211][50642] Avg episode reward: [(0, '20.740'), (1, '19.370')] [2023-10-08 01:28:56,485][52059] Updated weights for policy 1, policy_version 39712 (0.0010) [2023-10-08 01:28:59,332][52060] Updated weights for policy 0, policy_version 39210 (0.0007) [2023-10-08 01:28:59,705][52060] Updated weights for policy 0, policy_version 39220 (0.0009) [2023-10-08 01:29:00,070][52060] Updated weights for policy 0, policy_version 39230 (0.0008) [2023-10-08 01:29:00,425][52059] Updated weights for policy 1, policy_version 39722 (0.0008) [2023-10-08 01:29:00,790][52059] Updated weights for policy 1, policy_version 39732 (0.0009) [2023-10-08 01:29:01,154][52059] Updated weights for policy 1, policy_version 39742 (0.0007) [2023-10-08 01:29:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 80838656. Throughput: 0: 1699.6, 1: 1727.4. Samples: 20223036. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 01:29:01,211][50642] Avg episode reward: [(0, '19.120'), (1, '18.660')] [2023-10-08 01:29:04,088][52060] Updated weights for policy 0, policy_version 39240 (0.0010) [2023-10-08 01:29:04,456][52060] Updated weights for policy 0, policy_version 39250 (0.0009) [2023-10-08 01:29:04,811][52060] Updated weights for policy 0, policy_version 39260 (0.0009) [2023-10-08 01:29:05,033][52059] Updated weights for policy 1, policy_version 39752 (0.0008) [2023-10-08 01:29:05,401][52059] Updated weights for policy 1, policy_version 39762 (0.0007) [2023-10-08 01:29:05,760][52059] Updated weights for policy 1, policy_version 39772 (0.0010) [2023-10-08 01:29:06,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 80936960. Throughput: 0: 1726.4, 1: 1744.5. Samples: 20234418. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 01:29:06,211][50642] Avg episode reward: [(0, '20.160'), (1, '20.880')] [2023-10-08 01:29:08,845][52060] Updated weights for policy 0, policy_version 39270 (0.0008) [2023-10-08 01:29:09,210][52060] Updated weights for policy 0, policy_version 39280 (0.0007) [2023-10-08 01:29:09,582][52060] Updated weights for policy 0, policy_version 39290 (0.0008) [2023-10-08 01:29:09,676][52059] Updated weights for policy 1, policy_version 39782 (0.0009) [2023-10-08 01:29:10,043][52059] Updated weights for policy 1, policy_version 39792 (0.0009) [2023-10-08 01:29:10,412][52059] Updated weights for policy 1, policy_version 39802 (0.0009) [2023-10-08 01:29:11,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 81002496. Throughput: 0: 1699.2, 1: 1738.0. Samples: 20254064. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 01:29:11,211][50642] Avg episode reward: [(0, '21.150'), (1, '21.470')] [2023-10-08 01:29:13,562][52060] Updated weights for policy 0, policy_version 39300 (0.0008) [2023-10-08 01:29:13,934][52060] Updated weights for policy 0, policy_version 39310 (0.0009) [2023-10-08 01:29:14,294][52060] Updated weights for policy 0, policy_version 39320 (0.0010) [2023-10-08 01:29:14,347][52059] Updated weights for policy 1, policy_version 39812 (0.0008) [2023-10-08 01:29:14,716][52059] Updated weights for policy 1, policy_version 39822 (0.0009) [2023-10-08 01:29:15,077][52059] Updated weights for policy 1, policy_version 39832 (0.0010) [2023-10-08 01:29:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 81068032. Throughput: 0: 1702.1, 1: 1724.1. Samples: 20274426. Policy #0 lag: (min: 31.0, avg: 31.0, max: 32.0) [2023-10-08 01:29:16,211][50642] Avg episode reward: [(0, '18.970'), (1, '19.220')] [2023-10-08 01:29:18,139][52060] Updated weights for policy 0, policy_version 39330 (0.0009) [2023-10-08 01:29:18,509][52060] Updated weights for policy 0, policy_version 39340 (0.0011) [2023-10-08 01:29:18,865][52060] Updated weights for policy 0, policy_version 39350 (0.0009) [2023-10-08 01:29:18,906][52059] Updated weights for policy 1, policy_version 39842 (0.0008) [2023-10-08 01:29:19,231][52060] Updated weights for policy 0, policy_version 39360 (0.0008) [2023-10-08 01:29:19,280][52059] Updated weights for policy 1, policy_version 39852 (0.0009) [2023-10-08 01:29:19,639][52059] Updated weights for policy 1, policy_version 39862 (0.0008) [2023-10-08 01:29:20,009][52059] Updated weights for policy 1, policy_version 39872 (0.0009) [2023-10-08 01:29:21,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 81133568. Throughput: 0: 1705.8, 1: 1754.7. Samples: 20285970. Policy #0 lag: (min: 31.0, avg: 31.0, max: 32.0) [2023-10-08 01:29:21,211][50642] Avg episode reward: [(0, '18.850'), (1, '20.920')] [2023-10-08 01:29:23,029][52060] Updated weights for policy 0, policy_version 39370 (0.0011) [2023-10-08 01:29:23,402][52060] Updated weights for policy 0, policy_version 39380 (0.0010) [2023-10-08 01:29:23,759][52060] Updated weights for policy 0, policy_version 39390 (0.0008) [2023-10-08 01:29:23,828][52059] Updated weights for policy 1, policy_version 39882 (0.0008) [2023-10-08 01:29:24,180][52059] Updated weights for policy 1, policy_version 39892 (0.0007) [2023-10-08 01:29:24,554][52059] Updated weights for policy 1, policy_version 39902 (0.0007) [2023-10-08 01:29:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 81199104. Throughput: 0: 1698.6, 1: 1724.7. Samples: 20305874. Policy #0 lag: (min: 31.0, avg: 31.0, max: 32.0) [2023-10-08 01:29:26,211][50642] Avg episode reward: [(0, '21.040'), (1, '22.200')] [2023-10-08 01:29:27,833][52060] Updated weights for policy 0, policy_version 39400 (0.0008) [2023-10-08 01:29:28,196][52060] Updated weights for policy 0, policy_version 39410 (0.0009) [2023-10-08 01:29:28,464][52059] Updated weights for policy 1, policy_version 39912 (0.0008) [2023-10-08 01:29:28,570][52060] Updated weights for policy 0, policy_version 39420 (0.0008) [2023-10-08 01:29:28,817][52059] Updated weights for policy 1, policy_version 39922 (0.0008) [2023-10-08 01:29:29,184][52059] Updated weights for policy 1, policy_version 39932 (0.0007) [2023-10-08 01:29:31,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 81264640. Throughput: 0: 1723.0, 1: 1723.1. Samples: 20327112. Policy #0 lag: (min: 31.0, avg: 31.0, max: 32.0) [2023-10-08 01:29:31,211][50642] Avg episode reward: [(0, '19.080'), (1, '18.220')] [2023-10-08 01:29:32,643][52060] Updated weights for policy 0, policy_version 39430 (0.0009) [2023-10-08 01:29:33,026][52060] Updated weights for policy 0, policy_version 39440 (0.0009) [2023-10-08 01:29:33,124][52059] Updated weights for policy 1, policy_version 39942 (0.0007) [2023-10-08 01:29:33,393][52060] Updated weights for policy 0, policy_version 39450 (0.0007) [2023-10-08 01:29:33,497][52059] Updated weights for policy 1, policy_version 39952 (0.0007) [2023-10-08 01:29:33,855][52059] Updated weights for policy 1, policy_version 39962 (0.0007) [2023-10-08 01:29:36,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 81330176. Throughput: 0: 1686.9, 1: 1732.8. Samples: 20336534. Policy #0 lag: (min: 31.0, avg: 31.0, max: 32.0) [2023-10-08 01:29:36,211][50642] Avg episode reward: [(0, '19.270'), (1, '20.410')] [2023-10-08 01:29:37,314][52060] Updated weights for policy 0, policy_version 39460 (0.0009) [2023-10-08 01:29:37,677][52060] Updated weights for policy 0, policy_version 39470 (0.0007) [2023-10-08 01:29:37,884][52059] Updated weights for policy 1, policy_version 39972 (0.0009) [2023-10-08 01:29:38,046][52060] Updated weights for policy 0, policy_version 39480 (0.0008) [2023-10-08 01:29:38,274][52059] Updated weights for policy 1, policy_version 39982 (0.0009) [2023-10-08 01:29:38,649][52059] Updated weights for policy 1, policy_version 39992 (0.0009) [2023-10-08 01:29:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 81395712. Throughput: 0: 1706.8, 1: 1722.7. Samples: 20357400. Policy #0 lag: (min: 31.0, avg: 31.0, max: 32.0) [2023-10-08 01:29:41,211][50642] Avg episode reward: [(0, '20.140'), (1, '21.200')] [2023-10-08 01:29:42,223][52060] Updated weights for policy 0, policy_version 39490 (0.0008) [2023-10-08 01:29:42,595][52060] Updated weights for policy 0, policy_version 39500 (0.0007) [2023-10-08 01:29:42,601][52059] Updated weights for policy 1, policy_version 40002 (0.0010) [2023-10-08 01:29:42,961][52060] Updated weights for policy 0, policy_version 39510 (0.0010) [2023-10-08 01:29:42,962][52059] Updated weights for policy 1, policy_version 40012 (0.0007) [2023-10-08 01:29:43,334][52059] Updated weights for policy 1, policy_version 40022 (0.0009) [2023-10-08 01:29:43,338][52060] Updated weights for policy 0, policy_version 39520 (0.0007) [2023-10-08 01:29:43,694][52059] Updated weights for policy 1, policy_version 40032 (0.0010) [2023-10-08 01:29:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 81461248. Throughput: 0: 1717.8, 1: 1738.4. Samples: 20378566. Policy #0 lag: (min: 28.0, avg: 28.0, max: 28.0) [2023-10-08 01:29:46,212][50642] Avg episode reward: [(0, '19.770'), (1, '19.870')] [2023-10-08 01:29:47,384][52060] Updated weights for policy 0, policy_version 39530 (0.0008) [2023-10-08 01:29:47,723][52059] Updated weights for policy 1, policy_version 40042 (0.0009) [2023-10-08 01:29:47,752][52060] Updated weights for policy 0, policy_version 39540 (0.0009) [2023-10-08 01:29:48,076][52059] Updated weights for policy 1, policy_version 40052 (0.0008) [2023-10-08 01:29:48,113][52060] Updated weights for policy 0, policy_version 39550 (0.0008) [2023-10-08 01:29:48,443][52059] Updated weights for policy 1, policy_version 40062 (0.0010) [2023-10-08 01:29:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 81526784. Throughput: 0: 1693.4, 1: 1717.6. Samples: 20387914. Policy #0 lag: (min: 28.0, avg: 28.0, max: 28.0) [2023-10-08 01:29:51,211][50642] Avg episode reward: [(0, '17.220'), (1, '19.740')] [2023-10-08 01:29:51,977][52060] Updated weights for policy 0, policy_version 39560 (0.0009) [2023-10-08 01:29:52,339][52060] Updated weights for policy 0, policy_version 39570 (0.0009) [2023-10-08 01:29:52,412][52059] Updated weights for policy 1, policy_version 40072 (0.0008) [2023-10-08 01:29:52,718][52060] Updated weights for policy 0, policy_version 39580 (0.0008) [2023-10-08 01:29:52,771][52059] Updated weights for policy 1, policy_version 40082 (0.0007) [2023-10-08 01:29:53,130][52059] Updated weights for policy 1, policy_version 40092 (0.0007) [2023-10-08 01:29:56,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 81592320. Throughput: 0: 1721.8, 1: 1722.4. Samples: 20409054. Policy #0 lag: (min: 28.0, avg: 28.0, max: 28.0) [2023-10-08 01:29:56,211][50642] Avg episode reward: [(0, '20.500'), (1, '19.960')] [2023-10-08 01:29:56,911][52060] Updated weights for policy 0, policy_version 39590 (0.0009) [2023-10-08 01:29:57,165][52059] Updated weights for policy 1, policy_version 40102 (0.0007) [2023-10-08 01:29:57,275][52060] Updated weights for policy 0, policy_version 39600 (0.0010) [2023-10-08 01:29:57,523][52059] Updated weights for policy 1, policy_version 40112 (0.0009) [2023-10-08 01:29:57,641][52060] Updated weights for policy 0, policy_version 39610 (0.0007) [2023-10-08 01:29:57,894][52059] Updated weights for policy 1, policy_version 40122 (0.0007) [2023-10-08 01:30:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 81657856. Throughput: 0: 1725.1, 1: 1736.4. Samples: 20430194. Policy #0 lag: (min: 28.0, avg: 28.0, max: 28.0) [2023-10-08 01:30:01,211][50642] Avg episode reward: [(0, '19.320'), (1, '22.470')] [2023-10-08 01:30:01,634][52060] Updated weights for policy 0, policy_version 39620 (0.0007) [2023-10-08 01:30:01,854][52059] Updated weights for policy 1, policy_version 40132 (0.0008) [2023-10-08 01:30:01,999][52060] Updated weights for policy 0, policy_version 39630 (0.0009) [2023-10-08 01:30:02,221][52059] Updated weights for policy 1, policy_version 40142 (0.0008) [2023-10-08 01:30:02,353][52060] Updated weights for policy 0, policy_version 39640 (0.0008) [2023-10-08 01:30:02,578][52059] Updated weights for policy 1, policy_version 40152 (0.0008) [2023-10-08 01:30:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 81723392. Throughput: 0: 1709.4, 1: 1702.3. Samples: 20439496. Policy #0 lag: (min: 28.0, avg: 28.0, max: 28.0) [2023-10-08 01:30:06,211][50642] Avg episode reward: [(0, '17.630'), (1, '20.930')] [2023-10-08 01:30:06,468][52060] Updated weights for policy 0, policy_version 39650 (0.0009) [2023-10-08 01:30:06,625][52059] Updated weights for policy 1, policy_version 40162 (0.0008) [2023-10-08 01:30:06,826][52060] Updated weights for policy 0, policy_version 39660 (0.0007) [2023-10-08 01:30:06,981][52059] Updated weights for policy 1, policy_version 40172 (0.0009) [2023-10-08 01:30:07,197][52060] Updated weights for policy 0, policy_version 39670 (0.0008) [2023-10-08 01:30:07,347][52059] Updated weights for policy 1, policy_version 40182 (0.0009) [2023-10-08 01:30:07,568][52060] Updated weights for policy 0, policy_version 39680 (0.0008) [2023-10-08 01:30:07,709][52059] Updated weights for policy 1, policy_version 40192 (0.0007) [2023-10-08 01:30:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 81788928. Throughput: 0: 1712.2, 1: 1725.7. Samples: 20460578. Policy #0 lag: (min: 28.0, avg: 28.0, max: 28.0) [2023-10-08 01:30:11,211][50642] Avg episode reward: [(0, '19.260'), (1, '20.290')] [2023-10-08 01:30:11,395][52060] Updated weights for policy 0, policy_version 39690 (0.0008) [2023-10-08 01:30:11,583][52059] Updated weights for policy 1, policy_version 40202 (0.0008) [2023-10-08 01:30:11,771][52060] Updated weights for policy 0, policy_version 39700 (0.0010) [2023-10-08 01:30:11,940][52059] Updated weights for policy 1, policy_version 40212 (0.0008) [2023-10-08 01:30:12,140][52060] Updated weights for policy 0, policy_version 39710 (0.0008) [2023-10-08 01:30:12,309][52059] Updated weights for policy 1, policy_version 40222 (0.0008) [2023-10-08 01:30:16,111][52060] Updated weights for policy 0, policy_version 39720 (0.0008) [2023-10-08 01:30:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 81854464. Throughput: 0: 1709.4, 1: 1727.1. Samples: 20481754. Policy #0 lag: (min: 14.0, avg: 22.7, max: 46.0) [2023-10-08 01:30:16,211][50642] Avg episode reward: [(0, '20.740'), (1, '23.170')] [2023-10-08 01:30:16,255][52059] Updated weights for policy 1, policy_version 40232 (0.0008) [2023-10-08 01:30:16,481][52060] Updated weights for policy 0, policy_version 39730 (0.0007) [2023-10-08 01:30:16,627][52059] Updated weights for policy 1, policy_version 40242 (0.0009) [2023-10-08 01:30:16,847][52060] Updated weights for policy 0, policy_version 39740 (0.0008) [2023-10-08 01:30:16,983][52059] Updated weights for policy 1, policy_version 40252 (0.0008) [2023-10-08 01:30:20,869][52060] Updated weights for policy 0, policy_version 39750 (0.0007) [2023-10-08 01:30:20,985][52059] Updated weights for policy 1, policy_version 40262 (0.0008) [2023-10-08 01:30:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 81920000. Throughput: 0: 1713.5, 1: 1715.6. Samples: 20490842. Policy #0 lag: (min: 14.0, avg: 22.7, max: 46.0) [2023-10-08 01:30:21,211][50642] Avg episode reward: [(0, '17.500'), (1, '20.430')] [2023-10-08 01:30:21,243][52060] Updated weights for policy 0, policy_version 39760 (0.0009) [2023-10-08 01:30:21,355][52059] Updated weights for policy 1, policy_version 40272 (0.0008) [2023-10-08 01:30:21,603][52060] Updated weights for policy 0, policy_version 39770 (0.0008) [2023-10-08 01:30:21,723][52059] Updated weights for policy 1, policy_version 40282 (0.0009) [2023-10-08 01:30:25,583][52060] Updated weights for policy 0, policy_version 39780 (0.0009) [2023-10-08 01:30:25,641][52059] Updated weights for policy 1, policy_version 40292 (0.0008) [2023-10-08 01:30:25,946][52060] Updated weights for policy 0, policy_version 39790 (0.0008) [2023-10-08 01:30:26,035][52059] Updated weights for policy 1, policy_version 40302 (0.0007) [2023-10-08 01:30:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 81985536. Throughput: 0: 1713.2, 1: 1732.0. Samples: 20512436. Policy #0 lag: (min: 14.0, avg: 22.7, max: 46.0) [2023-10-08 01:30:26,211][50642] Avg episode reward: [(0, '18.570'), (1, '22.450')] [2023-10-08 01:30:26,318][52060] Updated weights for policy 0, policy_version 39800 (0.0007) [2023-10-08 01:30:26,402][52059] Updated weights for policy 1, policy_version 40312 (0.0007) [2023-10-08 01:30:30,228][52059] Updated weights for policy 1, policy_version 40322 (0.0007) [2023-10-08 01:30:30,279][52060] Updated weights for policy 0, policy_version 39810 (0.0008) [2023-10-08 01:30:30,592][52059] Updated weights for policy 1, policy_version 40332 (0.0008) [2023-10-08 01:30:30,642][52060] Updated weights for policy 0, policy_version 39820 (0.0009) [2023-10-08 01:30:30,946][52059] Updated weights for policy 1, policy_version 40342 (0.0008) [2023-10-08 01:30:31,005][52060] Updated weights for policy 0, policy_version 39830 (0.0008) [2023-10-08 01:30:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 82051072. Throughput: 0: 1699.3, 1: 1717.5. Samples: 20532324. Policy #0 lag: (min: 14.0, avg: 22.7, max: 46.0) [2023-10-08 01:30:31,211][50642] Avg episode reward: [(0, '21.520'), (1, '20.220')] [2023-10-08 01:30:31,311][52059] Updated weights for policy 1, policy_version 40352 (0.0009) [2023-10-08 01:30:31,383][52060] Updated weights for policy 0, policy_version 39840 (0.0008) [2023-10-08 01:30:35,184][52059] Updated weights for policy 1, policy_version 40362 (0.0009) [2023-10-08 01:30:35,375][52060] Updated weights for policy 0, policy_version 39850 (0.0008) [2023-10-08 01:30:35,550][52059] Updated weights for policy 1, policy_version 40372 (0.0010) [2023-10-08 01:30:35,744][52060] Updated weights for policy 0, policy_version 39860 (0.0007) [2023-10-08 01:30:35,922][52059] Updated weights for policy 1, policy_version 40382 (0.0007) [2023-10-08 01:30:36,112][52060] Updated weights for policy 0, policy_version 39870 (0.0008) [2023-10-08 01:30:36,210][50642] Fps is (10 sec: 19660.7, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 82182144. Throughput: 0: 1711.9, 1: 1734.8. Samples: 20543016. Policy #0 lag: (min: 14.0, avg: 22.7, max: 46.0) [2023-10-08 01:30:36,211][50642] Avg episode reward: [(0, '19.220'), (1, '21.230')] [2023-10-08 01:30:39,920][52059] Updated weights for policy 1, policy_version 40392 (0.0007) [2023-10-08 01:30:40,217][52060] Updated weights for policy 0, policy_version 39880 (0.0007) [2023-10-08 01:30:40,285][52059] Updated weights for policy 1, policy_version 40402 (0.0007) [2023-10-08 01:30:40,584][52060] Updated weights for policy 0, policy_version 39890 (0.0009) [2023-10-08 01:30:40,643][52059] Updated weights for policy 1, policy_version 40412 (0.0007) [2023-10-08 01:30:40,950][52060] Updated weights for policy 0, policy_version 39900 (0.0009) [2023-10-08 01:30:41,210][50642] Fps is (10 sec: 19660.9, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 82247680. Throughput: 0: 1711.0, 1: 1737.1. Samples: 20564218. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:30:41,211][50642] Avg episode reward: [(0, '18.660'), (1, '22.570')] [2023-10-08 01:30:44,523][52059] Updated weights for policy 1, policy_version 40422 (0.0008) [2023-10-08 01:30:44,877][52059] Updated weights for policy 1, policy_version 40432 (0.0007) [2023-10-08 01:30:45,030][52060] Updated weights for policy 0, policy_version 39910 (0.0009) [2023-10-08 01:30:45,244][52059] Updated weights for policy 1, policy_version 40442 (0.0008) [2023-10-08 01:30:45,390][52060] Updated weights for policy 0, policy_version 39920 (0.0008) [2023-10-08 01:30:45,761][52060] Updated weights for policy 0, policy_version 39930 (0.0008) [2023-10-08 01:30:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 82313216. Throughput: 0: 1688.4, 1: 1711.9. Samples: 20583206. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:30:46,211][50642] Avg episode reward: [(0, '21.880'), (1, '20.680')] [2023-10-08 01:30:46,222][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000040448_41418752.pth... [2023-10-08 01:30:46,222][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000039936_40894464.pth... [2023-10-08 01:30:46,256][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000038336_39256064.pth [2023-10-08 01:30:46,258][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000038816_39747584.pth [2023-10-08 01:30:46,260][51605] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p0/milestones/checkpoint_000039936_40894464.pth [2023-10-08 01:30:46,262][51710] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p1/milestones/checkpoint_000040448_41418752.pth [2023-10-08 01:30:49,330][52059] Updated weights for policy 1, policy_version 40452 (0.0009) [2023-10-08 01:30:49,693][52059] Updated weights for policy 1, policy_version 40462 (0.0007) [2023-10-08 01:30:49,799][52060] Updated weights for policy 0, policy_version 39940 (0.0010) [2023-10-08 01:30:50,061][52059] Updated weights for policy 1, policy_version 40472 (0.0009) [2023-10-08 01:30:50,165][52060] Updated weights for policy 0, policy_version 39950 (0.0009) [2023-10-08 01:30:50,537][52060] Updated weights for policy 0, policy_version 39960 (0.0009) [2023-10-08 01:30:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 82378752. Throughput: 0: 1712.4, 1: 1740.8. Samples: 20594892. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:30:51,212][50642] Avg episode reward: [(0, '21.730'), (1, '20.380')] [2023-10-08 01:30:54,101][52059] Updated weights for policy 1, policy_version 40482 (0.0009) [2023-10-08 01:30:54,427][52060] Updated weights for policy 0, policy_version 39970 (0.0009) [2023-10-08 01:30:54,466][52059] Updated weights for policy 1, policy_version 40492 (0.0008) [2023-10-08 01:30:54,788][52060] Updated weights for policy 0, policy_version 39980 (0.0008) [2023-10-08 01:30:54,824][52059] Updated weights for policy 1, policy_version 40502 (0.0007) [2023-10-08 01:30:55,165][52060] Updated weights for policy 0, policy_version 39990 (0.0008) [2023-10-08 01:30:55,187][52059] Updated weights for policy 1, policy_version 40512 (0.0007) [2023-10-08 01:30:55,531][52060] Updated weights for policy 0, policy_version 40000 (0.0011) [2023-10-08 01:30:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 82444288. Throughput: 0: 1701.5, 1: 1719.9. Samples: 20614540. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:30:56,211][50642] Avg episode reward: [(0, '17.990'), (1, '20.150')] [2023-10-08 01:30:58,907][52059] Updated weights for policy 1, policy_version 40522 (0.0010) [2023-10-08 01:30:59,277][52059] Updated weights for policy 1, policy_version 40532 (0.0009) [2023-10-08 01:30:59,498][52060] Updated weights for policy 0, policy_version 40010 (0.0008) [2023-10-08 01:30:59,644][52059] Updated weights for policy 1, policy_version 40542 (0.0008) [2023-10-08 01:30:59,867][52060] Updated weights for policy 0, policy_version 40020 (0.0009) [2023-10-08 01:31:00,227][52060] Updated weights for policy 0, policy_version 40030 (0.0011) [2023-10-08 01:31:01,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 82509824. Throughput: 0: 1687.7, 1: 1711.9. Samples: 20634736. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:31:01,211][50642] Avg episode reward: [(0, '19.350'), (1, '19.230')] [2023-10-08 01:31:03,425][52059] Updated weights for policy 1, policy_version 40552 (0.0010) [2023-10-08 01:31:03,789][52059] Updated weights for policy 1, policy_version 40562 (0.0009) [2023-10-08 01:31:04,085][52060] Updated weights for policy 0, policy_version 40040 (0.0008) [2023-10-08 01:31:04,147][52059] Updated weights for policy 1, policy_version 40572 (0.0008) [2023-10-08 01:31:04,455][52060] Updated weights for policy 0, policy_version 40050 (0.0008) [2023-10-08 01:31:04,829][52060] Updated weights for policy 0, policy_version 40060 (0.0007) [2023-10-08 01:31:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 82575360. Throughput: 0: 1720.4, 1: 1730.2. Samples: 20646118. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:31:06,211][50642] Avg episode reward: [(0, '21.530'), (1, '20.780')] [2023-10-08 01:31:08,069][52059] Updated weights for policy 1, policy_version 40582 (0.0008) [2023-10-08 01:31:08,441][52059] Updated weights for policy 1, policy_version 40592 (0.0008) [2023-10-08 01:31:08,810][52059] Updated weights for policy 1, policy_version 40602 (0.0008) [2023-10-08 01:31:09,002][52060] Updated weights for policy 0, policy_version 40070 (0.0010) [2023-10-08 01:31:09,377][52060] Updated weights for policy 0, policy_version 40080 (0.0008) [2023-10-08 01:31:09,733][52060] Updated weights for policy 0, policy_version 40090 (0.0009) [2023-10-08 01:31:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 82640896. Throughput: 0: 1691.2, 1: 1717.4. Samples: 20665824. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:31:11,211][50642] Avg episode reward: [(0, '18.260'), (1, '19.610')] [2023-10-08 01:31:12,791][52059] Updated weights for policy 1, policy_version 40612 (0.0008) [2023-10-08 01:31:13,197][52059] Updated weights for policy 1, policy_version 40622 (0.0009) [2023-10-08 01:31:13,558][52059] Updated weights for policy 1, policy_version 40632 (0.0009) [2023-10-08 01:31:13,566][52060] Updated weights for policy 0, policy_version 40100 (0.0007) [2023-10-08 01:31:13,935][52060] Updated weights for policy 0, policy_version 40110 (0.0008) [2023-10-08 01:31:14,304][52060] Updated weights for policy 0, policy_version 40120 (0.0010) [2023-10-08 01:31:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 82706432. Throughput: 0: 1704.4, 1: 1730.5. Samples: 20686896. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:31:16,211][50642] Avg episode reward: [(0, '19.060'), (1, '21.370')] [2023-10-08 01:31:17,437][52059] Updated weights for policy 1, policy_version 40642 (0.0009) [2023-10-08 01:31:17,809][52059] Updated weights for policy 1, policy_version 40652 (0.0007) [2023-10-08 01:31:18,176][52059] Updated weights for policy 1, policy_version 40662 (0.0010) [2023-10-08 01:31:18,337][52060] Updated weights for policy 0, policy_version 40130 (0.0010) [2023-10-08 01:31:18,542][52059] Updated weights for policy 1, policy_version 40672 (0.0007) [2023-10-08 01:31:18,706][52060] Updated weights for policy 0, policy_version 40140 (0.0008) [2023-10-08 01:31:19,082][52060] Updated weights for policy 0, policy_version 40150 (0.0010) [2023-10-08 01:31:19,447][52060] Updated weights for policy 0, policy_version 40160 (0.0007) [2023-10-08 01:31:21,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 82771968. Throughput: 0: 1705.5, 1: 1714.8. Samples: 20696926. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:31:21,211][50642] Avg episode reward: [(0, '20.840'), (1, '21.570')] [2023-10-08 01:31:22,532][52059] Updated weights for policy 1, policy_version 40682 (0.0008) [2023-10-08 01:31:22,893][52059] Updated weights for policy 1, policy_version 40692 (0.0008) [2023-10-08 01:31:23,260][52059] Updated weights for policy 1, policy_version 40702 (0.0008) [2023-10-08 01:31:23,416][52060] Updated weights for policy 0, policy_version 40170 (0.0009) [2023-10-08 01:31:23,797][52060] Updated weights for policy 0, policy_version 40180 (0.0007) [2023-10-08 01:31:24,152][52060] Updated weights for policy 0, policy_version 40190 (0.0009) [2023-10-08 01:31:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 82837504. Throughput: 0: 1694.6, 1: 1716.8. Samples: 20717730. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:31:26,211][50642] Avg episode reward: [(0, '19.470'), (1, '20.980')] [2023-10-08 01:31:27,159][52059] Updated weights for policy 1, policy_version 40712 (0.0009) [2023-10-08 01:31:27,530][52059] Updated weights for policy 1, policy_version 40722 (0.0008) [2023-10-08 01:31:27,882][52059] Updated weights for policy 1, policy_version 40732 (0.0008) [2023-10-08 01:31:28,070][52060] Updated weights for policy 0, policy_version 40200 (0.0008) [2023-10-08 01:31:28,436][52060] Updated weights for policy 0, policy_version 40210 (0.0007) [2023-10-08 01:31:28,804][52060] Updated weights for policy 0, policy_version 40220 (0.0008) [2023-10-08 01:31:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 82903040. Throughput: 0: 1720.5, 1: 1742.1. Samples: 20739026. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:31:31,211][50642] Avg episode reward: [(0, '18.430'), (1, '19.510')] [2023-10-08 01:31:31,985][52059] Updated weights for policy 1, policy_version 40742 (0.0009) [2023-10-08 01:31:32,347][52059] Updated weights for policy 1, policy_version 40752 (0.0008) [2023-10-08 01:31:32,719][52059] Updated weights for policy 1, policy_version 40762 (0.0008) [2023-10-08 01:31:32,789][52060] Updated weights for policy 0, policy_version 40230 (0.0009) [2023-10-08 01:31:33,141][52060] Updated weights for policy 0, policy_version 40240 (0.0008) [2023-10-08 01:31:33,508][52060] Updated weights for policy 0, policy_version 40250 (0.0009) [2023-10-08 01:31:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 82968576. Throughput: 0: 1695.7, 1: 1714.3. Samples: 20748342. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:31:36,211][50642] Avg episode reward: [(0, '20.190'), (1, '21.080')] [2023-10-08 01:31:36,721][52059] Updated weights for policy 1, policy_version 40772 (0.0007) [2023-10-08 01:31:37,092][52059] Updated weights for policy 1, policy_version 40782 (0.0007) [2023-10-08 01:31:37,313][52060] Updated weights for policy 0, policy_version 40260 (0.0008) [2023-10-08 01:31:37,454][52059] Updated weights for policy 1, policy_version 40792 (0.0009) [2023-10-08 01:31:37,676][52060] Updated weights for policy 0, policy_version 40270 (0.0007) [2023-10-08 01:31:38,042][52060] Updated weights for policy 0, policy_version 40280 (0.0008) [2023-10-08 01:31:41,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 83034112. Throughput: 0: 1711.2, 1: 1739.4. Samples: 20769814. Policy #0 lag: (min: 2.0, avg: 8.0, max: 34.0) [2023-10-08 01:31:41,211][50642] Avg episode reward: [(0, '20.300'), (1, '24.200')] [2023-10-08 01:31:41,303][52059] Updated weights for policy 1, policy_version 40802 (0.0009) [2023-10-08 01:31:41,674][52059] Updated weights for policy 1, policy_version 40812 (0.0009) [2023-10-08 01:31:42,040][52059] Updated weights for policy 1, policy_version 40822 (0.0009) [2023-10-08 01:31:42,148][52060] Updated weights for policy 0, policy_version 40290 (0.0008) [2023-10-08 01:31:42,399][51710] Saving new best policy, reward=24.200! [2023-10-08 01:31:42,405][52059] Updated weights for policy 1, policy_version 40832 (0.0008) [2023-10-08 01:31:42,514][52060] Updated weights for policy 0, policy_version 40300 (0.0008) [2023-10-08 01:31:42,886][52060] Updated weights for policy 0, policy_version 40310 (0.0008) [2023-10-08 01:31:43,243][52060] Updated weights for policy 0, policy_version 40320 (0.0007) [2023-10-08 01:31:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 83099648. Throughput: 0: 1725.7, 1: 1741.2. Samples: 20790744. Policy #0 lag: (min: 2.0, avg: 8.0, max: 34.0) [2023-10-08 01:31:46,211][50642] Avg episode reward: [(0, '19.130'), (1, '20.910')] [2023-10-08 01:31:46,467][52059] Updated weights for policy 1, policy_version 40842 (0.0007) [2023-10-08 01:31:46,829][52059] Updated weights for policy 1, policy_version 40852 (0.0007) [2023-10-08 01:31:47,196][52059] Updated weights for policy 1, policy_version 40862 (0.0008) [2023-10-08 01:31:47,224][52060] Updated weights for policy 0, policy_version 40330 (0.0009) [2023-10-08 01:31:47,590][52060] Updated weights for policy 0, policy_version 40340 (0.0008) [2023-10-08 01:31:47,957][52060] Updated weights for policy 0, policy_version 40350 (0.0009) [2023-10-08 01:31:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 83165184. Throughput: 0: 1696.7, 1: 1726.0. Samples: 20800142. Policy #0 lag: (min: 2.0, avg: 8.0, max: 34.0) [2023-10-08 01:31:51,211][50642] Avg episode reward: [(0, '19.470'), (1, '19.180')] [2023-10-08 01:31:51,273][52059] Updated weights for policy 1, policy_version 40872 (0.0007) [2023-10-08 01:31:51,633][52059] Updated weights for policy 1, policy_version 40882 (0.0007) [2023-10-08 01:31:51,957][52060] Updated weights for policy 0, policy_version 40360 (0.0009) [2023-10-08 01:31:52,004][52059] Updated weights for policy 1, policy_version 40892 (0.0009) [2023-10-08 01:31:52,333][52060] Updated weights for policy 0, policy_version 40370 (0.0008) [2023-10-08 01:31:52,706][52060] Updated weights for policy 0, policy_version 40380 (0.0007) [2023-10-08 01:31:55,958][52059] Updated weights for policy 1, policy_version 40902 (0.0007) [2023-10-08 01:31:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 83230720. Throughput: 0: 1723.2, 1: 1732.7. Samples: 20821340. Policy #0 lag: (min: 2.0, avg: 8.0, max: 34.0) [2023-10-08 01:31:56,211][50642] Avg episode reward: [(0, '22.300'), (1, '22.360')] [2023-10-08 01:31:56,332][52059] Updated weights for policy 1, policy_version 40912 (0.0008) [2023-10-08 01:31:56,557][52060] Updated weights for policy 0, policy_version 40390 (0.0008) [2023-10-08 01:31:56,690][52059] Updated weights for policy 1, policy_version 40922 (0.0007) [2023-10-08 01:31:56,929][52060] Updated weights for policy 0, policy_version 40400 (0.0010) [2023-10-08 01:31:57,304][52060] Updated weights for policy 0, policy_version 40410 (0.0010) [2023-10-08 01:32:00,547][52059] Updated weights for policy 1, policy_version 40932 (0.0008) [2023-10-08 01:32:00,940][52059] Updated weights for policy 1, policy_version 40942 (0.0007) [2023-10-08 01:32:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 83296256. Throughput: 0: 1728.3, 1: 1723.3. Samples: 20842218. Policy #0 lag: (min: 2.0, avg: 8.0, max: 34.0) [2023-10-08 01:32:01,211][50642] Avg episode reward: [(0, '18.700'), (1, '18.850')] [2023-10-08 01:32:01,313][52059] Updated weights for policy 1, policy_version 40952 (0.0007) [2023-10-08 01:32:01,339][52060] Updated weights for policy 0, policy_version 40420 (0.0010) [2023-10-08 01:32:01,704][52060] Updated weights for policy 0, policy_version 40430 (0.0010) [2023-10-08 01:32:02,069][52060] Updated weights for policy 0, policy_version 40440 (0.0009) [2023-10-08 01:32:05,132][52059] Updated weights for policy 1, policy_version 40962 (0.0007) [2023-10-08 01:32:05,493][52059] Updated weights for policy 1, policy_version 40972 (0.0009) [2023-10-08 01:32:05,856][52059] Updated weights for policy 1, policy_version 40982 (0.0009) [2023-10-08 01:32:06,034][52060] Updated weights for policy 0, policy_version 40450 (0.0008) [2023-10-08 01:32:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 83361792. Throughput: 0: 1712.6, 1: 1738.4. Samples: 20852218. Policy #0 lag: (min: 2.0, avg: 8.0, max: 34.0) [2023-10-08 01:32:06,211][50642] Avg episode reward: [(0, '18.700'), (1, '15.230')] [2023-10-08 01:32:06,219][52059] Updated weights for policy 1, policy_version 40992 (0.0008) [2023-10-08 01:32:06,401][52060] Updated weights for policy 0, policy_version 40460 (0.0009) [2023-10-08 01:32:06,773][52060] Updated weights for policy 0, policy_version 40470 (0.0008) [2023-10-08 01:32:07,144][52060] Updated weights for policy 0, policy_version 40480 (0.0008) [2023-10-08 01:32:09,975][52059] Updated weights for policy 1, policy_version 41002 (0.0007) [2023-10-08 01:32:10,338][52059] Updated weights for policy 1, policy_version 41012 (0.0007) [2023-10-08 01:32:10,710][52059] Updated weights for policy 1, policy_version 41022 (0.0009) [2023-10-08 01:32:11,078][52060] Updated weights for policy 0, policy_version 40490 (0.0010) [2023-10-08 01:32:11,210][50642] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 83460096. Throughput: 0: 1726.4, 1: 1733.0. Samples: 20873404. Policy #0 lag: (min: 17.0, avg: 21.9, max: 49.0) [2023-10-08 01:32:11,211][50642] Avg episode reward: [(0, '21.300'), (1, '18.340')] [2023-10-08 01:32:11,446][52060] Updated weights for policy 0, policy_version 40500 (0.0008) [2023-10-08 01:32:11,811][52060] Updated weights for policy 0, policy_version 40510 (0.0009) [2023-10-08 01:32:14,746][52059] Updated weights for policy 1, policy_version 41032 (0.0009) [2023-10-08 01:32:15,114][52059] Updated weights for policy 1, policy_version 41042 (0.0009) [2023-10-08 01:32:15,477][52059] Updated weights for policy 1, policy_version 41052 (0.0009) [2023-10-08 01:32:15,752][52060] Updated weights for policy 0, policy_version 40520 (0.0009) [2023-10-08 01:32:16,132][52060] Updated weights for policy 0, policy_version 40530 (0.0008) [2023-10-08 01:32:16,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 83525632. Throughput: 0: 1716.5, 1: 1707.4. Samples: 20893104. Policy #0 lag: (min: 17.0, avg: 21.9, max: 49.0) [2023-10-08 01:32:16,211][50642] Avg episode reward: [(0, '19.570'), (1, '18.500')] [2023-10-08 01:32:16,508][52060] Updated weights for policy 0, policy_version 40540 (0.0008) [2023-10-08 01:32:19,464][52059] Updated weights for policy 1, policy_version 41062 (0.0007) [2023-10-08 01:32:19,827][52059] Updated weights for policy 1, policy_version 41072 (0.0008) [2023-10-08 01:32:20,200][52059] Updated weights for policy 1, policy_version 41082 (0.0008) [2023-10-08 01:32:20,523][52060] Updated weights for policy 0, policy_version 40550 (0.0009) [2023-10-08 01:32:20,894][52060] Updated weights for policy 0, policy_version 40560 (0.0008) [2023-10-08 01:32:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 83591168. Throughput: 0: 1723.3, 1: 1736.9. Samples: 20904052. Policy #0 lag: (min: 17.0, avg: 21.9, max: 49.0) [2023-10-08 01:32:21,211][50642] Avg episode reward: [(0, '18.340'), (1, '19.910')] [2023-10-08 01:32:21,266][52060] Updated weights for policy 0, policy_version 40570 (0.0008) [2023-10-08 01:32:24,287][52059] Updated weights for policy 1, policy_version 41092 (0.0009) [2023-10-08 01:32:24,654][52059] Updated weights for policy 1, policy_version 41102 (0.0008) [2023-10-08 01:32:25,023][52059] Updated weights for policy 1, policy_version 41112 (0.0009) [2023-10-08 01:32:25,399][52060] Updated weights for policy 0, policy_version 40580 (0.0008) [2023-10-08 01:32:25,765][52060] Updated weights for policy 0, policy_version 40590 (0.0008) [2023-10-08 01:32:26,140][52060] Updated weights for policy 0, policy_version 40600 (0.0007) [2023-10-08 01:32:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 83656704. Throughput: 0: 1718.1, 1: 1713.6. Samples: 20924242. Policy #0 lag: (min: 17.0, avg: 21.9, max: 49.0) [2023-10-08 01:32:26,211][50642] Avg episode reward: [(0, '20.810'), (1, '17.790')] [2023-10-08 01:32:28,860][52059] Updated weights for policy 1, policy_version 41122 (0.0008) [2023-10-08 01:32:29,232][52059] Updated weights for policy 1, policy_version 41132 (0.0008) [2023-10-08 01:32:29,595][52059] Updated weights for policy 1, policy_version 41142 (0.0008) [2023-10-08 01:32:29,962][52059] Updated weights for policy 1, policy_version 41152 (0.0008) [2023-10-08 01:32:30,062][52060] Updated weights for policy 0, policy_version 40610 (0.0008) [2023-10-08 01:32:30,438][52060] Updated weights for policy 0, policy_version 40620 (0.0007) [2023-10-08 01:32:30,798][52060] Updated weights for policy 0, policy_version 40630 (0.0008) [2023-10-08 01:32:31,172][52060] Updated weights for policy 0, policy_version 40640 (0.0010) [2023-10-08 01:32:31,210][50642] Fps is (10 sec: 16383.6, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 83755008. Throughput: 0: 1703.9, 1: 1708.8. Samples: 20944320. Policy #0 lag: (min: 17.0, avg: 21.9, max: 49.0) [2023-10-08 01:32:31,211][50642] Avg episode reward: [(0, '19.300'), (1, '20.450')] [2023-10-08 01:32:33,954][52059] Updated weights for policy 1, policy_version 41162 (0.0008) [2023-10-08 01:32:34,323][52059] Updated weights for policy 1, policy_version 41172 (0.0009) [2023-10-08 01:32:34,685][52059] Updated weights for policy 1, policy_version 41182 (0.0008) [2023-10-08 01:32:35,247][52060] Updated weights for policy 0, policy_version 40650 (0.0010) [2023-10-08 01:32:35,611][52060] Updated weights for policy 0, policy_version 40660 (0.0008) [2023-10-08 01:32:35,975][52060] Updated weights for policy 0, policy_version 40670 (0.0008) [2023-10-08 01:32:36,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 83820544. Throughput: 0: 1718.9, 1: 1734.4. Samples: 20955542. Policy #0 lag: (min: 18.0, avg: 33.9, max: 50.0) [2023-10-08 01:32:36,211][50642] Avg episode reward: [(0, '17.530'), (1, '20.930')] [2023-10-08 01:32:38,687][52059] Updated weights for policy 1, policy_version 41192 (0.0009) [2023-10-08 01:32:39,047][52059] Updated weights for policy 1, policy_version 41202 (0.0010) [2023-10-08 01:32:39,403][52059] Updated weights for policy 1, policy_version 41212 (0.0008) [2023-10-08 01:32:39,842][52060] Updated weights for policy 0, policy_version 40680 (0.0007) [2023-10-08 01:32:40,202][52060] Updated weights for policy 0, policy_version 40690 (0.0009) [2023-10-08 01:32:40,579][52060] Updated weights for policy 0, policy_version 40700 (0.0010) [2023-10-08 01:32:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 83886080. Throughput: 0: 1714.1, 1: 1714.0. Samples: 20975602. Policy #0 lag: (min: 18.0, avg: 33.9, max: 50.0) [2023-10-08 01:32:41,211][50642] Avg episode reward: [(0, '21.110'), (1, '18.590')] [2023-10-08 01:32:43,276][52059] Updated weights for policy 1, policy_version 41222 (0.0009) [2023-10-08 01:32:43,640][52059] Updated weights for policy 1, policy_version 41232 (0.0009) [2023-10-08 01:32:44,003][52059] Updated weights for policy 1, policy_version 41242 (0.0010) [2023-10-08 01:32:44,402][52060] Updated weights for policy 0, policy_version 40710 (0.0008) [2023-10-08 01:32:44,782][52060] Updated weights for policy 0, policy_version 40720 (0.0007) [2023-10-08 01:32:45,142][52060] Updated weights for policy 0, policy_version 40730 (0.0007) [2023-10-08 01:32:46,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 83951616. Throughput: 0: 1689.9, 1: 1731.3. Samples: 20996174. Policy #0 lag: (min: 18.0, avg: 33.9, max: 50.0) [2023-10-08 01:32:46,211][50642] Avg episode reward: [(0, '21.030'), (1, '19.990')] [2023-10-08 01:32:46,221][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000040736_41713664.pth... [2023-10-08 01:32:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000041248_42237952.pth... [2023-10-08 01:32:46,262][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000039136_40075264.pth [2023-10-08 01:32:46,263][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000039648_40599552.pth [2023-10-08 01:32:47,978][52059] Updated weights for policy 1, policy_version 41252 (0.0008) [2023-10-08 01:32:48,344][52059] Updated weights for policy 1, policy_version 41262 (0.0007) [2023-10-08 01:32:48,706][52059] Updated weights for policy 1, policy_version 41272 (0.0008) [2023-10-08 01:32:49,146][52060] Updated weights for policy 0, policy_version 40740 (0.0010) [2023-10-08 01:32:49,519][52060] Updated weights for policy 0, policy_version 40750 (0.0007) [2023-10-08 01:32:49,880][52060] Updated weights for policy 0, policy_version 40760 (0.0009) [2023-10-08 01:32:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 84017152. Throughput: 0: 1723.1, 1: 1720.3. Samples: 21007172. Policy #0 lag: (min: 18.0, avg: 33.9, max: 50.0) [2023-10-08 01:32:51,211][50642] Avg episode reward: [(0, '18.020'), (1, '22.980')] [2023-10-08 01:32:52,552][52059] Updated weights for policy 1, policy_version 41282 (0.0008) [2023-10-08 01:32:52,912][52059] Updated weights for policy 1, policy_version 41292 (0.0007) [2023-10-08 01:32:53,273][52059] Updated weights for policy 1, policy_version 41302 (0.0007) [2023-10-08 01:32:53,631][52059] Updated weights for policy 1, policy_version 41312 (0.0007) [2023-10-08 01:32:53,930][52060] Updated weights for policy 0, policy_version 40770 (0.0009) [2023-10-08 01:32:54,296][52060] Updated weights for policy 0, policy_version 40780 (0.0007) [2023-10-08 01:32:54,668][52060] Updated weights for policy 0, policy_version 40790 (0.0008) [2023-10-08 01:32:55,043][52060] Updated weights for policy 0, policy_version 40800 (0.0010) [2023-10-08 01:32:56,210][50642] Fps is (10 sec: 13107.7, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 84082688. Throughput: 0: 1696.1, 1: 1725.0. Samples: 21027354. Policy #0 lag: (min: 18.0, avg: 33.9, max: 50.0) [2023-10-08 01:32:56,211][50642] Avg episode reward: [(0, '20.300'), (1, '20.250')] [2023-10-08 01:32:57,447][52059] Updated weights for policy 1, policy_version 41322 (0.0008) [2023-10-08 01:32:57,828][52059] Updated weights for policy 1, policy_version 41332 (0.0008) [2023-10-08 01:32:58,186][52059] Updated weights for policy 1, policy_version 41342 (0.0010) [2023-10-08 01:32:59,022][52060] Updated weights for policy 0, policy_version 40810 (0.0008) [2023-10-08 01:32:59,385][52060] Updated weights for policy 0, policy_version 40820 (0.0007) [2023-10-08 01:32:59,758][52060] Updated weights for policy 0, policy_version 40830 (0.0007) [2023-10-08 01:33:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 84148224. Throughput: 0: 1699.5, 1: 1751.1. Samples: 21048382. Policy #0 lag: (min: 18.0, avg: 33.9, max: 50.0) [2023-10-08 01:33:01,211][50642] Avg episode reward: [(0, '21.500'), (1, '18.640')] [2023-10-08 01:33:02,147][52059] Updated weights for policy 1, policy_version 41352 (0.0009) [2023-10-08 01:33:02,517][52059] Updated weights for policy 1, policy_version 41362 (0.0010) [2023-10-08 01:33:02,883][52059] Updated weights for policy 1, policy_version 41372 (0.0007) [2023-10-08 01:33:03,606][52060] Updated weights for policy 0, policy_version 40840 (0.0008) [2023-10-08 01:33:03,977][52060] Updated weights for policy 0, policy_version 40850 (0.0007) [2023-10-08 01:33:04,344][52060] Updated weights for policy 0, policy_version 40860 (0.0009) [2023-10-08 01:33:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 84213760. Throughput: 0: 1712.3, 1: 1717.2. Samples: 21058378. Policy #0 lag: (min: 0.0, avg: 13.2, max: 32.0) [2023-10-08 01:33:06,211][50642] Avg episode reward: [(0, '18.380'), (1, '20.780')] [2023-10-08 01:33:06,737][52059] Updated weights for policy 1, policy_version 41382 (0.0007) [2023-10-08 01:33:07,116][52059] Updated weights for policy 1, policy_version 41392 (0.0010) [2023-10-08 01:33:07,477][52059] Updated weights for policy 1, policy_version 41402 (0.0009) [2023-10-08 01:33:08,462][52060] Updated weights for policy 0, policy_version 40870 (0.0010) [2023-10-08 01:33:08,836][52060] Updated weights for policy 0, policy_version 40880 (0.0008) [2023-10-08 01:33:09,203][52060] Updated weights for policy 0, policy_version 40890 (0.0010) [2023-10-08 01:33:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 84279296. Throughput: 0: 1697.4, 1: 1738.4. Samples: 21078854. Policy #0 lag: (min: 0.0, avg: 13.2, max: 32.0) [2023-10-08 01:33:11,211][50642] Avg episode reward: [(0, '19.760'), (1, '22.720')] [2023-10-08 01:33:11,558][52059] Updated weights for policy 1, policy_version 41412 (0.0007) [2023-10-08 01:33:11,923][52059] Updated weights for policy 1, policy_version 41422 (0.0008) [2023-10-08 01:33:12,292][52059] Updated weights for policy 1, policy_version 41432 (0.0007) [2023-10-08 01:33:13,011][52060] Updated weights for policy 0, policy_version 40900 (0.0008) [2023-10-08 01:33:13,386][52060] Updated weights for policy 0, policy_version 40910 (0.0010) [2023-10-08 01:33:13,752][52060] Updated weights for policy 0, policy_version 40920 (0.0010) [2023-10-08 01:33:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 84344832. Throughput: 0: 1720.1, 1: 1747.6. Samples: 21100368. Policy #0 lag: (min: 0.0, avg: 13.2, max: 32.0) [2023-10-08 01:33:16,211][50642] Avg episode reward: [(0, '21.840'), (1, '19.030')] [2023-10-08 01:33:16,221][52059] Updated weights for policy 1, policy_version 41442 (0.0008) [2023-10-08 01:33:16,584][52059] Updated weights for policy 1, policy_version 41452 (0.0007) [2023-10-08 01:33:16,951][52059] Updated weights for policy 1, policy_version 41462 (0.0008) [2023-10-08 01:33:17,313][52059] Updated weights for policy 1, policy_version 41472 (0.0008) [2023-10-08 01:33:17,570][52060] Updated weights for policy 0, policy_version 40930 (0.0010) [2023-10-08 01:33:17,941][52060] Updated weights for policy 0, policy_version 40940 (0.0008) [2023-10-08 01:33:18,308][52060] Updated weights for policy 0, policy_version 40950 (0.0011) [2023-10-08 01:33:18,673][52060] Updated weights for policy 0, policy_version 40960 (0.0007) [2023-10-08 01:33:21,162][52059] Updated weights for policy 1, policy_version 41482 (0.0008) [2023-10-08 01:33:21,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 84410368. Throughput: 0: 1705.0, 1: 1722.7. Samples: 21109792. Policy #0 lag: (min: 0.0, avg: 13.2, max: 32.0) [2023-10-08 01:33:21,211][50642] Avg episode reward: [(0, '18.300'), (1, '17.760')] [2023-10-08 01:33:21,525][52059] Updated weights for policy 1, policy_version 41492 (0.0009) [2023-10-08 01:33:21,898][52059] Updated weights for policy 1, policy_version 41502 (0.0009) [2023-10-08 01:33:22,748][52060] Updated weights for policy 0, policy_version 40970 (0.0009) [2023-10-08 01:33:23,108][52060] Updated weights for policy 0, policy_version 40980 (0.0008) [2023-10-08 01:33:23,471][52060] Updated weights for policy 0, policy_version 40990 (0.0007) [2023-10-08 01:33:25,687][52059] Updated weights for policy 1, policy_version 41512 (0.0009) [2023-10-08 01:33:26,049][52059] Updated weights for policy 1, policy_version 41522 (0.0007) [2023-10-08 01:33:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 84475904. Throughput: 0: 1709.5, 1: 1747.8. Samples: 21131182. Policy #0 lag: (min: 0.0, avg: 13.2, max: 32.0) [2023-10-08 01:33:26,211][50642] Avg episode reward: [(0, '19.710'), (1, '23.020')] [2023-10-08 01:33:26,414][52059] Updated weights for policy 1, policy_version 41532 (0.0007) [2023-10-08 01:33:27,586][52060] Updated weights for policy 0, policy_version 41000 (0.0007) [2023-10-08 01:33:27,959][52060] Updated weights for policy 0, policy_version 41010 (0.0007) [2023-10-08 01:33:28,325][52060] Updated weights for policy 0, policy_version 41020 (0.0007) [2023-10-08 01:33:30,292][52059] Updated weights for policy 1, policy_version 41542 (0.0008) [2023-10-08 01:33:30,659][52059] Updated weights for policy 1, policy_version 41552 (0.0009) [2023-10-08 01:33:31,023][52059] Updated weights for policy 1, policy_version 41562 (0.0011) [2023-10-08 01:33:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 84541440. Throughput: 0: 1728.4, 1: 1727.2. Samples: 21151676. Policy #0 lag: (min: 0.0, avg: 13.2, max: 32.0) [2023-10-08 01:33:31,211][50642] Avg episode reward: [(0, '21.860'), (1, '22.310')] [2023-10-08 01:33:32,415][52060] Updated weights for policy 0, policy_version 41030 (0.0009) [2023-10-08 01:33:32,795][52060] Updated weights for policy 0, policy_version 41040 (0.0009) [2023-10-08 01:33:33,163][52060] Updated weights for policy 0, policy_version 41050 (0.0009) [2023-10-08 01:33:35,016][52059] Updated weights for policy 1, policy_version 41572 (0.0008) [2023-10-08 01:33:35,417][52059] Updated weights for policy 1, policy_version 41582 (0.0008) [2023-10-08 01:33:35,776][52059] Updated weights for policy 1, policy_version 41592 (0.0008) [2023-10-08 01:33:36,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 84639744. Throughput: 0: 1690.6, 1: 1744.0. Samples: 21161728. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:33:36,211][50642] Avg episode reward: [(0, '18.330'), (1, '18.440')] [2023-10-08 01:33:37,031][52060] Updated weights for policy 0, policy_version 41060 (0.0008) [2023-10-08 01:33:37,402][52060] Updated weights for policy 0, policy_version 41070 (0.0010) [2023-10-08 01:33:37,780][52060] Updated weights for policy 0, policy_version 41080 (0.0008) [2023-10-08 01:33:39,836][52059] Updated weights for policy 1, policy_version 41602 (0.0009) [2023-10-08 01:33:40,206][52059] Updated weights for policy 1, policy_version 41612 (0.0007) [2023-10-08 01:33:40,569][52059] Updated weights for policy 1, policy_version 41622 (0.0007) [2023-10-08 01:33:40,931][52059] Updated weights for policy 1, policy_version 41632 (0.0007) [2023-10-08 01:33:41,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 84705280. Throughput: 0: 1720.2, 1: 1739.0. Samples: 21183018. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:33:41,211][50642] Avg episode reward: [(0, '19.120'), (1, '20.960')] [2023-10-08 01:33:41,882][52060] Updated weights for policy 0, policy_version 41090 (0.0009) [2023-10-08 01:33:42,252][52060] Updated weights for policy 0, policy_version 41100 (0.0009) [2023-10-08 01:33:42,625][52060] Updated weights for policy 0, policy_version 41110 (0.0008) [2023-10-08 01:33:42,995][52060] Updated weights for policy 0, policy_version 41120 (0.0009) [2023-10-08 01:33:44,792][52059] Updated weights for policy 1, policy_version 41642 (0.0010) [2023-10-08 01:33:45,155][52059] Updated weights for policy 1, policy_version 41652 (0.0009) [2023-10-08 01:33:45,525][52059] Updated weights for policy 1, policy_version 41662 (0.0007) [2023-10-08 01:33:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 84770816. Throughput: 0: 1723.1, 1: 1716.8. Samples: 21203174. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:33:46,211][50642] Avg episode reward: [(0, '22.260'), (1, '21.870')] [2023-10-08 01:33:46,928][52060] Updated weights for policy 0, policy_version 41130 (0.0008) [2023-10-08 01:33:47,298][52060] Updated weights for policy 0, policy_version 41140 (0.0009) [2023-10-08 01:33:47,672][52060] Updated weights for policy 0, policy_version 41150 (0.0010) [2023-10-08 01:33:49,356][52059] Updated weights for policy 1, policy_version 41672 (0.0007) [2023-10-08 01:33:49,719][52059] Updated weights for policy 1, policy_version 41682 (0.0007) [2023-10-08 01:33:50,075][52059] Updated weights for policy 1, policy_version 41692 (0.0010) [2023-10-08 01:33:51,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 84836352. Throughput: 0: 1701.9, 1: 1753.6. Samples: 21213876. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:33:51,211][50642] Avg episode reward: [(0, '18.160'), (1, '18.670')] [2023-10-08 01:33:51,558][52060] Updated weights for policy 0, policy_version 41160 (0.0010) [2023-10-08 01:33:51,920][52060] Updated weights for policy 0, policy_version 41170 (0.0009) [2023-10-08 01:33:52,298][52060] Updated weights for policy 0, policy_version 41180 (0.0009) [2023-10-08 01:33:53,905][52059] Updated weights for policy 1, policy_version 41702 (0.0008) [2023-10-08 01:33:54,266][52059] Updated weights for policy 1, policy_version 41712 (0.0008) [2023-10-08 01:33:54,629][52059] Updated weights for policy 1, policy_version 41722 (0.0007) [2023-10-08 01:33:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 84901888. Throughput: 0: 1719.2, 1: 1727.2. Samples: 21233944. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:33:56,211][50642] Avg episode reward: [(0, '18.530'), (1, '17.190')] [2023-10-08 01:33:56,260][52060] Updated weights for policy 0, policy_version 41190 (0.0008) [2023-10-08 01:33:56,631][52060] Updated weights for policy 0, policy_version 41200 (0.0008) [2023-10-08 01:33:56,999][52060] Updated weights for policy 0, policy_version 41210 (0.0008) [2023-10-08 01:33:58,568][52059] Updated weights for policy 1, policy_version 41732 (0.0009) [2023-10-08 01:33:58,941][52059] Updated weights for policy 1, policy_version 41742 (0.0010) [2023-10-08 01:33:59,305][52059] Updated weights for policy 1, policy_version 41752 (0.0008) [2023-10-08 01:34:01,177][52060] Updated weights for policy 0, policy_version 41220 (0.0008) [2023-10-08 01:34:01,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 84967424. Throughput: 0: 1710.4, 1: 1727.1. Samples: 21255060. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:34:01,211][50642] Avg episode reward: [(0, '22.940'), (1, '22.560')] [2023-10-08 01:34:01,547][52060] Updated weights for policy 0, policy_version 41230 (0.0008) [2023-10-08 01:34:01,907][52060] Updated weights for policy 0, policy_version 41240 (0.0007) [2023-10-08 01:34:03,262][52059] Updated weights for policy 1, policy_version 41762 (0.0010) [2023-10-08 01:34:03,626][52059] Updated weights for policy 1, policy_version 41772 (0.0009) [2023-10-08 01:34:03,996][52059] Updated weights for policy 1, policy_version 41782 (0.0011) [2023-10-08 01:34:04,362][52059] Updated weights for policy 1, policy_version 41792 (0.0007) [2023-10-08 01:34:06,049][52060] Updated weights for policy 0, policy_version 41250 (0.0007) [2023-10-08 01:34:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 85032960. Throughput: 0: 1707.2, 1: 1740.5. Samples: 21264942. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:34:06,211][50642] Avg episode reward: [(0, '17.610'), (1, '20.290')] [2023-10-08 01:34:06,426][52060] Updated weights for policy 0, policy_version 41260 (0.0008) [2023-10-08 01:34:06,790][52060] Updated weights for policy 0, policy_version 41270 (0.0008) [2023-10-08 01:34:07,162][52060] Updated weights for policy 0, policy_version 41280 (0.0008) [2023-10-08 01:34:08,145][52059] Updated weights for policy 1, policy_version 41802 (0.0008) [2023-10-08 01:34:08,517][52059] Updated weights for policy 1, policy_version 41812 (0.0009) [2023-10-08 01:34:08,881][52059] Updated weights for policy 1, policy_version 41822 (0.0010) [2023-10-08 01:34:11,199][52060] Updated weights for policy 0, policy_version 41290 (0.0010) [2023-10-08 01:34:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 85098496. Throughput: 0: 1710.4, 1: 1730.6. Samples: 21286030. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:34:11,211][50642] Avg episode reward: [(0, '18.210'), (1, '17.820')] [2023-10-08 01:34:11,562][52060] Updated weights for policy 0, policy_version 41300 (0.0010) [2023-10-08 01:34:11,940][52060] Updated weights for policy 0, policy_version 41310 (0.0010) [2023-10-08 01:34:12,763][52059] Updated weights for policy 1, policy_version 41832 (0.0007) [2023-10-08 01:34:13,131][52059] Updated weights for policy 1, policy_version 41842 (0.0007) [2023-10-08 01:34:13,486][52059] Updated weights for policy 1, policy_version 41852 (0.0008) [2023-10-08 01:34:16,023][52060] Updated weights for policy 0, policy_version 41320 (0.0007) [2023-10-08 01:34:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 85164032. Throughput: 0: 1705.4, 1: 1745.7. Samples: 21306976. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:34:16,211][50642] Avg episode reward: [(0, '21.230'), (1, '21.410')] [2023-10-08 01:34:16,402][52060] Updated weights for policy 0, policy_version 41330 (0.0008) [2023-10-08 01:34:16,773][52060] Updated weights for policy 0, policy_version 41340 (0.0007) [2023-10-08 01:34:17,540][52059] Updated weights for policy 1, policy_version 41862 (0.0009) [2023-10-08 01:34:17,904][52059] Updated weights for policy 1, policy_version 41872 (0.0008) [2023-10-08 01:34:18,261][52059] Updated weights for policy 1, policy_version 41882 (0.0010) [2023-10-08 01:34:20,620][52060] Updated weights for policy 0, policy_version 41350 (0.0011) [2023-10-08 01:34:20,997][52060] Updated weights for policy 0, policy_version 41360 (0.0008) [2023-10-08 01:34:21,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 85229568. Throughput: 0: 1713.8, 1: 1726.6. Samples: 21316544. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:34:21,211][50642] Avg episode reward: [(0, '21.320'), (1, '23.360')] [2023-10-08 01:34:21,365][52060] Updated weights for policy 0, policy_version 41370 (0.0008) [2023-10-08 01:34:22,331][52059] Updated weights for policy 1, policy_version 41892 (0.0009) [2023-10-08 01:34:22,697][52059] Updated weights for policy 1, policy_version 41902 (0.0009) [2023-10-08 01:34:23,057][52059] Updated weights for policy 1, policy_version 41912 (0.0007) [2023-10-08 01:34:25,303][52060] Updated weights for policy 0, policy_version 41380 (0.0010) [2023-10-08 01:34:25,669][52060] Updated weights for policy 0, policy_version 41390 (0.0010) [2023-10-08 01:34:26,042][52060] Updated weights for policy 0, policy_version 41400 (0.0010) [2023-10-08 01:34:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 85295104. Throughput: 0: 1709.1, 1: 1727.0. Samples: 21337640. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:34:26,211][50642] Avg episode reward: [(0, '17.570'), (1, '17.400')] [2023-10-08 01:34:26,888][52059] Updated weights for policy 1, policy_version 41922 (0.0008) [2023-10-08 01:34:27,301][52059] Updated weights for policy 1, policy_version 41932 (0.0007) [2023-10-08 01:34:27,678][52059] Updated weights for policy 1, policy_version 41942 (0.0011) [2023-10-08 01:34:28,039][52059] Updated weights for policy 1, policy_version 41952 (0.0009) [2023-10-08 01:34:29,823][52060] Updated weights for policy 0, policy_version 41410 (0.0008) [2023-10-08 01:34:30,191][52060] Updated weights for policy 0, policy_version 41420 (0.0007) [2023-10-08 01:34:30,575][52060] Updated weights for policy 0, policy_version 41430 (0.0010) [2023-10-08 01:34:30,946][52060] Updated weights for policy 0, policy_version 41440 (0.0009) [2023-10-08 01:34:31,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 85393408. Throughput: 0: 1688.1, 1: 1751.5. Samples: 21357960. Policy #0 lag: (min: 17.0, avg: 37.1, max: 49.0) [2023-10-08 01:34:31,211][50642] Avg episode reward: [(0, '19.760'), (1, '18.740')] [2023-10-08 01:34:31,935][52059] Updated weights for policy 1, policy_version 41962 (0.0007) [2023-10-08 01:34:32,307][52059] Updated weights for policy 1, policy_version 41972 (0.0011) [2023-10-08 01:34:32,672][52059] Updated weights for policy 1, policy_version 41982 (0.0008) [2023-10-08 01:34:34,935][52060] Updated weights for policy 0, policy_version 41450 (0.0007) [2023-10-08 01:34:35,296][52060] Updated weights for policy 0, policy_version 41460 (0.0010) [2023-10-08 01:34:35,664][52060] Updated weights for policy 0, policy_version 41470 (0.0009) [2023-10-08 01:34:36,210][50642] Fps is (10 sec: 16384.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 85458944. Throughput: 0: 1712.6, 1: 1719.9. Samples: 21368336. Policy #0 lag: (min: 17.0, avg: 37.1, max: 49.0) [2023-10-08 01:34:36,211][50642] Avg episode reward: [(0, '21.220'), (1, '22.570')] [2023-10-08 01:34:36,541][52059] Updated weights for policy 1, policy_version 41992 (0.0010) [2023-10-08 01:34:36,914][52059] Updated weights for policy 1, policy_version 42002 (0.0008) [2023-10-08 01:34:37,276][52059] Updated weights for policy 1, policy_version 42012 (0.0010) [2023-10-08 01:34:39,796][52060] Updated weights for policy 0, policy_version 41480 (0.0008) [2023-10-08 01:34:40,177][52060] Updated weights for policy 0, policy_version 41490 (0.0008) [2023-10-08 01:34:40,544][52060] Updated weights for policy 0, policy_version 41500 (0.0008) [2023-10-08 01:34:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 85524480. Throughput: 0: 1704.4, 1: 1745.9. Samples: 21389208. Policy #0 lag: (min: 17.0, avg: 37.1, max: 49.0) [2023-10-08 01:34:41,211][50642] Avg episode reward: [(0, '18.580'), (1, '21.120')] [2023-10-08 01:34:41,236][52059] Updated weights for policy 1, policy_version 42022 (0.0008) [2023-10-08 01:34:41,596][52059] Updated weights for policy 1, policy_version 42032 (0.0009) [2023-10-08 01:34:41,967][52059] Updated weights for policy 1, policy_version 42042 (0.0009) [2023-10-08 01:34:44,456][52060] Updated weights for policy 0, policy_version 41510 (0.0009) [2023-10-08 01:34:44,822][52060] Updated weights for policy 0, policy_version 41520 (0.0007) [2023-10-08 01:34:45,186][52060] Updated weights for policy 0, policy_version 41530 (0.0007) [2023-10-08 01:34:45,882][52059] Updated weights for policy 1, policy_version 42052 (0.0007) [2023-10-08 01:34:46,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 85590016. Throughput: 0: 1690.7, 1: 1745.3. Samples: 21409680. Policy #0 lag: (min: 17.0, avg: 37.1, max: 49.0) [2023-10-08 01:34:46,211][50642] Avg episode reward: [(0, '20.320'), (1, '17.950')] [2023-10-08 01:34:46,222][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000041536_42532864.pth... [2023-10-08 01:34:46,249][52059] Updated weights for policy 1, policy_version 42062 (0.0007) [2023-10-08 01:34:46,250][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000039936_40894464.pth [2023-10-08 01:34:46,611][52059] Updated weights for policy 1, policy_version 42072 (0.0007) [2023-10-08 01:34:46,895][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000042080_43089920.pth... [2023-10-08 01:34:46,933][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000040448_41418752.pth [2023-10-08 01:34:49,138][52060] Updated weights for policy 0, policy_version 41540 (0.0010) [2023-10-08 01:34:49,507][52060] Updated weights for policy 0, policy_version 41550 (0.0008) [2023-10-08 01:34:49,869][52060] Updated weights for policy 0, policy_version 41560 (0.0007) [2023-10-08 01:34:50,645][52059] Updated weights for policy 1, policy_version 42082 (0.0008) [2023-10-08 01:34:51,014][52059] Updated weights for policy 1, policy_version 42092 (0.0007) [2023-10-08 01:34:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 85655552. Throughput: 0: 1722.9, 1: 1734.0. Samples: 21420500. Policy #0 lag: (min: 17.0, avg: 37.1, max: 49.0) [2023-10-08 01:34:51,211][50642] Avg episode reward: [(0, '21.560'), (1, '21.680')] [2023-10-08 01:34:51,371][52059] Updated weights for policy 1, policy_version 42102 (0.0007) [2023-10-08 01:34:51,738][52059] Updated weights for policy 1, policy_version 42112 (0.0009) [2023-10-08 01:34:53,872][52060] Updated weights for policy 0, policy_version 41570 (0.0009) [2023-10-08 01:34:54,245][52060] Updated weights for policy 0, policy_version 41580 (0.0007) [2023-10-08 01:34:54,614][52060] Updated weights for policy 0, policy_version 41590 (0.0007) [2023-10-08 01:34:54,987][52060] Updated weights for policy 0, policy_version 41600 (0.0007) [2023-10-08 01:34:55,723][52059] Updated weights for policy 1, policy_version 42122 (0.0009) [2023-10-08 01:34:56,083][52059] Updated weights for policy 1, policy_version 42132 (0.0007) [2023-10-08 01:34:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 85721088. Throughput: 0: 1695.2, 1: 1743.1. Samples: 21440756. Policy #0 lag: (min: 17.0, avg: 37.1, max: 49.0) [2023-10-08 01:34:56,211][50642] Avg episode reward: [(0, '18.620'), (1, '22.690')] [2023-10-08 01:34:56,445][52059] Updated weights for policy 1, policy_version 42142 (0.0008) [2023-10-08 01:34:58,994][52060] Updated weights for policy 0, policy_version 41610 (0.0008) [2023-10-08 01:34:59,361][52060] Updated weights for policy 0, policy_version 41620 (0.0007) [2023-10-08 01:34:59,732][52060] Updated weights for policy 0, policy_version 41630 (0.0007) [2023-10-08 01:35:00,418][52059] Updated weights for policy 1, policy_version 42152 (0.0008) [2023-10-08 01:35:00,778][52059] Updated weights for policy 1, policy_version 42162 (0.0009) [2023-10-08 01:35:01,142][52059] Updated weights for policy 1, policy_version 42172 (0.0010) [2023-10-08 01:35:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 85786624. Throughput: 0: 1701.2, 1: 1727.9. Samples: 21461286. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:35:01,211][50642] Avg episode reward: [(0, '20.130'), (1, '18.970')] [2023-10-08 01:35:03,700][52060] Updated weights for policy 0, policy_version 41640 (0.0008) [2023-10-08 01:35:04,062][52060] Updated weights for policy 0, policy_version 41650 (0.0009) [2023-10-08 01:35:04,429][52060] Updated weights for policy 0, policy_version 41660 (0.0008) [2023-10-08 01:35:04,996][52059] Updated weights for policy 1, policy_version 42182 (0.0009) [2023-10-08 01:35:05,367][52059] Updated weights for policy 1, policy_version 42192 (0.0010) [2023-10-08 01:35:05,721][52059] Updated weights for policy 1, policy_version 42202 (0.0010) [2023-10-08 01:35:06,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 85884928. Throughput: 0: 1714.7, 1: 1744.1. Samples: 21472188. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:35:06,211][50642] Avg episode reward: [(0, '21.810'), (1, '19.890')] [2023-10-08 01:35:08,363][52060] Updated weights for policy 0, policy_version 41670 (0.0008) [2023-10-08 01:35:08,748][52060] Updated weights for policy 0, policy_version 41680 (0.0007) [2023-10-08 01:35:09,104][52060] Updated weights for policy 0, policy_version 41690 (0.0010) [2023-10-08 01:35:09,585][52059] Updated weights for policy 1, policy_version 42212 (0.0009) [2023-10-08 01:35:09,955][52059] Updated weights for policy 1, policy_version 42222 (0.0007) [2023-10-08 01:35:10,312][52059] Updated weights for policy 1, policy_version 42232 (0.0009) [2023-10-08 01:35:11,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 85950464. Throughput: 0: 1694.0, 1: 1739.8. Samples: 21492160. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:35:11,211][50642] Avg episode reward: [(0, '20.820'), (1, '20.380')] [2023-10-08 01:35:13,148][52060] Updated weights for policy 0, policy_version 41700 (0.0008) [2023-10-08 01:35:13,516][52060] Updated weights for policy 0, policy_version 41710 (0.0009) [2023-10-08 01:35:13,894][52060] Updated weights for policy 0, policy_version 41720 (0.0009) [2023-10-08 01:35:14,289][52059] Updated weights for policy 1, policy_version 42242 (0.0009) [2023-10-08 01:35:14,708][52059] Updated weights for policy 1, policy_version 42252 (0.0007) [2023-10-08 01:35:15,080][52059] Updated weights for policy 1, policy_version 42262 (0.0008) [2023-10-08 01:35:15,436][52059] Updated weights for policy 1, policy_version 42272 (0.0008) [2023-10-08 01:35:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 86016000. Throughput: 0: 1716.9, 1: 1715.0. Samples: 21512396. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:35:16,211][50642] Avg episode reward: [(0, '19.970'), (1, '20.180')] [2023-10-08 01:35:17,761][52060] Updated weights for policy 0, policy_version 41730 (0.0009) [2023-10-08 01:35:18,131][52060] Updated weights for policy 0, policy_version 41740 (0.0009) [2023-10-08 01:35:18,500][52060] Updated weights for policy 0, policy_version 41750 (0.0009) [2023-10-08 01:35:18,877][52060] Updated weights for policy 0, policy_version 41760 (0.0011) [2023-10-08 01:35:19,384][52059] Updated weights for policy 1, policy_version 42282 (0.0010) [2023-10-08 01:35:19,739][52059] Updated weights for policy 1, policy_version 42292 (0.0009) [2023-10-08 01:35:20,108][52059] Updated weights for policy 1, policy_version 42302 (0.0008) [2023-10-08 01:35:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 86081536. Throughput: 0: 1698.3, 1: 1741.2. Samples: 21523114. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:35:21,211][50642] Avg episode reward: [(0, '20.400'), (1, '20.790')] [2023-10-08 01:35:22,864][52060] Updated weights for policy 0, policy_version 41770 (0.0009) [2023-10-08 01:35:23,232][52060] Updated weights for policy 0, policy_version 41780 (0.0010) [2023-10-08 01:35:23,612][52060] Updated weights for policy 0, policy_version 41790 (0.0009) [2023-10-08 01:35:23,978][52059] Updated weights for policy 1, policy_version 42312 (0.0009) [2023-10-08 01:35:24,347][52059] Updated weights for policy 1, policy_version 42322 (0.0008) [2023-10-08 01:35:24,716][52059] Updated weights for policy 1, policy_version 42332 (0.0008) [2023-10-08 01:35:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 86147072. Throughput: 0: 1705.7, 1: 1718.4. Samples: 21543292. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:35:26,211][50642] Avg episode reward: [(0, '21.760'), (1, '19.380')] [2023-10-08 01:35:27,577][52060] Updated weights for policy 0, policy_version 41800 (0.0007) [2023-10-08 01:35:27,947][52060] Updated weights for policy 0, policy_version 41810 (0.0008) [2023-10-08 01:35:28,310][52060] Updated weights for policy 0, policy_version 41820 (0.0008) [2023-10-08 01:35:28,618][52059] Updated weights for policy 1, policy_version 42342 (0.0008) [2023-10-08 01:35:28,980][52059] Updated weights for policy 1, policy_version 42352 (0.0008) [2023-10-08 01:35:29,346][52059] Updated weights for policy 1, policy_version 42362 (0.0009) [2023-10-08 01:35:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 86212608. Throughput: 0: 1726.4, 1: 1714.4. Samples: 21564516. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 01:35:31,211][50642] Avg episode reward: [(0, '19.560'), (1, '20.800')] [2023-10-08 01:35:32,155][52060] Updated weights for policy 0, policy_version 41830 (0.0010) [2023-10-08 01:35:32,524][52060] Updated weights for policy 0, policy_version 41840 (0.0011) [2023-10-08 01:35:32,892][52060] Updated weights for policy 0, policy_version 41850 (0.0009) [2023-10-08 01:35:33,325][52059] Updated weights for policy 1, policy_version 42372 (0.0007) [2023-10-08 01:35:33,702][52059] Updated weights for policy 1, policy_version 42382 (0.0009) [2023-10-08 01:35:34,065][52059] Updated weights for policy 1, policy_version 42392 (0.0007) [2023-10-08 01:35:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 86278144. Throughput: 0: 1695.2, 1: 1726.2. Samples: 21574462. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 01:35:36,211][50642] Avg episode reward: [(0, '19.880'), (1, '21.540')] [2023-10-08 01:35:36,865][52060] Updated weights for policy 0, policy_version 41860 (0.0009) [2023-10-08 01:35:37,254][52060] Updated weights for policy 0, policy_version 41870 (0.0012) [2023-10-08 01:35:37,632][52060] Updated weights for policy 0, policy_version 41880 (0.0011) [2023-10-08 01:35:37,978][52059] Updated weights for policy 1, policy_version 42402 (0.0007) [2023-10-08 01:35:38,343][52059] Updated weights for policy 1, policy_version 42412 (0.0009) [2023-10-08 01:35:38,711][52059] Updated weights for policy 1, policy_version 42422 (0.0008) [2023-10-08 01:35:39,074][52059] Updated weights for policy 1, policy_version 42432 (0.0007) [2023-10-08 01:35:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 86343680. Throughput: 0: 1715.9, 1: 1711.7. Samples: 21594998. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 01:35:41,211][50642] Avg episode reward: [(0, '20.660'), (1, '19.040')] [2023-10-08 01:35:41,824][52060] Updated weights for policy 0, policy_version 41890 (0.0009) [2023-10-08 01:35:42,198][52060] Updated weights for policy 0, policy_version 41900 (0.0007) [2023-10-08 01:35:42,573][52060] Updated weights for policy 0, policy_version 41910 (0.0008) [2023-10-08 01:35:42,945][52060] Updated weights for policy 0, policy_version 41920 (0.0008) [2023-10-08 01:35:43,102][52059] Updated weights for policy 1, policy_version 42442 (0.0010) [2023-10-08 01:35:43,471][52059] Updated weights for policy 1, policy_version 42452 (0.0007) [2023-10-08 01:35:43,835][52059] Updated weights for policy 1, policy_version 42462 (0.0009) [2023-10-08 01:35:46,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 86409216. Throughput: 0: 1711.9, 1: 1727.8. Samples: 21616070. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 01:35:46,211][50642] Avg episode reward: [(0, '20.100'), (1, '20.740')] [2023-10-08 01:35:46,943][52060] Updated weights for policy 0, policy_version 41930 (0.0007) [2023-10-08 01:35:47,310][52060] Updated weights for policy 0, policy_version 41940 (0.0007) [2023-10-08 01:35:47,685][52060] Updated weights for policy 0, policy_version 41950 (0.0008) [2023-10-08 01:35:47,693][52059] Updated weights for policy 1, policy_version 42472 (0.0010) [2023-10-08 01:35:48,051][52059] Updated weights for policy 1, policy_version 42482 (0.0007) [2023-10-08 01:35:48,427][52059] Updated weights for policy 1, policy_version 42492 (0.0008) [2023-10-08 01:35:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 86474752. Throughput: 0: 1694.2, 1: 1711.8. Samples: 21625456. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 01:35:51,211][50642] Avg episode reward: [(0, '18.890'), (1, '21.530')] [2023-10-08 01:35:51,791][52060] Updated weights for policy 0, policy_version 41960 (0.0008) [2023-10-08 01:35:52,154][52060] Updated weights for policy 0, policy_version 41970 (0.0009) [2023-10-08 01:35:52,450][52059] Updated weights for policy 1, policy_version 42502 (0.0009) [2023-10-08 01:35:52,521][52060] Updated weights for policy 0, policy_version 41980 (0.0009) [2023-10-08 01:35:52,816][52059] Updated weights for policy 1, policy_version 42512 (0.0008) [2023-10-08 01:35:53,177][52059] Updated weights for policy 1, policy_version 42522 (0.0007) [2023-10-08 01:35:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 86540288. Throughput: 0: 1719.6, 1: 1718.6. Samples: 21646878. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 01:35:56,211][50642] Avg episode reward: [(0, '20.420'), (1, '20.750')] [2023-10-08 01:35:56,535][52060] Updated weights for policy 0, policy_version 41990 (0.0007) [2023-10-08 01:35:56,898][52060] Updated weights for policy 0, policy_version 42000 (0.0007) [2023-10-08 01:35:57,049][52059] Updated weights for policy 1, policy_version 42532 (0.0008) [2023-10-08 01:35:57,258][52060] Updated weights for policy 0, policy_version 42010 (0.0008) [2023-10-08 01:35:57,422][52059] Updated weights for policy 1, policy_version 42542 (0.0008) [2023-10-08 01:35:57,788][52059] Updated weights for policy 1, policy_version 42552 (0.0008) [2023-10-08 01:36:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 86605824. Throughput: 0: 1719.6, 1: 1744.9. Samples: 21668296. Policy #0 lag: (min: 14.0, avg: 18.8, max: 46.0) [2023-10-08 01:36:01,211][50642] Avg episode reward: [(0, '20.220'), (1, '19.170')] [2023-10-08 01:36:01,278][52060] Updated weights for policy 0, policy_version 42020 (0.0008) [2023-10-08 01:36:01,644][52060] Updated weights for policy 0, policy_version 42030 (0.0009) [2023-10-08 01:36:01,772][52059] Updated weights for policy 1, policy_version 42562 (0.0010) [2023-10-08 01:36:02,014][52060] Updated weights for policy 0, policy_version 42040 (0.0010) [2023-10-08 01:36:02,175][52059] Updated weights for policy 1, policy_version 42572 (0.0009) [2023-10-08 01:36:02,544][52059] Updated weights for policy 1, policy_version 42582 (0.0008) [2023-10-08 01:36:02,904][52059] Updated weights for policy 1, policy_version 42592 (0.0010) [2023-10-08 01:36:05,876][52060] Updated weights for policy 0, policy_version 42050 (0.0008) [2023-10-08 01:36:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 86671360. Throughput: 0: 1716.2, 1: 1714.5. Samples: 21677494. Policy #0 lag: (min: 14.0, avg: 18.8, max: 46.0) [2023-10-08 01:36:06,211][50642] Avg episode reward: [(0, '20.220'), (1, '22.080')] [2023-10-08 01:36:06,251][52060] Updated weights for policy 0, policy_version 42060 (0.0008) [2023-10-08 01:36:06,615][52060] Updated weights for policy 0, policy_version 42070 (0.0010) [2023-10-08 01:36:06,861][52059] Updated weights for policy 1, policy_version 42602 (0.0008) [2023-10-08 01:36:06,979][52060] Updated weights for policy 0, policy_version 42080 (0.0007) [2023-10-08 01:36:07,218][52059] Updated weights for policy 1, policy_version 42612 (0.0009) [2023-10-08 01:36:07,590][52059] Updated weights for policy 1, policy_version 42622 (0.0008) [2023-10-08 01:36:11,092][52060] Updated weights for policy 0, policy_version 42090 (0.0009) [2023-10-08 01:36:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 86736896. Throughput: 0: 1711.4, 1: 1738.2. Samples: 21698524. Policy #0 lag: (min: 14.0, avg: 18.8, max: 46.0) [2023-10-08 01:36:11,211][50642] Avg episode reward: [(0, '21.010'), (1, '21.560')] [2023-10-08 01:36:11,345][52059] Updated weights for policy 1, policy_version 42632 (0.0007) [2023-10-08 01:36:11,461][52060] Updated weights for policy 0, policy_version 42100 (0.0008) [2023-10-08 01:36:11,705][52059] Updated weights for policy 1, policy_version 42642 (0.0008) [2023-10-08 01:36:11,825][52060] Updated weights for policy 0, policy_version 42110 (0.0008) [2023-10-08 01:36:12,067][52059] Updated weights for policy 1, policy_version 42652 (0.0007) [2023-10-08 01:36:15,811][52060] Updated weights for policy 0, policy_version 42120 (0.0009) [2023-10-08 01:36:16,054][52059] Updated weights for policy 1, policy_version 42662 (0.0008) [2023-10-08 01:36:16,175][52060] Updated weights for policy 0, policy_version 42130 (0.0009) [2023-10-08 01:36:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 86802432. Throughput: 0: 1700.8, 1: 1746.7. Samples: 21719652. Policy #0 lag: (min: 14.0, avg: 18.8, max: 46.0) [2023-10-08 01:36:16,211][50642] Avg episode reward: [(0, '21.220'), (1, '20.010')] [2023-10-08 01:36:16,417][52059] Updated weights for policy 1, policy_version 42672 (0.0007) [2023-10-08 01:36:16,546][52060] Updated weights for policy 0, policy_version 42140 (0.0007) [2023-10-08 01:36:16,782][52059] Updated weights for policy 1, policy_version 42682 (0.0008) [2023-10-08 01:36:20,378][52060] Updated weights for policy 0, policy_version 42150 (0.0009) [2023-10-08 01:36:20,740][52060] Updated weights for policy 0, policy_version 42160 (0.0009) [2023-10-08 01:36:20,748][52059] Updated weights for policy 1, policy_version 42692 (0.0009) [2023-10-08 01:36:21,116][52059] Updated weights for policy 1, policy_version 42702 (0.0008) [2023-10-08 01:36:21,121][52060] Updated weights for policy 0, policy_version 42170 (0.0009) [2023-10-08 01:36:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 86867968. Throughput: 0: 1709.3, 1: 1733.2. Samples: 21729374. Policy #0 lag: (min: 14.0, avg: 18.8, max: 46.0) [2023-10-08 01:36:21,211][50642] Avg episode reward: [(0, '20.050'), (1, '21.480')] [2023-10-08 01:36:21,480][52059] Updated weights for policy 1, policy_version 42712 (0.0008) [2023-10-08 01:36:25,175][52060] Updated weights for policy 0, policy_version 42180 (0.0009) [2023-10-08 01:36:25,406][52059] Updated weights for policy 1, policy_version 42722 (0.0010) [2023-10-08 01:36:25,543][52060] Updated weights for policy 0, policy_version 42190 (0.0007) [2023-10-08 01:36:25,772][52059] Updated weights for policy 1, policy_version 42732 (0.0007) [2023-10-08 01:36:25,899][52060] Updated weights for policy 0, policy_version 42200 (0.0007) [2023-10-08 01:36:26,138][52059] Updated weights for policy 1, policy_version 42742 (0.0007) [2023-10-08 01:36:26,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 86966272. Throughput: 0: 1714.2, 1: 1747.2. Samples: 21750760. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) [2023-10-08 01:36:26,211][50642] Avg episode reward: [(0, '21.000'), (1, '22.250')] [2023-10-08 01:36:26,498][52059] Updated weights for policy 1, policy_version 42752 (0.0008) [2023-10-08 01:36:29,812][52060] Updated weights for policy 0, policy_version 42210 (0.0009) [2023-10-08 01:36:30,175][52060] Updated weights for policy 0, policy_version 42220 (0.0008) [2023-10-08 01:36:30,437][52059] Updated weights for policy 1, policy_version 42762 (0.0009) [2023-10-08 01:36:30,542][52060] Updated weights for policy 0, policy_version 42230 (0.0007) [2023-10-08 01:36:30,793][52059] Updated weights for policy 1, policy_version 42772 (0.0009) [2023-10-08 01:36:30,918][52060] Updated weights for policy 0, policy_version 42240 (0.0007) [2023-10-08 01:36:31,166][52059] Updated weights for policy 1, policy_version 42782 (0.0008) [2023-10-08 01:36:31,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 87031808. Throughput: 0: 1694.2, 1: 1728.3. Samples: 21770080. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) [2023-10-08 01:36:31,211][50642] Avg episode reward: [(0, '21.210'), (1, '21.570')] [2023-10-08 01:36:35,051][52060] Updated weights for policy 0, policy_version 42250 (0.0008) [2023-10-08 01:36:35,068][52059] Updated weights for policy 1, policy_version 42792 (0.0007) [2023-10-08 01:36:35,427][52060] Updated weights for policy 0, policy_version 42260 (0.0008) [2023-10-08 01:36:35,439][52059] Updated weights for policy 1, policy_version 42802 (0.0007) [2023-10-08 01:36:35,799][52060] Updated weights for policy 0, policy_version 42270 (0.0010) [2023-10-08 01:36:35,802][52059] Updated weights for policy 1, policy_version 42812 (0.0008) [2023-10-08 01:36:36,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 87130112. Throughput: 0: 1723.2, 1: 1744.0. Samples: 21781476. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) [2023-10-08 01:36:36,211][50642] Avg episode reward: [(0, '20.350'), (1, '21.000')] [2023-10-08 01:36:39,801][52060] Updated weights for policy 0, policy_version 42280 (0.0008) [2023-10-08 01:36:39,817][52059] Updated weights for policy 1, policy_version 42822 (0.0008) [2023-10-08 01:36:40,167][52060] Updated weights for policy 0, policy_version 42290 (0.0009) [2023-10-08 01:36:40,190][52059] Updated weights for policy 1, policy_version 42832 (0.0008) [2023-10-08 01:36:40,536][52060] Updated weights for policy 0, policy_version 42300 (0.0009) [2023-10-08 01:36:40,556][52059] Updated weights for policy 1, policy_version 42842 (0.0007) [2023-10-08 01:36:41,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 87195648. Throughput: 0: 1706.5, 1: 1736.8. Samples: 21801828. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) [2023-10-08 01:36:41,211][50642] Avg episode reward: [(0, '19.390'), (1, '20.890')] [2023-10-08 01:36:44,350][52060] Updated weights for policy 0, policy_version 42310 (0.0009) [2023-10-08 01:36:44,557][52059] Updated weights for policy 1, policy_version 42852 (0.0008) [2023-10-08 01:36:44,728][52060] Updated weights for policy 0, policy_version 42320 (0.0008) [2023-10-08 01:36:44,916][52059] Updated weights for policy 1, policy_version 42862 (0.0009) [2023-10-08 01:36:45,103][52060] Updated weights for policy 0, policy_version 42330 (0.0009) [2023-10-08 01:36:45,282][52059] Updated weights for policy 1, policy_version 42872 (0.0008) [2023-10-08 01:36:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 87261184. Throughput: 0: 1685.6, 1: 1710.7. Samples: 21821128. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) [2023-10-08 01:36:46,211][50642] Avg episode reward: [(0, '20.140'), (1, '20.670')] [2023-10-08 01:36:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000042336_43352064.pth... [2023-10-08 01:36:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000042880_43909120.pth... [2023-10-08 01:36:46,257][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000040736_41713664.pth [2023-10-08 01:36:46,260][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000041248_42237952.pth [2023-10-08 01:36:49,064][52060] Updated weights for policy 0, policy_version 42340 (0.0009) [2023-10-08 01:36:49,360][52059] Updated weights for policy 1, policy_version 42882 (0.0009) [2023-10-08 01:36:49,437][52060] Updated weights for policy 0, policy_version 42350 (0.0009) [2023-10-08 01:36:49,787][52059] Updated weights for policy 1, policy_version 42892 (0.0009) [2023-10-08 01:36:49,803][52060] Updated weights for policy 0, policy_version 42360 (0.0008) [2023-10-08 01:36:50,141][52059] Updated weights for policy 1, policy_version 42902 (0.0009) [2023-10-08 01:36:50,501][52059] Updated weights for policy 1, policy_version 42912 (0.0011) [2023-10-08 01:36:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 87326720. Throughput: 0: 1716.4, 1: 1742.1. Samples: 21833126. Policy #0 lag: (min: 31.0, avg: 36.6, max: 63.0) [2023-10-08 01:36:51,211][50642] Avg episode reward: [(0, '20.210'), (1, '18.460')] [2023-10-08 01:36:53,656][52060] Updated weights for policy 0, policy_version 42370 (0.0007) [2023-10-08 01:36:54,021][52060] Updated weights for policy 0, policy_version 42380 (0.0010) [2023-10-08 01:36:54,396][52060] Updated weights for policy 0, policy_version 42390 (0.0007) [2023-10-08 01:36:54,418][52059] Updated weights for policy 1, policy_version 42922 (0.0008) [2023-10-08 01:36:54,760][52060] Updated weights for policy 0, policy_version 42400 (0.0007) [2023-10-08 01:36:54,779][52059] Updated weights for policy 1, policy_version 42932 (0.0008) [2023-10-08 01:36:55,147][52059] Updated weights for policy 1, policy_version 42942 (0.0008) [2023-10-08 01:36:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 87392256. Throughput: 0: 1694.8, 1: 1716.8. Samples: 21852048. Policy #0 lag: (min: 2.0, avg: 8.0, max: 34.0) [2023-10-08 01:36:56,211][50642] Avg episode reward: [(0, '18.990'), (1, '20.780')] [2023-10-08 01:36:58,655][52060] Updated weights for policy 0, policy_version 42410 (0.0010) [2023-10-08 01:36:59,019][52060] Updated weights for policy 0, policy_version 42420 (0.0009) [2023-10-08 01:36:59,057][52059] Updated weights for policy 1, policy_version 42952 (0.0009) [2023-10-08 01:36:59,383][52060] Updated weights for policy 0, policy_version 42430 (0.0008) [2023-10-08 01:36:59,411][52059] Updated weights for policy 1, policy_version 42962 (0.0009) [2023-10-08 01:36:59,777][52059] Updated weights for policy 1, policy_version 42972 (0.0008) [2023-10-08 01:37:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 87457792. Throughput: 0: 1707.1, 1: 1703.8. Samples: 21873142. Policy #0 lag: (min: 2.0, avg: 8.0, max: 34.0) [2023-10-08 01:37:01,211][50642] Avg episode reward: [(0, '19.320'), (1, '21.300')] [2023-10-08 01:37:03,515][52060] Updated weights for policy 0, policy_version 42440 (0.0009) [2023-10-08 01:37:03,701][52059] Updated weights for policy 1, policy_version 42982 (0.0009) [2023-10-08 01:37:03,876][52060] Updated weights for policy 0, policy_version 42450 (0.0008) [2023-10-08 01:37:04,060][52059] Updated weights for policy 1, policy_version 42992 (0.0010) [2023-10-08 01:37:04,245][52060] Updated weights for policy 0, policy_version 42460 (0.0007) [2023-10-08 01:37:04,421][52059] Updated weights for policy 1, policy_version 43002 (0.0009) [2023-10-08 01:37:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 87523328. Throughput: 0: 1711.5, 1: 1725.2. Samples: 21884030. Policy #0 lag: (min: 2.0, avg: 8.0, max: 34.0) [2023-10-08 01:37:06,211][50642] Avg episode reward: [(0, '21.700'), (1, '20.310')] [2023-10-08 01:37:08,157][52060] Updated weights for policy 0, policy_version 42470 (0.0009) [2023-10-08 01:37:08,338][52059] Updated weights for policy 1, policy_version 43012 (0.0007) [2023-10-08 01:37:08,531][52060] Updated weights for policy 0, policy_version 42480 (0.0008) [2023-10-08 01:37:08,700][52059] Updated weights for policy 1, policy_version 43022 (0.0008) [2023-10-08 01:37:08,892][52060] Updated weights for policy 0, policy_version 42490 (0.0010) [2023-10-08 01:37:09,068][52059] Updated weights for policy 1, policy_version 43032 (0.0009) [2023-10-08 01:37:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 87588864. Throughput: 0: 1695.2, 1: 1705.6. Samples: 21903798. Policy #0 lag: (min: 2.0, avg: 8.0, max: 34.0) [2023-10-08 01:37:11,211][50642] Avg episode reward: [(0, '20.980'), (1, '19.600')] [2023-10-08 01:37:12,910][52060] Updated weights for policy 0, policy_version 42500 (0.0010) [2023-10-08 01:37:13,120][52059] Updated weights for policy 1, policy_version 43042 (0.0009) [2023-10-08 01:37:13,279][52060] Updated weights for policy 0, policy_version 42510 (0.0007) [2023-10-08 01:37:13,484][52059] Updated weights for policy 1, policy_version 43052 (0.0008) [2023-10-08 01:37:13,645][52060] Updated weights for policy 0, policy_version 42520 (0.0007) [2023-10-08 01:37:13,842][52059] Updated weights for policy 1, policy_version 43062 (0.0009) [2023-10-08 01:37:14,215][52059] Updated weights for policy 1, policy_version 43072 (0.0010) [2023-10-08 01:37:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 87654400. Throughput: 0: 1722.0, 1: 1719.1. Samples: 21924930. Policy #0 lag: (min: 2.0, avg: 8.0, max: 34.0) [2023-10-08 01:37:16,211][50642] Avg episode reward: [(0, '19.430'), (1, '21.050')] [2023-10-08 01:37:17,653][52060] Updated weights for policy 0, policy_version 42530 (0.0007) [2023-10-08 01:37:18,020][52060] Updated weights for policy 0, policy_version 42540 (0.0007) [2023-10-08 01:37:18,213][52059] Updated weights for policy 1, policy_version 43082 (0.0008) [2023-10-08 01:37:18,388][52060] Updated weights for policy 0, policy_version 42550 (0.0008) [2023-10-08 01:37:18,573][52059] Updated weights for policy 1, policy_version 43092 (0.0008) [2023-10-08 01:37:18,752][52060] Updated weights for policy 0, policy_version 42560 (0.0008) [2023-10-08 01:37:18,941][52059] Updated weights for policy 1, policy_version 43102 (0.0008) [2023-10-08 01:37:21,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 87719936. Throughput: 0: 1693.9, 1: 1705.5. Samples: 21934450. Policy #0 lag: (min: 2.0, avg: 8.0, max: 34.0) [2023-10-08 01:37:21,211][50642] Avg episode reward: [(0, '20.950'), (1, '19.390')] [2023-10-08 01:37:22,803][52059] Updated weights for policy 1, policy_version 43112 (0.0008) [2023-10-08 01:37:22,817][52060] Updated weights for policy 0, policy_version 42570 (0.0007) [2023-10-08 01:37:23,171][52059] Updated weights for policy 1, policy_version 43122 (0.0008) [2023-10-08 01:37:23,181][52060] Updated weights for policy 0, policy_version 42580 (0.0008) [2023-10-08 01:37:23,531][52059] Updated weights for policy 1, policy_version 43132 (0.0007) [2023-10-08 01:37:23,554][52060] Updated weights for policy 0, policy_version 42590 (0.0007) [2023-10-08 01:37:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 87785472. Throughput: 0: 1703.1, 1: 1712.5. Samples: 21955528. Policy #0 lag: (min: 12.0, avg: 19.7, max: 44.0) [2023-10-08 01:37:26,211][50642] Avg episode reward: [(0, '20.190'), (1, '19.390')] [2023-10-08 01:37:27,516][52060] Updated weights for policy 0, policy_version 42600 (0.0007) [2023-10-08 01:37:27,523][52059] Updated weights for policy 1, policy_version 43142 (0.0008) [2023-10-08 01:37:27,882][52060] Updated weights for policy 0, policy_version 42610 (0.0009) [2023-10-08 01:37:27,884][52059] Updated weights for policy 1, policy_version 43152 (0.0008) [2023-10-08 01:37:28,246][52059] Updated weights for policy 1, policy_version 43162 (0.0009) [2023-10-08 01:37:28,250][52060] Updated weights for policy 0, policy_version 42620 (0.0008) [2023-10-08 01:37:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 87851008. Throughput: 0: 1728.5, 1: 1732.4. Samples: 21976866. Policy #0 lag: (min: 12.0, avg: 19.7, max: 44.0) [2023-10-08 01:37:31,211][50642] Avg episode reward: [(0, '19.800'), (1, '19.650')] [2023-10-08 01:37:32,235][52060] Updated weights for policy 0, policy_version 42630 (0.0009) [2023-10-08 01:37:32,325][52059] Updated weights for policy 1, policy_version 43172 (0.0007) [2023-10-08 01:37:32,611][52060] Updated weights for policy 0, policy_version 42640 (0.0008) [2023-10-08 01:37:32,685][52059] Updated weights for policy 1, policy_version 43182 (0.0009) [2023-10-08 01:37:32,991][52060] Updated weights for policy 0, policy_version 42650 (0.0007) [2023-10-08 01:37:33,051][52059] Updated weights for policy 1, policy_version 43192 (0.0009) [2023-10-08 01:37:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 87916544. Throughput: 0: 1698.7, 1: 1702.2. Samples: 21986166. Policy #0 lag: (min: 12.0, avg: 19.7, max: 44.0) [2023-10-08 01:37:36,211][50642] Avg episode reward: [(0, '20.400'), (1, '21.420')] [2023-10-08 01:37:36,963][52059] Updated weights for policy 1, policy_version 43202 (0.0009) [2023-10-08 01:37:37,021][52060] Updated weights for policy 0, policy_version 42660 (0.0009) [2023-10-08 01:37:37,374][52059] Updated weights for policy 1, policy_version 43212 (0.0009) [2023-10-08 01:37:37,387][52060] Updated weights for policy 0, policy_version 42670 (0.0007) [2023-10-08 01:37:37,727][52059] Updated weights for policy 1, policy_version 43222 (0.0009) [2023-10-08 01:37:37,748][52060] Updated weights for policy 0, policy_version 42680 (0.0009) [2023-10-08 01:37:38,094][52059] Updated weights for policy 1, policy_version 43232 (0.0009) [2023-10-08 01:37:41,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 87982080. Throughput: 0: 1723.2, 1: 1722.4. Samples: 22007096. Policy #0 lag: (min: 12.0, avg: 19.7, max: 44.0) [2023-10-08 01:37:41,211][50642] Avg episode reward: [(0, '20.330'), (1, '22.130')] [2023-10-08 01:37:41,722][52060] Updated weights for policy 0, policy_version 42690 (0.0010) [2023-10-08 01:37:42,083][52060] Updated weights for policy 0, policy_version 42700 (0.0007) [2023-10-08 01:37:42,127][52059] Updated weights for policy 1, policy_version 43242 (0.0007) [2023-10-08 01:37:42,457][52060] Updated weights for policy 0, policy_version 42710 (0.0008) [2023-10-08 01:37:42,497][52059] Updated weights for policy 1, policy_version 43252 (0.0007) [2023-10-08 01:37:42,815][52060] Updated weights for policy 0, policy_version 42720 (0.0008) [2023-10-08 01:37:42,862][52059] Updated weights for policy 1, policy_version 43262 (0.0008) [2023-10-08 01:37:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 88047616. Throughput: 0: 1718.3, 1: 1726.8. Samples: 22028174. Policy #0 lag: (min: 12.0, avg: 19.7, max: 44.0) [2023-10-08 01:37:46,211][50642] Avg episode reward: [(0, '20.160'), (1, '21.900')] [2023-10-08 01:37:46,854][52060] Updated weights for policy 0, policy_version 42730 (0.0008) [2023-10-08 01:37:46,919][52059] Updated weights for policy 1, policy_version 43272 (0.0007) [2023-10-08 01:37:47,232][52060] Updated weights for policy 0, policy_version 42740 (0.0007) [2023-10-08 01:37:47,287][52059] Updated weights for policy 1, policy_version 43282 (0.0007) [2023-10-08 01:37:47,606][52060] Updated weights for policy 0, policy_version 42750 (0.0009) [2023-10-08 01:37:47,644][52059] Updated weights for policy 1, policy_version 43292 (0.0008) [2023-10-08 01:37:51,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 88113152. Throughput: 0: 1705.6, 1: 1701.4. Samples: 22037342. Policy #0 lag: (min: 12.0, avg: 19.7, max: 44.0) [2023-10-08 01:37:51,211][50642] Avg episode reward: [(0, '20.070'), (1, '21.460')] [2023-10-08 01:37:51,442][52060] Updated weights for policy 0, policy_version 42760 (0.0009) [2023-10-08 01:37:51,646][52059] Updated weights for policy 1, policy_version 43302 (0.0007) [2023-10-08 01:37:51,812][52060] Updated weights for policy 0, policy_version 42770 (0.0008) [2023-10-08 01:37:52,010][52059] Updated weights for policy 1, policy_version 43312 (0.0007) [2023-10-08 01:37:52,180][52060] Updated weights for policy 0, policy_version 42780 (0.0008) [2023-10-08 01:37:52,377][52059] Updated weights for policy 1, policy_version 43322 (0.0009) [2023-10-08 01:37:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 88178688. Throughput: 0: 1718.8, 1: 1718.3. Samples: 22058466. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 01:37:56,211][50642] Avg episode reward: [(0, '20.550'), (1, '19.960')] [2023-10-08 01:37:56,226][52060] Updated weights for policy 0, policy_version 42790 (0.0007) [2023-10-08 01:37:56,325][52059] Updated weights for policy 1, policy_version 43332 (0.0007) [2023-10-08 01:37:56,590][52060] Updated weights for policy 0, policy_version 42800 (0.0009) [2023-10-08 01:37:56,694][52059] Updated weights for policy 1, policy_version 43342 (0.0008) [2023-10-08 01:37:56,962][52060] Updated weights for policy 0, policy_version 42810 (0.0008) [2023-10-08 01:37:57,056][52059] Updated weights for policy 1, policy_version 43352 (0.0007) [2023-10-08 01:38:00,932][52060] Updated weights for policy 0, policy_version 42820 (0.0008) [2023-10-08 01:38:00,960][52059] Updated weights for policy 1, policy_version 43362 (0.0007) [2023-10-08 01:38:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 88244224. Throughput: 0: 1713.7, 1: 1725.0. Samples: 22079672. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 01:38:01,211][50642] Avg episode reward: [(0, '20.400'), (1, '21.410')] [2023-10-08 01:38:01,302][52060] Updated weights for policy 0, policy_version 42830 (0.0008) [2023-10-08 01:38:01,330][52059] Updated weights for policy 1, policy_version 43372 (0.0008) [2023-10-08 01:38:01,667][52060] Updated weights for policy 0, policy_version 42840 (0.0007) [2023-10-08 01:38:01,698][52059] Updated weights for policy 1, policy_version 43382 (0.0008) [2023-10-08 01:38:02,061][52059] Updated weights for policy 1, policy_version 43392 (0.0008) [2023-10-08 01:38:05,665][52060] Updated weights for policy 0, policy_version 42850 (0.0008) [2023-10-08 01:38:06,033][52060] Updated weights for policy 0, policy_version 42860 (0.0008) [2023-10-08 01:38:06,081][52059] Updated weights for policy 1, policy_version 43402 (0.0009) [2023-10-08 01:38:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 88309760. Throughput: 0: 1715.2, 1: 1721.8. Samples: 22089116. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 01:38:06,211][50642] Avg episode reward: [(0, '21.540'), (1, '17.410')] [2023-10-08 01:38:06,399][52060] Updated weights for policy 0, policy_version 42870 (0.0007) [2023-10-08 01:38:06,447][52059] Updated weights for policy 1, policy_version 43412 (0.0007) [2023-10-08 01:38:06,769][52060] Updated weights for policy 0, policy_version 42880 (0.0007) [2023-10-08 01:38:06,815][52059] Updated weights for policy 1, policy_version 43422 (0.0009) [2023-10-08 01:38:10,708][52060] Updated weights for policy 0, policy_version 42890 (0.0009) [2023-10-08 01:38:10,895][52059] Updated weights for policy 1, policy_version 43432 (0.0008) [2023-10-08 01:38:11,078][52060] Updated weights for policy 0, policy_version 42900 (0.0007) [2023-10-08 01:38:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 88375296. Throughput: 0: 1717.9, 1: 1716.8. Samples: 22110086. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 01:38:11,211][50642] Avg episode reward: [(0, '20.290'), (1, '18.180')] [2023-10-08 01:38:11,249][52059] Updated weights for policy 1, policy_version 43442 (0.0011) [2023-10-08 01:38:11,437][52060] Updated weights for policy 0, policy_version 42910 (0.0009) [2023-10-08 01:38:11,616][52059] Updated weights for policy 1, policy_version 43452 (0.0010) [2023-10-08 01:38:15,414][52060] Updated weights for policy 0, policy_version 42920 (0.0008) [2023-10-08 01:38:15,420][52059] Updated weights for policy 1, policy_version 43462 (0.0008) [2023-10-08 01:38:15,778][52060] Updated weights for policy 0, policy_version 42930 (0.0008) [2023-10-08 01:38:15,780][52059] Updated weights for policy 1, policy_version 43472 (0.0009) [2023-10-08 01:38:16,138][52060] Updated weights for policy 0, policy_version 42940 (0.0007) [2023-10-08 01:38:16,149][52059] Updated weights for policy 1, policy_version 43482 (0.0009) [2023-10-08 01:38:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 88440832. Throughput: 0: 1693.5, 1: 1707.2. Samples: 22129900. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 01:38:16,211][50642] Avg episode reward: [(0, '20.480'), (1, '17.880')] [2023-10-08 01:38:20,115][52059] Updated weights for policy 1, policy_version 43492 (0.0009) [2023-10-08 01:38:20,282][52060] Updated weights for policy 0, policy_version 42950 (0.0007) [2023-10-08 01:38:20,485][52059] Updated weights for policy 1, policy_version 43502 (0.0010) [2023-10-08 01:38:20,659][52060] Updated weights for policy 0, policy_version 42960 (0.0009) [2023-10-08 01:38:20,848][52059] Updated weights for policy 1, policy_version 43512 (0.0008) [2023-10-08 01:38:21,027][52060] Updated weights for policy 0, policy_version 42970 (0.0009) [2023-10-08 01:38:21,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 88539136. Throughput: 0: 1713.9, 1: 1722.6. Samples: 22140810. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 01:38:21,211][50642] Avg episode reward: [(0, '21.660'), (1, '19.100')] [2023-10-08 01:38:24,477][52059] Updated weights for policy 1, policy_version 43522 (0.0008) [2023-10-08 01:38:24,907][52059] Updated weights for policy 1, policy_version 43532 (0.0008) [2023-10-08 01:38:25,072][52060] Updated weights for policy 0, policy_version 42980 (0.0010) [2023-10-08 01:38:25,271][52059] Updated weights for policy 1, policy_version 43542 (0.0007) [2023-10-08 01:38:25,449][52060] Updated weights for policy 0, policy_version 42990 (0.0009) [2023-10-08 01:38:25,627][52059] Updated weights for policy 1, policy_version 43552 (0.0008) [2023-10-08 01:38:25,810][52060] Updated weights for policy 0, policy_version 43000 (0.0009) [2023-10-08 01:38:26,210][50642] Fps is (10 sec: 19660.9, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 88637440. Throughput: 0: 1713.4, 1: 1722.8. Samples: 22161724. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:38:26,211][50642] Avg episode reward: [(0, '19.370'), (1, '18.050')] [2023-10-08 01:38:29,453][52059] Updated weights for policy 1, policy_version 43562 (0.0010) [2023-10-08 01:38:29,725][52060] Updated weights for policy 0, policy_version 43010 (0.0009) [2023-10-08 01:38:29,823][52059] Updated weights for policy 1, policy_version 43572 (0.0007) [2023-10-08 01:38:30,091][52060] Updated weights for policy 0, policy_version 43020 (0.0009) [2023-10-08 01:38:30,186][52059] Updated weights for policy 1, policy_version 43582 (0.0007) [2023-10-08 01:38:30,457][52060] Updated weights for policy 0, policy_version 43030 (0.0009) [2023-10-08 01:38:30,819][52060] Updated weights for policy 0, policy_version 43040 (0.0008) [2023-10-08 01:38:31,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 88702976. Throughput: 0: 1690.1, 1: 1709.9. Samples: 22181172. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:38:31,211][50642] Avg episode reward: [(0, '19.550'), (1, '18.750')] [2023-10-08 01:38:34,013][52059] Updated weights for policy 1, policy_version 43592 (0.0010) [2023-10-08 01:38:34,380][52059] Updated weights for policy 1, policy_version 43602 (0.0010) [2023-10-08 01:38:34,739][52059] Updated weights for policy 1, policy_version 43612 (0.0007) [2023-10-08 01:38:34,772][52060] Updated weights for policy 0, policy_version 43050 (0.0009) [2023-10-08 01:38:35,135][52060] Updated weights for policy 0, policy_version 43060 (0.0011) [2023-10-08 01:38:35,503][52060] Updated weights for policy 0, policy_version 43070 (0.0008) [2023-10-08 01:38:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 88768512. Throughput: 0: 1717.3, 1: 1744.4. Samples: 22193118. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:38:36,211][50642] Avg episode reward: [(0, '20.430'), (1, '20.050')] [2023-10-08 01:38:38,747][52059] Updated weights for policy 1, policy_version 43622 (0.0008) [2023-10-08 01:38:39,109][52059] Updated weights for policy 1, policy_version 43632 (0.0010) [2023-10-08 01:38:39,474][52060] Updated weights for policy 0, policy_version 43080 (0.0007) [2023-10-08 01:38:39,481][52059] Updated weights for policy 1, policy_version 43642 (0.0011) [2023-10-08 01:38:39,841][52060] Updated weights for policy 0, policy_version 43090 (0.0007) [2023-10-08 01:38:40,212][52060] Updated weights for policy 0, policy_version 43100 (0.0009) [2023-10-08 01:38:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 88834048. Throughput: 0: 1707.9, 1: 1719.5. Samples: 22212704. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:38:41,211][50642] Avg episode reward: [(0, '19.660'), (1, '17.110')] [2023-10-08 01:38:43,383][52059] Updated weights for policy 1, policy_version 43652 (0.0007) [2023-10-08 01:38:43,744][52059] Updated weights for policy 1, policy_version 43662 (0.0009) [2023-10-08 01:38:44,109][52059] Updated weights for policy 1, policy_version 43672 (0.0007) [2023-10-08 01:38:44,220][52060] Updated weights for policy 0, policy_version 43110 (0.0008) [2023-10-08 01:38:44,598][52060] Updated weights for policy 0, policy_version 43120 (0.0008) [2023-10-08 01:38:44,966][52060] Updated weights for policy 0, policy_version 43130 (0.0008) [2023-10-08 01:38:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 88899584. Throughput: 0: 1694.2, 1: 1719.6. Samples: 22233290. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:38:46,211][50642] Avg episode reward: [(0, '20.260'), (1, '18.550')] [2023-10-08 01:38:46,218][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000043680_44728320.pth... [2023-10-08 01:38:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000043136_44171264.pth... [2023-10-08 01:38:46,247][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000042080_43089920.pth [2023-10-08 01:38:46,257][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000041536_42532864.pth [2023-10-08 01:38:48,012][52059] Updated weights for policy 1, policy_version 43682 (0.0008) [2023-10-08 01:38:48,379][52059] Updated weights for policy 1, policy_version 43692 (0.0009) [2023-10-08 01:38:48,666][52060] Updated weights for policy 0, policy_version 43140 (0.0008) [2023-10-08 01:38:48,749][52059] Updated weights for policy 1, policy_version 43702 (0.0009) [2023-10-08 01:38:49,029][52060] Updated weights for policy 0, policy_version 43150 (0.0009) [2023-10-08 01:38:49,110][52059] Updated weights for policy 1, policy_version 43712 (0.0008) [2023-10-08 01:38:49,413][52060] Updated weights for policy 0, policy_version 43160 (0.0010) [2023-10-08 01:38:51,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 88965120. Throughput: 0: 1716.2, 1: 1727.7. Samples: 22244094. Policy #0 lag: (min: 2.0, avg: 8.5, max: 34.0) [2023-10-08 01:38:51,211][50642] Avg episode reward: [(0, '19.150'), (1, '16.920')] [2023-10-08 01:38:53,104][52059] Updated weights for policy 1, policy_version 43722 (0.0009) [2023-10-08 01:38:53,448][52060] Updated weights for policy 0, policy_version 43170 (0.0007) [2023-10-08 01:38:53,460][52059] Updated weights for policy 1, policy_version 43732 (0.0008) [2023-10-08 01:38:53,825][52059] Updated weights for policy 1, policy_version 43742 (0.0007) [2023-10-08 01:38:53,831][52060] Updated weights for policy 0, policy_version 43180 (0.0008) [2023-10-08 01:38:54,190][52060] Updated weights for policy 0, policy_version 43190 (0.0008) [2023-10-08 01:38:54,559][52060] Updated weights for policy 0, policy_version 43200 (0.0007) [2023-10-08 01:38:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 89030656. Throughput: 0: 1696.2, 1: 1726.8. Samples: 22264122. Policy #0 lag: (min: 2.0, avg: 8.5, max: 34.0) [2023-10-08 01:38:56,211][50642] Avg episode reward: [(0, '19.520'), (1, '19.480')] [2023-10-08 01:38:57,861][52059] Updated weights for policy 1, policy_version 43752 (0.0010) [2023-10-08 01:38:58,225][52059] Updated weights for policy 1, policy_version 43762 (0.0008) [2023-10-08 01:38:58,498][52060] Updated weights for policy 0, policy_version 43210 (0.0009) [2023-10-08 01:38:58,596][52059] Updated weights for policy 1, policy_version 43772 (0.0009) [2023-10-08 01:38:58,864][52060] Updated weights for policy 0, policy_version 43220 (0.0007) [2023-10-08 01:38:59,228][52060] Updated weights for policy 0, policy_version 43230 (0.0007) [2023-10-08 01:39:01,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 89096192. Throughput: 0: 1718.5, 1: 1742.3. Samples: 22285636. Policy #0 lag: (min: 2.0, avg: 8.5, max: 34.0) [2023-10-08 01:39:01,211][50642] Avg episode reward: [(0, '21.000'), (1, '16.940')] [2023-10-08 01:39:02,425][52059] Updated weights for policy 1, policy_version 43782 (0.0008) [2023-10-08 01:39:02,787][52059] Updated weights for policy 1, policy_version 43792 (0.0008) [2023-10-08 01:39:03,145][52059] Updated weights for policy 1, policy_version 43802 (0.0008) [2023-10-08 01:39:03,252][52060] Updated weights for policy 0, policy_version 43240 (0.0007) [2023-10-08 01:39:03,617][52060] Updated weights for policy 0, policy_version 43250 (0.0009) [2023-10-08 01:39:03,983][52060] Updated weights for policy 0, policy_version 43260 (0.0010) [2023-10-08 01:39:06,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 89161728. Throughput: 0: 1705.2, 1: 1730.2. Samples: 22295402. Policy #0 lag: (min: 2.0, avg: 8.5, max: 34.0) [2023-10-08 01:39:06,211][50642] Avg episode reward: [(0, '19.100'), (1, '18.630')] [2023-10-08 01:39:07,044][52059] Updated weights for policy 1, policy_version 43812 (0.0008) [2023-10-08 01:39:07,402][52059] Updated weights for policy 1, policy_version 43822 (0.0008) [2023-10-08 01:39:07,775][52059] Updated weights for policy 1, policy_version 43832 (0.0009) [2023-10-08 01:39:08,161][52060] Updated weights for policy 0, policy_version 43270 (0.0010) [2023-10-08 01:39:08,535][52060] Updated weights for policy 0, policy_version 43280 (0.0010) [2023-10-08 01:39:08,908][52060] Updated weights for policy 0, policy_version 43290 (0.0010) [2023-10-08 01:39:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 89227264. Throughput: 0: 1692.7, 1: 1742.0. Samples: 22316284. Policy #0 lag: (min: 2.0, avg: 8.5, max: 34.0) [2023-10-08 01:39:11,211][50642] Avg episode reward: [(0, '19.920'), (1, '22.740')] [2023-10-08 01:39:11,591][52059] Updated weights for policy 1, policy_version 43842 (0.0009) [2023-10-08 01:39:11,989][52059] Updated weights for policy 1, policy_version 43852 (0.0010) [2023-10-08 01:39:12,353][52059] Updated weights for policy 1, policy_version 43862 (0.0007) [2023-10-08 01:39:12,719][52059] Updated weights for policy 1, policy_version 43872 (0.0009) [2023-10-08 01:39:12,830][52060] Updated weights for policy 0, policy_version 43300 (0.0008) [2023-10-08 01:39:13,197][52060] Updated weights for policy 0, policy_version 43310 (0.0010) [2023-10-08 01:39:13,571][52060] Updated weights for policy 0, policy_version 43320 (0.0009) [2023-10-08 01:39:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 89292800. Throughput: 0: 1713.3, 1: 1758.3. Samples: 22337394. Policy #0 lag: (min: 2.0, avg: 8.5, max: 34.0) [2023-10-08 01:39:16,211][50642] Avg episode reward: [(0, '22.440'), (1, '18.900')] [2023-10-08 01:39:16,559][52059] Updated weights for policy 1, policy_version 43882 (0.0010) [2023-10-08 01:39:16,921][52059] Updated weights for policy 1, policy_version 43892 (0.0011) [2023-10-08 01:39:17,279][52059] Updated weights for policy 1, policy_version 43902 (0.0010) [2023-10-08 01:39:17,462][52060] Updated weights for policy 0, policy_version 43330 (0.0007) [2023-10-08 01:39:17,829][52060] Updated weights for policy 0, policy_version 43340 (0.0008) [2023-10-08 01:39:18,198][52060] Updated weights for policy 0, policy_version 43350 (0.0009) [2023-10-08 01:39:18,570][52060] Updated weights for policy 0, policy_version 43360 (0.0008) [2023-10-08 01:39:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 89358336. Throughput: 0: 1688.7, 1: 1733.6. Samples: 22347124. Policy #0 lag: (min: 31.0, avg: 43.9, max: 63.0) [2023-10-08 01:39:21,211][50642] Avg episode reward: [(0, '20.560'), (1, '19.480')] [2023-10-08 01:39:21,298][52059] Updated weights for policy 1, policy_version 43912 (0.0010) [2023-10-08 01:39:21,665][52059] Updated weights for policy 1, policy_version 43922 (0.0009) [2023-10-08 01:39:22,029][52059] Updated weights for policy 1, policy_version 43932 (0.0009) [2023-10-08 01:39:22,605][52060] Updated weights for policy 0, policy_version 43370 (0.0007) [2023-10-08 01:39:22,982][52060] Updated weights for policy 0, policy_version 43380 (0.0007) [2023-10-08 01:39:23,355][52060] Updated weights for policy 0, policy_version 43390 (0.0007) [2023-10-08 01:39:26,058][52059] Updated weights for policy 1, policy_version 43942 (0.0008) [2023-10-08 01:39:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 89423872. Throughput: 0: 1704.3, 1: 1754.0. Samples: 22368328. Policy #0 lag: (min: 31.0, avg: 43.9, max: 63.0) [2023-10-08 01:39:26,211][50642] Avg episode reward: [(0, '18.690'), (1, '21.360')] [2023-10-08 01:39:26,422][52059] Updated weights for policy 1, policy_version 43952 (0.0007) [2023-10-08 01:39:26,783][52059] Updated weights for policy 1, policy_version 43962 (0.0009) [2023-10-08 01:39:27,251][52060] Updated weights for policy 0, policy_version 43400 (0.0009) [2023-10-08 01:39:27,626][52060] Updated weights for policy 0, policy_version 43410 (0.0007) [2023-10-08 01:39:27,992][52060] Updated weights for policy 0, policy_version 43420 (0.0007) [2023-10-08 01:39:30,796][52059] Updated weights for policy 1, policy_version 43972 (0.0009) [2023-10-08 01:39:31,158][52059] Updated weights for policy 1, policy_version 43982 (0.0008) [2023-10-08 01:39:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 89489408. Throughput: 0: 1724.3, 1: 1745.5. Samples: 22389428. Policy #0 lag: (min: 31.0, avg: 43.9, max: 63.0) [2023-10-08 01:39:31,211][50642] Avg episode reward: [(0, '22.230'), (1, '16.070')] [2023-10-08 01:39:31,511][52059] Updated weights for policy 1, policy_version 43992 (0.0007) [2023-10-08 01:39:31,969][52060] Updated weights for policy 0, policy_version 43430 (0.0008) [2023-10-08 01:39:32,337][52060] Updated weights for policy 0, policy_version 43440 (0.0010) [2023-10-08 01:39:32,703][52060] Updated weights for policy 0, policy_version 43450 (0.0010) [2023-10-08 01:39:35,239][52059] Updated weights for policy 1, policy_version 44002 (0.0009) [2023-10-08 01:39:35,611][52059] Updated weights for policy 1, policy_version 44012 (0.0009) [2023-10-08 01:39:35,975][52059] Updated weights for policy 1, policy_version 44022 (0.0008) [2023-10-08 01:39:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 89554944. Throughput: 0: 1700.9, 1: 1744.8. Samples: 22399152. Policy #0 lag: (min: 31.0, avg: 43.9, max: 63.0) [2023-10-08 01:39:36,211][50642] Avg episode reward: [(0, '20.650'), (1, '17.340')] [2023-10-08 01:39:36,335][52059] Updated weights for policy 1, policy_version 44032 (0.0009) [2023-10-08 01:39:36,707][52060] Updated weights for policy 0, policy_version 43460 (0.0008) [2023-10-08 01:39:37,072][52060] Updated weights for policy 0, policy_version 43470 (0.0010) [2023-10-08 01:39:37,444][52060] Updated weights for policy 0, policy_version 43480 (0.0010) [2023-10-08 01:39:40,326][52059] Updated weights for policy 1, policy_version 44042 (0.0007) [2023-10-08 01:39:40,689][52059] Updated weights for policy 1, policy_version 44052 (0.0008) [2023-10-08 01:39:41,054][52059] Updated weights for policy 1, policy_version 44062 (0.0008) [2023-10-08 01:39:41,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 89653248. Throughput: 0: 1720.2, 1: 1756.9. Samples: 22420592. Policy #0 lag: (min: 31.0, avg: 43.9, max: 63.0) [2023-10-08 01:39:41,211][50642] Avg episode reward: [(0, '18.800'), (1, '17.550')] [2023-10-08 01:39:41,405][52060] Updated weights for policy 0, policy_version 43490 (0.0008) [2023-10-08 01:39:41,776][52060] Updated weights for policy 0, policy_version 43500 (0.0009) [2023-10-08 01:39:42,137][52060] Updated weights for policy 0, policy_version 43510 (0.0007) [2023-10-08 01:39:42,513][52060] Updated weights for policy 0, policy_version 43520 (0.0007) [2023-10-08 01:39:44,784][52059] Updated weights for policy 1, policy_version 44072 (0.0009) [2023-10-08 01:39:45,156][52059] Updated weights for policy 1, policy_version 44082 (0.0011) [2023-10-08 01:39:45,513][52059] Updated weights for policy 1, policy_version 44092 (0.0009) [2023-10-08 01:39:46,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 89718784. Throughput: 0: 1723.8, 1: 1728.5. Samples: 22440986. Policy #0 lag: (min: 31.0, avg: 43.9, max: 63.0) [2023-10-08 01:39:46,211][50642] Avg episode reward: [(0, '19.480'), (1, '18.970')] [2023-10-08 01:39:46,481][52060] Updated weights for policy 0, policy_version 43530 (0.0010) [2023-10-08 01:39:46,842][52060] Updated weights for policy 0, policy_version 43540 (0.0007) [2023-10-08 01:39:47,211][52060] Updated weights for policy 0, policy_version 43550 (0.0010) [2023-10-08 01:39:49,441][52059] Updated weights for policy 1, policy_version 44102 (0.0007) [2023-10-08 01:39:49,812][52059] Updated weights for policy 1, policy_version 44112 (0.0007) [2023-10-08 01:39:50,164][52059] Updated weights for policy 1, policy_version 44122 (0.0007) [2023-10-08 01:39:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 89784320. Throughput: 0: 1714.0, 1: 1761.1. Samples: 22451782. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:39:51,211][50642] Avg episode reward: [(0, '22.350'), (1, '19.510')] [2023-10-08 01:39:51,265][52060] Updated weights for policy 0, policy_version 43560 (0.0009) [2023-10-08 01:39:51,646][52060] Updated weights for policy 0, policy_version 43570 (0.0009) [2023-10-08 01:39:52,017][52060] Updated weights for policy 0, policy_version 43580 (0.0009) [2023-10-08 01:39:53,952][52059] Updated weights for policy 1, policy_version 44132 (0.0008) [2023-10-08 01:39:54,314][52059] Updated weights for policy 1, policy_version 44142 (0.0009) [2023-10-08 01:39:54,677][52059] Updated weights for policy 1, policy_version 44152 (0.0010) [2023-10-08 01:39:56,003][52060] Updated weights for policy 0, policy_version 43590 (0.0008) [2023-10-08 01:39:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 89849856. Throughput: 0: 1733.3, 1: 1731.2. Samples: 22472186. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:39:56,211][50642] Avg episode reward: [(0, '19.020'), (1, '21.300')] [2023-10-08 01:39:56,380][52060] Updated weights for policy 0, policy_version 43600 (0.0009) [2023-10-08 01:39:56,756][52060] Updated weights for policy 0, policy_version 43610 (0.0009) [2023-10-08 01:39:58,601][52059] Updated weights for policy 1, policy_version 44162 (0.0009) [2023-10-08 01:39:59,015][52059] Updated weights for policy 1, policy_version 44172 (0.0008) [2023-10-08 01:39:59,385][52059] Updated weights for policy 1, policy_version 44182 (0.0007) [2023-10-08 01:39:59,752][52059] Updated weights for policy 1, policy_version 44192 (0.0007) [2023-10-08 01:40:00,683][52060] Updated weights for policy 0, policy_version 43620 (0.0008) [2023-10-08 01:40:01,045][52060] Updated weights for policy 0, policy_version 43630 (0.0007) [2023-10-08 01:40:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 89915392. Throughput: 0: 1728.5, 1: 1727.4. Samples: 22492908. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:40:01,211][50642] Avg episode reward: [(0, '17.900'), (1, '19.730')] [2023-10-08 01:40:01,417][52060] Updated weights for policy 0, policy_version 43640 (0.0008) [2023-10-08 01:40:03,648][52059] Updated weights for policy 1, policy_version 44202 (0.0007) [2023-10-08 01:40:04,011][52059] Updated weights for policy 1, policy_version 44212 (0.0009) [2023-10-08 01:40:04,368][52059] Updated weights for policy 1, policy_version 44222 (0.0007) [2023-10-08 01:40:05,435][52060] Updated weights for policy 0, policy_version 43650 (0.0009) [2023-10-08 01:40:05,802][52060] Updated weights for policy 0, policy_version 43660 (0.0009) [2023-10-08 01:40:06,158][52060] Updated weights for policy 0, policy_version 43670 (0.0007) [2023-10-08 01:40:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 89980928. Throughput: 0: 1735.0, 1: 1739.2. Samples: 22503464. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:40:06,211][50642] Avg episode reward: [(0, '23.000'), (1, '20.730')] [2023-10-08 01:40:06,525][52060] Updated weights for policy 0, policy_version 43680 (0.0009) [2023-10-08 01:40:08,220][52059] Updated weights for policy 1, policy_version 44232 (0.0008) [2023-10-08 01:40:08,590][52059] Updated weights for policy 1, policy_version 44242 (0.0007) [2023-10-08 01:40:08,959][52059] Updated weights for policy 1, policy_version 44252 (0.0007) [2023-10-08 01:40:10,504][52060] Updated weights for policy 0, policy_version 43690 (0.0010) [2023-10-08 01:40:10,868][52060] Updated weights for policy 0, policy_version 43700 (0.0011) [2023-10-08 01:40:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 90046464. Throughput: 0: 1729.4, 1: 1732.3. Samples: 22524104. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:40:11,211][50642] Avg episode reward: [(0, '19.290'), (1, '22.730')] [2023-10-08 01:40:11,248][52060] Updated weights for policy 0, policy_version 43710 (0.0010) [2023-10-08 01:40:13,015][52059] Updated weights for policy 1, policy_version 44262 (0.0010) [2023-10-08 01:40:13,380][52059] Updated weights for policy 1, policy_version 44272 (0.0007) [2023-10-08 01:40:13,745][52059] Updated weights for policy 1, policy_version 44282 (0.0009) [2023-10-08 01:40:15,265][52060] Updated weights for policy 0, policy_version 43720 (0.0008) [2023-10-08 01:40:15,637][52060] Updated weights for policy 0, policy_version 43730 (0.0008) [2023-10-08 01:40:16,001][52060] Updated weights for policy 0, policy_version 43740 (0.0008) [2023-10-08 01:40:16,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 90144768. Throughput: 0: 1703.5, 1: 1735.0. Samples: 22544164. Policy #0 lag: (min: 8.0, avg: 26.8, max: 40.0) [2023-10-08 01:40:16,211][50642] Avg episode reward: [(0, '17.910'), (1, '20.640')] [2023-10-08 01:40:17,728][52059] Updated weights for policy 1, policy_version 44292 (0.0010) [2023-10-08 01:40:18,095][52059] Updated weights for policy 1, policy_version 44302 (0.0008) [2023-10-08 01:40:18,467][52059] Updated weights for policy 1, policy_version 44312 (0.0008) [2023-10-08 01:40:19,998][52060] Updated weights for policy 0, policy_version 43750 (0.0007) [2023-10-08 01:40:20,367][52060] Updated weights for policy 0, policy_version 43760 (0.0008) [2023-10-08 01:40:20,731][52060] Updated weights for policy 0, policy_version 43770 (0.0008) [2023-10-08 01:40:21,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 90210304. Throughput: 0: 1723.7, 1: 1728.8. Samples: 22554514. Policy #0 lag: (min: 8.0, avg: 26.8, max: 40.0) [2023-10-08 01:40:21,211][50642] Avg episode reward: [(0, '21.570'), (1, '20.360')] [2023-10-08 01:40:22,495][52059] Updated weights for policy 1, policy_version 44322 (0.0007) [2023-10-08 01:40:22,850][52059] Updated weights for policy 1, policy_version 44332 (0.0007) [2023-10-08 01:40:23,215][52059] Updated weights for policy 1, policy_version 44342 (0.0007) [2023-10-08 01:40:23,585][52059] Updated weights for policy 1, policy_version 44352 (0.0007) [2023-10-08 01:40:24,746][52060] Updated weights for policy 0, policy_version 43780 (0.0010) [2023-10-08 01:40:25,111][52060] Updated weights for policy 0, policy_version 43790 (0.0008) [2023-10-08 01:40:25,481][52060] Updated weights for policy 0, policy_version 43800 (0.0008) [2023-10-08 01:40:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 90275840. Throughput: 0: 1717.3, 1: 1725.9. Samples: 22575536. Policy #0 lag: (min: 8.0, avg: 26.8, max: 40.0) [2023-10-08 01:40:26,211][50642] Avg episode reward: [(0, '21.790'), (1, '21.780')] [2023-10-08 01:40:27,593][52059] Updated weights for policy 1, policy_version 44362 (0.0008) [2023-10-08 01:40:27,954][52059] Updated weights for policy 1, policy_version 44372 (0.0007) [2023-10-08 01:40:28,322][52059] Updated weights for policy 1, policy_version 44382 (0.0008) [2023-10-08 01:40:29,505][52060] Updated weights for policy 0, policy_version 43810 (0.0009) [2023-10-08 01:40:29,869][52060] Updated weights for policy 0, policy_version 43820 (0.0010) [2023-10-08 01:40:30,243][52060] Updated weights for policy 0, policy_version 43830 (0.0009) [2023-10-08 01:40:30,608][52060] Updated weights for policy 0, policy_version 43840 (0.0008) [2023-10-08 01:40:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 90341376. Throughput: 0: 1686.8, 1: 1752.3. Samples: 22595746. Policy #0 lag: (min: 8.0, avg: 26.8, max: 40.0) [2023-10-08 01:40:31,211][50642] Avg episode reward: [(0, '17.830'), (1, '19.380')] [2023-10-08 01:40:32,082][52059] Updated weights for policy 1, policy_version 44392 (0.0010) [2023-10-08 01:40:32,451][52059] Updated weights for policy 1, policy_version 44402 (0.0007) [2023-10-08 01:40:32,813][52059] Updated weights for policy 1, policy_version 44412 (0.0009) [2023-10-08 01:40:34,532][52060] Updated weights for policy 0, policy_version 43850 (0.0007) [2023-10-08 01:40:34,899][52060] Updated weights for policy 0, policy_version 43860 (0.0007) [2023-10-08 01:40:35,279][52060] Updated weights for policy 0, policy_version 43870 (0.0010) [2023-10-08 01:40:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 90406912. Throughput: 0: 1719.2, 1: 1718.7. Samples: 22606488. Policy #0 lag: (min: 8.0, avg: 26.8, max: 40.0) [2023-10-08 01:40:36,211][50642] Avg episode reward: [(0, '21.360'), (1, '20.810')] [2023-10-08 01:40:36,782][52059] Updated weights for policy 1, policy_version 44422 (0.0009) [2023-10-08 01:40:37,152][52059] Updated weights for policy 1, policy_version 44432 (0.0010) [2023-10-08 01:40:37,515][52059] Updated weights for policy 1, policy_version 44442 (0.0011) [2023-10-08 01:40:39,216][52060] Updated weights for policy 0, policy_version 43880 (0.0009) [2023-10-08 01:40:39,589][52060] Updated weights for policy 0, policy_version 43890 (0.0007) [2023-10-08 01:40:39,964][52060] Updated weights for policy 0, policy_version 43900 (0.0007) [2023-10-08 01:40:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 90472448. Throughput: 0: 1695.2, 1: 1741.3. Samples: 22626828. Policy #0 lag: (min: 8.0, avg: 26.8, max: 40.0) [2023-10-08 01:40:41,211][50642] Avg episode reward: [(0, '23.280'), (1, '21.010')] [2023-10-08 01:40:41,466][52059] Updated weights for policy 1, policy_version 44452 (0.0009) [2023-10-08 01:40:41,831][52059] Updated weights for policy 1, policy_version 44462 (0.0008) [2023-10-08 01:40:42,199][52059] Updated weights for policy 1, policy_version 44472 (0.0007) [2023-10-08 01:40:44,075][52060] Updated weights for policy 0, policy_version 43910 (0.0008) [2023-10-08 01:40:44,455][52060] Updated weights for policy 0, policy_version 43920 (0.0007) [2023-10-08 01:40:44,827][52060] Updated weights for policy 0, policy_version 43930 (0.0008) [2023-10-08 01:40:45,945][52059] Updated weights for policy 1, policy_version 44482 (0.0007) [2023-10-08 01:40:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 90537984. Throughput: 0: 1689.9, 1: 1748.1. Samples: 22647616. Policy #0 lag: (min: 31.0, avg: 34.1, max: 63.0) [2023-10-08 01:40:46,211][50642] Avg episode reward: [(0, '18.080'), (1, '21.230')] [2023-10-08 01:40:46,223][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000043936_44990464.pth... [2023-10-08 01:40:46,262][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000042336_43352064.pth [2023-10-08 01:40:46,364][52059] Updated weights for policy 1, policy_version 44492 (0.0007) [2023-10-08 01:40:46,735][52059] Updated weights for policy 1, policy_version 44502 (0.0007) [2023-10-08 01:40:47,096][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000044512_45580288.pth... [2023-10-08 01:40:47,100][52059] Updated weights for policy 1, policy_version 44512 (0.0007) [2023-10-08 01:40:47,137][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000042880_43909120.pth [2023-10-08 01:40:48,682][52060] Updated weights for policy 0, policy_version 43940 (0.0007) [2023-10-08 01:40:49,047][52060] Updated weights for policy 0, policy_version 43950 (0.0008) [2023-10-08 01:40:49,415][52060] Updated weights for policy 0, policy_version 43960 (0.0010) [2023-10-08 01:40:51,030][52059] Updated weights for policy 1, policy_version 44522 (0.0009) [2023-10-08 01:40:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 90603520. Throughput: 0: 1704.2, 1: 1726.9. Samples: 22657862. Policy #0 lag: (min: 31.0, avg: 34.1, max: 63.0) [2023-10-08 01:40:51,211][50642] Avg episode reward: [(0, '18.430'), (1, '21.830')] [2023-10-08 01:40:51,384][52059] Updated weights for policy 1, policy_version 44532 (0.0009) [2023-10-08 01:40:51,762][52059] Updated weights for policy 1, policy_version 44542 (0.0008) [2023-10-08 01:40:53,432][52060] Updated weights for policy 0, policy_version 43970 (0.0008) [2023-10-08 01:40:53,804][52060] Updated weights for policy 0, policy_version 43980 (0.0008) [2023-10-08 01:40:54,174][52060] Updated weights for policy 0, policy_version 43990 (0.0008) [2023-10-08 01:40:54,547][52060] Updated weights for policy 0, policy_version 44000 (0.0010) [2023-10-08 01:40:55,774][52059] Updated weights for policy 1, policy_version 44552 (0.0007) [2023-10-08 01:40:56,148][52059] Updated weights for policy 1, policy_version 44562 (0.0007) [2023-10-08 01:40:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 90669056. Throughput: 0: 1682.0, 1: 1740.9. Samples: 22678134. Policy #0 lag: (min: 31.0, avg: 34.1, max: 63.0) [2023-10-08 01:40:56,211][50642] Avg episode reward: [(0, '23.360'), (1, '20.460')] [2023-10-08 01:40:56,511][52059] Updated weights for policy 1, policy_version 44572 (0.0007) [2023-10-08 01:40:58,694][52060] Updated weights for policy 0, policy_version 44010 (0.0011) [2023-10-08 01:40:59,063][52060] Updated weights for policy 0, policy_version 44020 (0.0010) [2023-10-08 01:40:59,442][52060] Updated weights for policy 0, policy_version 44030 (0.0010) [2023-10-08 01:41:00,323][52059] Updated weights for policy 1, policy_version 44582 (0.0008) [2023-10-08 01:41:00,679][52059] Updated weights for policy 1, policy_version 44592 (0.0009) [2023-10-08 01:41:01,040][52059] Updated weights for policy 1, policy_version 44602 (0.0010) [2023-10-08 01:41:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 90734592. Throughput: 0: 1697.7, 1: 1726.7. Samples: 22698264. Policy #0 lag: (min: 31.0, avg: 34.1, max: 63.0) [2023-10-08 01:41:01,211][50642] Avg episode reward: [(0, '17.960'), (1, '24.000')] [2023-10-08 01:41:03,284][52060] Updated weights for policy 0, policy_version 44040 (0.0011) [2023-10-08 01:41:03,650][52060] Updated weights for policy 0, policy_version 44050 (0.0007) [2023-10-08 01:41:04,019][52060] Updated weights for policy 0, policy_version 44060 (0.0008) [2023-10-08 01:41:04,993][52059] Updated weights for policy 1, policy_version 44612 (0.0008) [2023-10-08 01:41:05,349][52059] Updated weights for policy 1, policy_version 44622 (0.0008) [2023-10-08 01:41:05,718][52059] Updated weights for policy 1, policy_version 44632 (0.0010) [2023-10-08 01:41:06,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 90832896. Throughput: 0: 1689.6, 1: 1747.7. Samples: 22709196. Policy #0 lag: (min: 31.0, avg: 34.1, max: 63.0) [2023-10-08 01:41:06,211][50642] Avg episode reward: [(0, '17.830'), (1, '21.210')] [2023-10-08 01:41:07,947][52060] Updated weights for policy 0, policy_version 44070 (0.0007) [2023-10-08 01:41:08,323][52060] Updated weights for policy 0, policy_version 44080 (0.0008) [2023-10-08 01:41:08,686][52060] Updated weights for policy 0, policy_version 44090 (0.0011) [2023-10-08 01:41:09,657][52059] Updated weights for policy 1, policy_version 44642 (0.0009) [2023-10-08 01:41:10,026][52059] Updated weights for policy 1, policy_version 44652 (0.0008) [2023-10-08 01:41:10,390][52059] Updated weights for policy 1, policy_version 44662 (0.0009) [2023-10-08 01:41:10,763][52059] Updated weights for policy 1, policy_version 44672 (0.0008) [2023-10-08 01:41:11,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 90898432. Throughput: 0: 1690.7, 1: 1738.4. Samples: 22729846. Policy #0 lag: (min: 31.0, avg: 34.1, max: 63.0) [2023-10-08 01:41:11,211][50642] Avg episode reward: [(0, '21.290'), (1, '19.920')] [2023-10-08 01:41:12,773][52060] Updated weights for policy 0, policy_version 44100 (0.0010) [2023-10-08 01:41:13,140][52060] Updated weights for policy 0, policy_version 44110 (0.0008) [2023-10-08 01:41:13,509][52060] Updated weights for policy 0, policy_version 44120 (0.0007) [2023-10-08 01:41:14,724][52059] Updated weights for policy 1, policy_version 44682 (0.0010) [2023-10-08 01:41:15,092][52059] Updated weights for policy 1, policy_version 44692 (0.0010) [2023-10-08 01:41:15,445][52059] Updated weights for policy 1, policy_version 44702 (0.0009) [2023-10-08 01:41:16,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13884.7). Total num frames: 90963968. Throughput: 0: 1716.5, 1: 1713.4. Samples: 22750094. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:41:16,211][50642] Avg episode reward: [(0, '18.010'), (1, '23.510')] [2023-10-08 01:41:17,441][52060] Updated weights for policy 0, policy_version 44130 (0.0008) [2023-10-08 01:41:17,806][52060] Updated weights for policy 0, policy_version 44140 (0.0007) [2023-10-08 01:41:18,174][52060] Updated weights for policy 0, policy_version 44150 (0.0008) [2023-10-08 01:41:18,540][52060] Updated weights for policy 0, policy_version 44160 (0.0007) [2023-10-08 01:41:19,380][52059] Updated weights for policy 1, policy_version 44712 (0.0011) [2023-10-08 01:41:19,738][52059] Updated weights for policy 1, policy_version 44722 (0.0007) [2023-10-08 01:41:20,106][52059] Updated weights for policy 1, policy_version 44732 (0.0008) [2023-10-08 01:41:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 91029504. Throughput: 0: 1687.1, 1: 1746.1. Samples: 22760982. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:41:21,211][50642] Avg episode reward: [(0, '17.780'), (1, '20.640')] [2023-10-08 01:41:22,626][52060] Updated weights for policy 0, policy_version 44170 (0.0008) [2023-10-08 01:41:22,989][52060] Updated weights for policy 0, policy_version 44180 (0.0008) [2023-10-08 01:41:23,361][52060] Updated weights for policy 0, policy_version 44190 (0.0007) [2023-10-08 01:41:24,096][52059] Updated weights for policy 1, policy_version 44742 (0.0007) [2023-10-08 01:41:24,467][52059] Updated weights for policy 1, policy_version 44752 (0.0007) [2023-10-08 01:41:24,833][52059] Updated weights for policy 1, policy_version 44762 (0.0007) [2023-10-08 01:41:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 91095040. Throughput: 0: 1705.2, 1: 1720.9. Samples: 22781002. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:41:26,211][50642] Avg episode reward: [(0, '21.150'), (1, '18.630')] [2023-10-08 01:41:27,428][52060] Updated weights for policy 0, policy_version 44200 (0.0008) [2023-10-08 01:41:27,788][52060] Updated weights for policy 0, policy_version 44210 (0.0008) [2023-10-08 01:41:28,165][52060] Updated weights for policy 0, policy_version 44220 (0.0008) [2023-10-08 01:41:28,837][52059] Updated weights for policy 1, policy_version 44772 (0.0008) [2023-10-08 01:41:29,194][52059] Updated weights for policy 1, policy_version 44782 (0.0009) [2023-10-08 01:41:29,560][52059] Updated weights for policy 1, policy_version 44792 (0.0007) [2023-10-08 01:41:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 91160576. Throughput: 0: 1714.1, 1: 1711.7. Samples: 22801776. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:41:31,211][50642] Avg episode reward: [(0, '20.060'), (1, '21.450')] [2023-10-08 01:41:32,235][52060] Updated weights for policy 0, policy_version 44230 (0.0009) [2023-10-08 01:41:32,624][52060] Updated weights for policy 0, policy_version 44240 (0.0008) [2023-10-08 01:41:32,997][52060] Updated weights for policy 0, policy_version 44250 (0.0009) [2023-10-08 01:41:33,353][52059] Updated weights for policy 1, policy_version 44802 (0.0007) [2023-10-08 01:41:33,774][52059] Updated weights for policy 1, policy_version 44812 (0.0009) [2023-10-08 01:41:34,136][52059] Updated weights for policy 1, policy_version 44822 (0.0009) [2023-10-08 01:41:34,493][52059] Updated weights for policy 1, policy_version 44832 (0.0010) [2023-10-08 01:41:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 91226112. Throughput: 0: 1683.6, 1: 1731.3. Samples: 22811532. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:41:36,211][50642] Avg episode reward: [(0, '19.690'), (1, '21.640')] [2023-10-08 01:41:36,909][52060] Updated weights for policy 0, policy_version 44260 (0.0008) [2023-10-08 01:41:37,276][52060] Updated weights for policy 0, policy_version 44270 (0.0009) [2023-10-08 01:41:37,639][52060] Updated weights for policy 0, policy_version 44280 (0.0007) [2023-10-08 01:41:38,447][52059] Updated weights for policy 1, policy_version 44842 (0.0010) [2023-10-08 01:41:38,813][52059] Updated weights for policy 1, policy_version 44852 (0.0010) [2023-10-08 01:41:39,175][52059] Updated weights for policy 1, policy_version 44862 (0.0010) [2023-10-08 01:41:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 91291648. Throughput: 0: 1715.2, 1: 1712.8. Samples: 22832392. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:41:41,211][50642] Avg episode reward: [(0, '21.880'), (1, '18.550')] [2023-10-08 01:41:41,511][52060] Updated weights for policy 0, policy_version 44290 (0.0009) [2023-10-08 01:41:41,883][52060] Updated weights for policy 0, policy_version 44300 (0.0010) [2023-10-08 01:41:42,246][52060] Updated weights for policy 0, policy_version 44310 (0.0009) [2023-10-08 01:41:42,608][52060] Updated weights for policy 0, policy_version 44320 (0.0007) [2023-10-08 01:41:42,953][52059] Updated weights for policy 1, policy_version 44872 (0.0009) [2023-10-08 01:41:43,330][52059] Updated weights for policy 1, policy_version 44882 (0.0008) [2023-10-08 01:41:43,687][52059] Updated weights for policy 1, policy_version 44892 (0.0009) [2023-10-08 01:41:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 91357184. Throughput: 0: 1723.1, 1: 1731.9. Samples: 22853736. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:41:46,211][50642] Avg episode reward: [(0, '20.640'), (1, '20.610')] [2023-10-08 01:41:46,612][52060] Updated weights for policy 0, policy_version 44330 (0.0010) [2023-10-08 01:41:46,978][52060] Updated weights for policy 0, policy_version 44340 (0.0011) [2023-10-08 01:41:47,343][52060] Updated weights for policy 0, policy_version 44350 (0.0010) [2023-10-08 01:41:47,610][52059] Updated weights for policy 1, policy_version 44902 (0.0009) [2023-10-08 01:41:47,972][52059] Updated weights for policy 1, policy_version 44912 (0.0009) [2023-10-08 01:41:48,334][52059] Updated weights for policy 1, policy_version 44922 (0.0010) [2023-10-08 01:41:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 91422720. Throughput: 0: 1708.9, 1: 1712.7. Samples: 22863166. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) [2023-10-08 01:41:51,211][50642] Avg episode reward: [(0, '19.920'), (1, '23.050')] [2023-10-08 01:41:51,338][52060] Updated weights for policy 0, policy_version 44360 (0.0007) [2023-10-08 01:41:51,701][52060] Updated weights for policy 0, policy_version 44370 (0.0010) [2023-10-08 01:41:52,078][52060] Updated weights for policy 0, policy_version 44380 (0.0008) [2023-10-08 01:41:52,394][52059] Updated weights for policy 1, policy_version 44932 (0.0010) [2023-10-08 01:41:52,762][52059] Updated weights for policy 1, policy_version 44942 (0.0009) [2023-10-08 01:41:53,122][52059] Updated weights for policy 1, policy_version 44952 (0.0007) [2023-10-08 01:41:55,936][52060] Updated weights for policy 0, policy_version 44390 (0.0009) [2023-10-08 01:41:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 91488256. Throughput: 0: 1722.1, 1: 1718.9. Samples: 22884690. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) [2023-10-08 01:41:56,211][50642] Avg episode reward: [(0, '22.070'), (1, '17.680')] [2023-10-08 01:41:56,297][52060] Updated weights for policy 0, policy_version 44400 (0.0009) [2023-10-08 01:41:56,665][52060] Updated weights for policy 0, policy_version 44410 (0.0009) [2023-10-08 01:41:56,889][52059] Updated weights for policy 1, policy_version 44962 (0.0007) [2023-10-08 01:41:57,256][52059] Updated weights for policy 1, policy_version 44972 (0.0007) [2023-10-08 01:41:57,621][52059] Updated weights for policy 1, policy_version 44982 (0.0007) [2023-10-08 01:41:57,977][52059] Updated weights for policy 1, policy_version 44992 (0.0008) [2023-10-08 01:42:00,641][52060] Updated weights for policy 0, policy_version 44420 (0.0008) [2023-10-08 01:42:01,001][52060] Updated weights for policy 0, policy_version 44430 (0.0008) [2023-10-08 01:42:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 91553792. Throughput: 0: 1713.0, 1: 1745.2. Samples: 22905714. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) [2023-10-08 01:42:01,211][50642] Avg episode reward: [(0, '22.880'), (1, '20.470')] [2023-10-08 01:42:01,368][52060] Updated weights for policy 0, policy_version 44440 (0.0007) [2023-10-08 01:42:01,964][52059] Updated weights for policy 1, policy_version 45002 (0.0011) [2023-10-08 01:42:02,339][52059] Updated weights for policy 1, policy_version 45012 (0.0007) [2023-10-08 01:42:02,714][52059] Updated weights for policy 1, policy_version 45022 (0.0010) [2023-10-08 01:42:05,464][52060] Updated weights for policy 0, policy_version 44450 (0.0008) [2023-10-08 01:42:05,830][52060] Updated weights for policy 0, policy_version 44460 (0.0007) [2023-10-08 01:42:06,204][52060] Updated weights for policy 0, policy_version 44470 (0.0008) [2023-10-08 01:42:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 91619328. Throughput: 0: 1720.8, 1: 1711.9. Samples: 22915452. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) [2023-10-08 01:42:06,211][50642] Avg episode reward: [(0, '19.340'), (1, '22.830')] [2023-10-08 01:42:06,567][52060] Updated weights for policy 0, policy_version 44480 (0.0007) [2023-10-08 01:42:06,696][52059] Updated weights for policy 1, policy_version 45032 (0.0008) [2023-10-08 01:42:07,065][52059] Updated weights for policy 1, policy_version 45042 (0.0007) [2023-10-08 01:42:07,433][52059] Updated weights for policy 1, policy_version 45052 (0.0007) [2023-10-08 01:42:10,442][52060] Updated weights for policy 0, policy_version 44490 (0.0007) [2023-10-08 01:42:10,804][52060] Updated weights for policy 0, policy_version 44500 (0.0009) [2023-10-08 01:42:11,173][52060] Updated weights for policy 0, policy_version 44510 (0.0007) [2023-10-08 01:42:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 91684864. Throughput: 0: 1723.3, 1: 1735.6. Samples: 22936656. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) [2023-10-08 01:42:11,211][50642] Avg episode reward: [(0, '20.330'), (1, '19.250')] [2023-10-08 01:42:11,465][52059] Updated weights for policy 1, policy_version 45062 (0.0008) [2023-10-08 01:42:11,828][52059] Updated weights for policy 1, policy_version 45072 (0.0009) [2023-10-08 01:42:12,193][52059] Updated weights for policy 1, policy_version 45082 (0.0007) [2023-10-08 01:42:15,213][52060] Updated weights for policy 0, policy_version 44520 (0.0008) [2023-10-08 01:42:15,587][52060] Updated weights for policy 0, policy_version 44530 (0.0009) [2023-10-08 01:42:15,953][52060] Updated weights for policy 0, policy_version 44540 (0.0007) [2023-10-08 01:42:16,095][52059] Updated weights for policy 1, policy_version 45092 (0.0008) [2023-10-08 01:42:16,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 91783168. Throughput: 0: 1706.2, 1: 1749.3. Samples: 22957272. Policy #0 lag: (min: 31.0, avg: 31.2, max: 41.0) [2023-10-08 01:42:16,211][50642] Avg episode reward: [(0, '21.280'), (1, '18.300')] [2023-10-08 01:42:16,459][52059] Updated weights for policy 1, policy_version 45102 (0.0009) [2023-10-08 01:42:16,830][52059] Updated weights for policy 1, policy_version 45112 (0.0008) [2023-10-08 01:42:19,826][52060] Updated weights for policy 0, policy_version 44550 (0.0007) [2023-10-08 01:42:20,201][52060] Updated weights for policy 0, policy_version 44560 (0.0007) [2023-10-08 01:42:20,569][52060] Updated weights for policy 0, policy_version 44570 (0.0007) [2023-10-08 01:42:20,799][52059] Updated weights for policy 1, policy_version 45122 (0.0008) [2023-10-08 01:42:21,180][52059] Updated weights for policy 1, policy_version 45132 (0.0008) [2023-10-08 01:42:21,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 91848704. Throughput: 0: 1740.4, 1: 1731.9. Samples: 22967786. Policy #0 lag: (min: 16.0, avg: 34.8, max: 48.0) [2023-10-08 01:42:21,211][50642] Avg episode reward: [(0, '19.470'), (1, '22.280')] [2023-10-08 01:42:21,540][52059] Updated weights for policy 1, policy_version 45142 (0.0007) [2023-10-08 01:42:21,899][52059] Updated weights for policy 1, policy_version 45152 (0.0007) [2023-10-08 01:42:24,480][52060] Updated weights for policy 0, policy_version 44580 (0.0008) [2023-10-08 01:42:24,857][52060] Updated weights for policy 0, policy_version 44590 (0.0011) [2023-10-08 01:42:25,210][52060] Updated weights for policy 0, policy_version 44600 (0.0010) [2023-10-08 01:42:25,879][52059] Updated weights for policy 1, policy_version 45162 (0.0008) [2023-10-08 01:42:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 91914240. Throughput: 0: 1722.1, 1: 1747.2. Samples: 22988510. Policy #0 lag: (min: 16.0, avg: 34.8, max: 48.0) [2023-10-08 01:42:26,211][50642] Avg episode reward: [(0, '20.210'), (1, '22.290')] [2023-10-08 01:42:26,246][52059] Updated weights for policy 1, policy_version 45172 (0.0010) [2023-10-08 01:42:26,611][52059] Updated weights for policy 1, policy_version 45182 (0.0009) [2023-10-08 01:42:28,979][52060] Updated weights for policy 0, policy_version 44610 (0.0010) [2023-10-08 01:42:29,344][52060] Updated weights for policy 0, policy_version 44620 (0.0010) [2023-10-08 01:42:29,715][52060] Updated weights for policy 0, policy_version 44630 (0.0008) [2023-10-08 01:42:30,077][52060] Updated weights for policy 0, policy_version 44640 (0.0009) [2023-10-08 01:42:30,585][52059] Updated weights for policy 1, policy_version 45192 (0.0007) [2023-10-08 01:42:30,947][52059] Updated weights for policy 1, policy_version 45202 (0.0009) [2023-10-08 01:42:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 91979776. Throughput: 0: 1708.0, 1: 1733.2. Samples: 23008590. Policy #0 lag: (min: 16.0, avg: 34.8, max: 48.0) [2023-10-08 01:42:31,211][50642] Avg episode reward: [(0, '22.020'), (1, '18.490')] [2023-10-08 01:42:31,317][52059] Updated weights for policy 1, policy_version 45212 (0.0008) [2023-10-08 01:42:34,049][52060] Updated weights for policy 0, policy_version 44650 (0.0011) [2023-10-08 01:42:34,423][52060] Updated weights for policy 0, policy_version 44660 (0.0009) [2023-10-08 01:42:34,794][52060] Updated weights for policy 0, policy_version 44670 (0.0007) [2023-10-08 01:42:35,098][52059] Updated weights for policy 1, policy_version 45222 (0.0008) [2023-10-08 01:42:35,467][52059] Updated weights for policy 1, policy_version 45232 (0.0008) [2023-10-08 01:42:35,842][52059] Updated weights for policy 1, policy_version 45242 (0.0009) [2023-10-08 01:42:36,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 92078080. Throughput: 0: 1739.2, 1: 1746.4. Samples: 23020014. Policy #0 lag: (min: 16.0, avg: 34.8, max: 48.0) [2023-10-08 01:42:36,211][50642] Avg episode reward: [(0, '19.920'), (1, '20.560')] [2023-10-08 01:42:38,736][52060] Updated weights for policy 0, policy_version 44680 (0.0008) [2023-10-08 01:42:39,119][52060] Updated weights for policy 0, policy_version 44690 (0.0008) [2023-10-08 01:42:39,489][52060] Updated weights for policy 0, policy_version 44700 (0.0008) [2023-10-08 01:42:39,672][52059] Updated weights for policy 1, policy_version 45252 (0.0008) [2023-10-08 01:42:40,039][52059] Updated weights for policy 1, policy_version 45262 (0.0009) [2023-10-08 01:42:40,406][52059] Updated weights for policy 1, policy_version 45272 (0.0009) [2023-10-08 01:42:41,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 92143616. Throughput: 0: 1712.2, 1: 1741.0. Samples: 23040086. Policy #0 lag: (min: 16.0, avg: 34.8, max: 48.0) [2023-10-08 01:42:41,211][50642] Avg episode reward: [(0, '20.240'), (1, '23.540')] [2023-10-08 01:42:43,434][52060] Updated weights for policy 0, policy_version 44710 (0.0008) [2023-10-08 01:42:43,802][52060] Updated weights for policy 0, policy_version 44720 (0.0008) [2023-10-08 01:42:44,162][52060] Updated weights for policy 0, policy_version 44730 (0.0009) [2023-10-08 01:42:44,398][52059] Updated weights for policy 1, policy_version 45282 (0.0008) [2023-10-08 01:42:44,768][52059] Updated weights for policy 1, policy_version 45292 (0.0009) [2023-10-08 01:42:45,126][52059] Updated weights for policy 1, policy_version 45302 (0.0008) [2023-10-08 01:42:45,495][52059] Updated weights for policy 1, policy_version 45312 (0.0009) [2023-10-08 01:42:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 92209152. Throughput: 0: 1721.1, 1: 1715.4. Samples: 23060356. Policy #0 lag: (min: 16.0, avg: 34.8, max: 48.0) [2023-10-08 01:42:46,211][50642] Avg episode reward: [(0, '21.780'), (1, '20.460')] [2023-10-08 01:42:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000044736_45809664.pth... [2023-10-08 01:42:46,220][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000045312_46399488.pth... [2023-10-08 01:42:46,259][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000043136_44171264.pth [2023-10-08 01:42:46,259][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000043680_44728320.pth [2023-10-08 01:42:48,069][52060] Updated weights for policy 0, policy_version 44740 (0.0009) [2023-10-08 01:42:48,450][52060] Updated weights for policy 0, policy_version 44750 (0.0008) [2023-10-08 01:42:48,818][52060] Updated weights for policy 0, policy_version 44760 (0.0010) [2023-10-08 01:42:49,448][52059] Updated weights for policy 1, policy_version 45322 (0.0010) [2023-10-08 01:42:49,819][52059] Updated weights for policy 1, policy_version 45332 (0.0008) [2023-10-08 01:42:50,182][52059] Updated weights for policy 1, policy_version 45342 (0.0009) [2023-10-08 01:42:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 92274688. Throughput: 0: 1719.2, 1: 1743.7. Samples: 23071284. Policy #0 lag: (min: 16.0, avg: 34.8, max: 48.0) [2023-10-08 01:42:51,211][50642] Avg episode reward: [(0, '20.640'), (1, '18.890')] [2023-10-08 01:42:52,783][52060] Updated weights for policy 0, policy_version 44770 (0.0009) [2023-10-08 01:42:53,144][52060] Updated weights for policy 0, policy_version 44780 (0.0008) [2023-10-08 01:42:53,510][52060] Updated weights for policy 0, policy_version 44790 (0.0007) [2023-10-08 01:42:53,884][52060] Updated weights for policy 0, policy_version 44800 (0.0010) [2023-10-08 01:42:54,072][52059] Updated weights for policy 1, policy_version 45352 (0.0009) [2023-10-08 01:42:54,435][52059] Updated weights for policy 1, policy_version 45362 (0.0007) [2023-10-08 01:42:54,790][52059] Updated weights for policy 1, policy_version 45372 (0.0007) [2023-10-08 01:42:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 92340224. Throughput: 0: 1709.4, 1: 1724.9. Samples: 23091198. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:42:56,211][50642] Avg episode reward: [(0, '19.820'), (1, '23.410')] [2023-10-08 01:42:57,829][52060] Updated weights for policy 0, policy_version 44810 (0.0009) [2023-10-08 01:42:58,197][52060] Updated weights for policy 0, policy_version 44820 (0.0009) [2023-10-08 01:42:58,561][52060] Updated weights for policy 0, policy_version 44830 (0.0009) [2023-10-08 01:42:58,697][52059] Updated weights for policy 1, policy_version 45382 (0.0008) [2023-10-08 01:42:59,068][52059] Updated weights for policy 1, policy_version 45392 (0.0007) [2023-10-08 01:42:59,432][52059] Updated weights for policy 1, policy_version 45402 (0.0009) [2023-10-08 01:43:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 92405760. Throughput: 0: 1733.6, 1: 1717.3. Samples: 23112564. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:43:01,211][50642] Avg episode reward: [(0, '20.760'), (1, '19.040')] [2023-10-08 01:43:02,456][52060] Updated weights for policy 0, policy_version 44840 (0.0009) [2023-10-08 01:43:02,824][52060] Updated weights for policy 0, policy_version 44850 (0.0008) [2023-10-08 01:43:03,182][52060] Updated weights for policy 0, policy_version 44860 (0.0008) [2023-10-08 01:43:03,320][52059] Updated weights for policy 1, policy_version 45412 (0.0009) [2023-10-08 01:43:03,683][52059] Updated weights for policy 1, policy_version 45422 (0.0010) [2023-10-08 01:43:04,049][52059] Updated weights for policy 1, policy_version 45432 (0.0010) [2023-10-08 01:43:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 92471296. Throughput: 0: 1707.7, 1: 1730.9. Samples: 23122520. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:43:06,211][50642] Avg episode reward: [(0, '20.540'), (1, '19.030')] [2023-10-08 01:43:07,071][52060] Updated weights for policy 0, policy_version 44870 (0.0008) [2023-10-08 01:43:07,447][52060] Updated weights for policy 0, policy_version 44880 (0.0007) [2023-10-08 01:43:07,808][52060] Updated weights for policy 0, policy_version 44890 (0.0008) [2023-10-08 01:43:07,975][52059] Updated weights for policy 1, policy_version 45442 (0.0010) [2023-10-08 01:43:08,364][52059] Updated weights for policy 1, policy_version 45452 (0.0007) [2023-10-08 01:43:08,734][52059] Updated weights for policy 1, policy_version 45462 (0.0009) [2023-10-08 01:43:09,102][52059] Updated weights for policy 1, policy_version 45472 (0.0009) [2023-10-08 01:43:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 92536832. Throughput: 0: 1723.0, 1: 1719.2. Samples: 23143410. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:43:11,211][50642] Avg episode reward: [(0, '20.090'), (1, '20.900')] [2023-10-08 01:43:11,727][52060] Updated weights for policy 0, policy_version 44900 (0.0009) [2023-10-08 01:43:12,101][52060] Updated weights for policy 0, policy_version 44910 (0.0010) [2023-10-08 01:43:12,465][52060] Updated weights for policy 0, policy_version 44920 (0.0011) [2023-10-08 01:43:13,066][52059] Updated weights for policy 1, policy_version 45482 (0.0007) [2023-10-08 01:43:13,429][52059] Updated weights for policy 1, policy_version 45492 (0.0007) [2023-10-08 01:43:13,796][52059] Updated weights for policy 1, policy_version 45502 (0.0008) [2023-10-08 01:43:16,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 92602368. Throughput: 0: 1736.9, 1: 1728.8. Samples: 23164550. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:43:16,211][50642] Avg episode reward: [(0, '21.760'), (1, '22.960')] [2023-10-08 01:43:16,502][52060] Updated weights for policy 0, policy_version 44930 (0.0007) [2023-10-08 01:43:16,866][52060] Updated weights for policy 0, policy_version 44940 (0.0009) [2023-10-08 01:43:17,231][52060] Updated weights for policy 0, policy_version 44950 (0.0007) [2023-10-08 01:43:17,596][52060] Updated weights for policy 0, policy_version 44960 (0.0009) [2023-10-08 01:43:17,649][52059] Updated weights for policy 1, policy_version 45512 (0.0007) [2023-10-08 01:43:18,002][52059] Updated weights for policy 1, policy_version 45522 (0.0008) [2023-10-08 01:43:18,371][52059] Updated weights for policy 1, policy_version 45532 (0.0008) [2023-10-08 01:43:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 92667904. Throughput: 0: 1707.5, 1: 1716.0. Samples: 23174070. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:43:21,211][50642] Avg episode reward: [(0, '21.740'), (1, '19.150')] [2023-10-08 01:43:21,513][52060] Updated weights for policy 0, policy_version 44970 (0.0008) [2023-10-08 01:43:21,882][52060] Updated weights for policy 0, policy_version 44980 (0.0008) [2023-10-08 01:43:22,179][52059] Updated weights for policy 1, policy_version 45542 (0.0008) [2023-10-08 01:43:22,256][52060] Updated weights for policy 0, policy_version 44990 (0.0009) [2023-10-08 01:43:22,540][52059] Updated weights for policy 1, policy_version 45552 (0.0008) [2023-10-08 01:43:22,904][52059] Updated weights for policy 1, policy_version 45562 (0.0008) [2023-10-08 01:43:26,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 92733440. Throughput: 0: 1731.0, 1: 1722.8. Samples: 23195508. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:43:26,211][50642] Avg episode reward: [(0, '19.130'), (1, '20.870')] [2023-10-08 01:43:26,348][52060] Updated weights for policy 0, policy_version 45000 (0.0010) [2023-10-08 01:43:26,726][52060] Updated weights for policy 0, policy_version 45010 (0.0007) [2023-10-08 01:43:27,053][52059] Updated weights for policy 1, policy_version 45572 (0.0007) [2023-10-08 01:43:27,086][52060] Updated weights for policy 0, policy_version 45020 (0.0007) [2023-10-08 01:43:27,418][52059] Updated weights for policy 1, policy_version 45582 (0.0007) [2023-10-08 01:43:27,774][52059] Updated weights for policy 1, policy_version 45592 (0.0009) [2023-10-08 01:43:31,050][52060] Updated weights for policy 0, policy_version 45030 (0.0008) [2023-10-08 01:43:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 92798976. Throughput: 0: 1728.4, 1: 1750.0. Samples: 23216886. Policy #0 lag: (min: 31.0, avg: 31.3, max: 43.0) [2023-10-08 01:43:31,211][50642] Avg episode reward: [(0, '19.960'), (1, '21.980')] [2023-10-08 01:43:31,411][52060] Updated weights for policy 0, policy_version 45040 (0.0007) [2023-10-08 01:43:31,685][52059] Updated weights for policy 1, policy_version 45602 (0.0010) [2023-10-08 01:43:31,776][52060] Updated weights for policy 0, policy_version 45050 (0.0008) [2023-10-08 01:43:32,048][52059] Updated weights for policy 1, policy_version 45612 (0.0008) [2023-10-08 01:43:32,412][52059] Updated weights for policy 1, policy_version 45622 (0.0009) [2023-10-08 01:43:32,778][52059] Updated weights for policy 1, policy_version 45632 (0.0007) [2023-10-08 01:43:35,787][52060] Updated weights for policy 0, policy_version 45060 (0.0008) [2023-10-08 01:43:36,156][52060] Updated weights for policy 0, policy_version 45070 (0.0008) [2023-10-08 01:43:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 92864512. Throughput: 0: 1723.6, 1: 1721.0. Samples: 23226290. Policy #0 lag: (min: 31.0, avg: 31.3, max: 43.0) [2023-10-08 01:43:36,211][50642] Avg episode reward: [(0, '21.330'), (1, '18.940')] [2023-10-08 01:43:36,527][52060] Updated weights for policy 0, policy_version 45080 (0.0007) [2023-10-08 01:43:36,600][52059] Updated weights for policy 1, policy_version 45642 (0.0008) [2023-10-08 01:43:36,967][52059] Updated weights for policy 1, policy_version 45652 (0.0009) [2023-10-08 01:43:37,338][52059] Updated weights for policy 1, policy_version 45662 (0.0008) [2023-10-08 01:43:40,443][52060] Updated weights for policy 0, policy_version 45090 (0.0008) [2023-10-08 01:43:40,816][52060] Updated weights for policy 0, policy_version 45100 (0.0009) [2023-10-08 01:43:41,173][52060] Updated weights for policy 0, policy_version 45110 (0.0008) [2023-10-08 01:43:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 92930048. Throughput: 0: 1732.1, 1: 1746.2. Samples: 23247724. Policy #0 lag: (min: 31.0, avg: 31.3, max: 43.0) [2023-10-08 01:43:41,211][50642] Avg episode reward: [(0, '19.970'), (1, '19.700')] [2023-10-08 01:43:41,311][52059] Updated weights for policy 1, policy_version 45672 (0.0009) [2023-10-08 01:43:41,548][52060] Updated weights for policy 0, policy_version 45120 (0.0009) [2023-10-08 01:43:41,679][52059] Updated weights for policy 1, policy_version 45682 (0.0007) [2023-10-08 01:43:42,045][52059] Updated weights for policy 1, policy_version 45692 (0.0010) [2023-10-08 01:43:45,424][52060] Updated weights for policy 0, policy_version 45130 (0.0009) [2023-10-08 01:43:45,800][52060] Updated weights for policy 0, policy_version 45140 (0.0008) [2023-10-08 01:43:46,048][52059] Updated weights for policy 1, policy_version 45702 (0.0009) [2023-10-08 01:43:46,166][52060] Updated weights for policy 0, policy_version 45150 (0.0008) [2023-10-08 01:43:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 92995584. Throughput: 0: 1711.3, 1: 1742.7. Samples: 23267996. Policy #0 lag: (min: 31.0, avg: 31.3, max: 43.0) [2023-10-08 01:43:46,211][50642] Avg episode reward: [(0, '19.590'), (1, '23.310')] [2023-10-08 01:43:46,416][52059] Updated weights for policy 1, policy_version 45712 (0.0008) [2023-10-08 01:43:46,770][52059] Updated weights for policy 1, policy_version 45722 (0.0009) [2023-10-08 01:43:50,118][52060] Updated weights for policy 0, policy_version 45160 (0.0008) [2023-10-08 01:43:50,495][52060] Updated weights for policy 0, policy_version 45170 (0.0007) [2023-10-08 01:43:50,687][52059] Updated weights for policy 1, policy_version 45732 (0.0008) [2023-10-08 01:43:50,856][52060] Updated weights for policy 0, policy_version 45180 (0.0007) [2023-10-08 01:43:51,046][52059] Updated weights for policy 1, policy_version 45742 (0.0009) [2023-10-08 01:43:51,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 93093888. Throughput: 0: 1726.0, 1: 1732.4. Samples: 23278148. Policy #0 lag: (min: 31.0, avg: 31.3, max: 43.0) [2023-10-08 01:43:51,211][50642] Avg episode reward: [(0, '21.880'), (1, '20.210')] [2023-10-08 01:43:51,415][52059] Updated weights for policy 1, policy_version 45752 (0.0009) [2023-10-08 01:43:54,937][52060] Updated weights for policy 0, policy_version 45190 (0.0007) [2023-10-08 01:43:55,315][52060] Updated weights for policy 0, policy_version 45200 (0.0008) [2023-10-08 01:43:55,332][52059] Updated weights for policy 1, policy_version 45762 (0.0008) [2023-10-08 01:43:55,682][52060] Updated weights for policy 0, policy_version 45210 (0.0007) [2023-10-08 01:43:55,740][52059] Updated weights for policy 1, policy_version 45772 (0.0008) [2023-10-08 01:43:56,095][52059] Updated weights for policy 1, policy_version 45782 (0.0007) [2023-10-08 01:43:56,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 93159424. Throughput: 0: 1716.2, 1: 1745.2. Samples: 23299170. Policy #0 lag: (min: 31.0, avg: 31.3, max: 43.0) [2023-10-08 01:43:56,211][50642] Avg episode reward: [(0, '21.140'), (1, '19.160')] [2023-10-08 01:43:56,461][52059] Updated weights for policy 1, policy_version 45792 (0.0009) [2023-10-08 01:43:59,678][52060] Updated weights for policy 0, policy_version 45220 (0.0007) [2023-10-08 01:44:00,049][52060] Updated weights for policy 0, policy_version 45230 (0.0009) [2023-10-08 01:44:00,278][52059] Updated weights for policy 1, policy_version 45802 (0.0009) [2023-10-08 01:44:00,417][52060] Updated weights for policy 0, policy_version 45240 (0.0010) [2023-10-08 01:44:00,638][52059] Updated weights for policy 1, policy_version 45812 (0.0008) [2023-10-08 01:44:01,004][52059] Updated weights for policy 1, policy_version 45822 (0.0007) [2023-10-08 01:44:01,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 93257728. Throughput: 0: 1685.7, 1: 1723.8. Samples: 23317976. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:44:01,211][50642] Avg episode reward: [(0, '19.810'), (1, '20.640')] [2023-10-08 01:44:04,481][52060] Updated weights for policy 0, policy_version 45250 (0.0007) [2023-10-08 01:44:04,858][52060] Updated weights for policy 0, policy_version 45260 (0.0007) [2023-10-08 01:44:04,918][52059] Updated weights for policy 1, policy_version 45832 (0.0007) [2023-10-08 01:44:05,215][52060] Updated weights for policy 0, policy_version 45270 (0.0007) [2023-10-08 01:44:05,275][52059] Updated weights for policy 1, policy_version 45842 (0.0009) [2023-10-08 01:44:05,581][52060] Updated weights for policy 0, policy_version 45280 (0.0010) [2023-10-08 01:44:05,639][52059] Updated weights for policy 1, policy_version 45852 (0.0009) [2023-10-08 01:44:06,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 93323264. Throughput: 0: 1716.3, 1: 1743.2. Samples: 23329750. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:44:06,211][50642] Avg episode reward: [(0, '22.350'), (1, '23.030')] [2023-10-08 01:44:09,554][52060] Updated weights for policy 0, policy_version 45290 (0.0007) [2023-10-08 01:44:09,629][52059] Updated weights for policy 1, policy_version 45862 (0.0007) [2023-10-08 01:44:09,924][52060] Updated weights for policy 0, policy_version 45300 (0.0007) [2023-10-08 01:44:09,995][52059] Updated weights for policy 1, policy_version 45872 (0.0008) [2023-10-08 01:44:10,301][52060] Updated weights for policy 0, policy_version 45310 (0.0011) [2023-10-08 01:44:10,367][52059] Updated weights for policy 1, policy_version 45882 (0.0007) [2023-10-08 01:44:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 93388800. Throughput: 0: 1700.1, 1: 1735.7. Samples: 23350120. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:44:11,211][50642] Avg episode reward: [(0, '21.220'), (1, '19.360')] [2023-10-08 01:44:14,274][52060] Updated weights for policy 0, policy_version 45320 (0.0010) [2023-10-08 01:44:14,473][52059] Updated weights for policy 1, policy_version 45892 (0.0008) [2023-10-08 01:44:14,651][52060] Updated weights for policy 0, policy_version 45330 (0.0007) [2023-10-08 01:44:14,843][52059] Updated weights for policy 1, policy_version 45902 (0.0007) [2023-10-08 01:44:15,026][52060] Updated weights for policy 0, policy_version 45340 (0.0008) [2023-10-08 01:44:15,212][52059] Updated weights for policy 1, policy_version 45912 (0.0011) [2023-10-08 01:44:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 93454336. Throughput: 0: 1688.1, 1: 1709.2. Samples: 23369766. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:44:16,211][50642] Avg episode reward: [(0, '19.970'), (1, '19.350')] [2023-10-08 01:44:19,055][52060] Updated weights for policy 0, policy_version 45350 (0.0009) [2023-10-08 01:44:19,140][52059] Updated weights for policy 1, policy_version 45922 (0.0010) [2023-10-08 01:44:19,419][52060] Updated weights for policy 0, policy_version 45360 (0.0009) [2023-10-08 01:44:19,508][52059] Updated weights for policy 1, policy_version 45932 (0.0008) [2023-10-08 01:44:19,788][52060] Updated weights for policy 0, policy_version 45370 (0.0007) [2023-10-08 01:44:19,863][52059] Updated weights for policy 1, policy_version 45942 (0.0010) [2023-10-08 01:44:20,226][52059] Updated weights for policy 1, policy_version 45952 (0.0008) [2023-10-08 01:44:21,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 93519872. Throughput: 0: 1712.0, 1: 1741.2. Samples: 23381686. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:44:21,211][50642] Avg episode reward: [(0, '21.650'), (1, '23.160')] [2023-10-08 01:44:23,733][52060] Updated weights for policy 0, policy_version 45380 (0.0008) [2023-10-08 01:44:24,040][52059] Updated weights for policy 1, policy_version 45962 (0.0009) [2023-10-08 01:44:24,090][52060] Updated weights for policy 0, policy_version 45390 (0.0009) [2023-10-08 01:44:24,390][52059] Updated weights for policy 1, policy_version 45972 (0.0009) [2023-10-08 01:44:24,455][52060] Updated weights for policy 0, policy_version 45400 (0.0007) [2023-10-08 01:44:24,750][52059] Updated weights for policy 1, policy_version 45982 (0.0007) [2023-10-08 01:44:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 93585408. Throughput: 0: 1684.5, 1: 1709.8. Samples: 23400466. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:44:26,211][50642] Avg episode reward: [(0, '21.980'), (1, '17.800')] [2023-10-08 01:44:28,539][52060] Updated weights for policy 0, policy_version 45410 (0.0008) [2023-10-08 01:44:28,621][52059] Updated weights for policy 1, policy_version 45992 (0.0008) [2023-10-08 01:44:28,902][52060] Updated weights for policy 0, policy_version 45420 (0.0008) [2023-10-08 01:44:28,982][52059] Updated weights for policy 1, policy_version 46002 (0.0009) [2023-10-08 01:44:29,273][52060] Updated weights for policy 0, policy_version 45430 (0.0007) [2023-10-08 01:44:29,356][52059] Updated weights for policy 1, policy_version 46012 (0.0009) [2023-10-08 01:44:29,633][52060] Updated weights for policy 0, policy_version 45440 (0.0008) [2023-10-08 01:44:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 93650944. Throughput: 0: 1703.0, 1: 1717.6. Samples: 23421926. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:44:31,211][50642] Avg episode reward: [(0, '18.630'), (1, '18.450')] [2023-10-08 01:44:33,226][52059] Updated weights for policy 1, policy_version 46022 (0.0008) [2023-10-08 01:44:33,593][52059] Updated weights for policy 1, policy_version 46032 (0.0008) [2023-10-08 01:44:33,744][52060] Updated weights for policy 0, policy_version 45450 (0.0009) [2023-10-08 01:44:33,953][52059] Updated weights for policy 1, policy_version 46042 (0.0007) [2023-10-08 01:44:34,102][52060] Updated weights for policy 0, policy_version 45460 (0.0007) [2023-10-08 01:44:34,472][52060] Updated weights for policy 0, policy_version 45470 (0.0007) [2023-10-08 01:44:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 93716480. Throughput: 0: 1706.5, 1: 1726.1. Samples: 23432616. Policy #0 lag: (min: 8.0, avg: 34.1, max: 40.0) [2023-10-08 01:44:36,211][50642] Avg episode reward: [(0, '20.330'), (1, '21.560')] [2023-10-08 01:44:38,045][52059] Updated weights for policy 1, policy_version 46052 (0.0009) [2023-10-08 01:44:38,391][52060] Updated weights for policy 0, policy_version 45480 (0.0009) [2023-10-08 01:44:38,412][52059] Updated weights for policy 1, policy_version 46062 (0.0007) [2023-10-08 01:44:38,755][52060] Updated weights for policy 0, policy_version 45490 (0.0008) [2023-10-08 01:44:38,773][52059] Updated weights for policy 1, policy_version 46072 (0.0007) [2023-10-08 01:44:39,114][52060] Updated weights for policy 0, policy_version 45500 (0.0009) [2023-10-08 01:44:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 93782016. Throughput: 0: 1693.2, 1: 1715.0. Samples: 23452540. Policy #0 lag: (min: 8.0, avg: 34.1, max: 40.0) [2023-10-08 01:44:41,211][50642] Avg episode reward: [(0, '22.160'), (1, '18.840')] [2023-10-08 01:44:42,867][52059] Updated weights for policy 1, policy_version 46082 (0.0008) [2023-10-08 01:44:43,162][52060] Updated weights for policy 0, policy_version 45510 (0.0008) [2023-10-08 01:44:43,278][52059] Updated weights for policy 1, policy_version 46092 (0.0008) [2023-10-08 01:44:43,531][52060] Updated weights for policy 0, policy_version 45520 (0.0009) [2023-10-08 01:44:43,645][52059] Updated weights for policy 1, policy_version 46102 (0.0007) [2023-10-08 01:44:43,901][52060] Updated weights for policy 0, policy_version 45530 (0.0008) [2023-10-08 01:44:43,998][52059] Updated weights for policy 1, policy_version 46112 (0.0009) [2023-10-08 01:44:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 93847552. Throughput: 0: 1719.6, 1: 1733.9. Samples: 23473380. Policy #0 lag: (min: 8.0, avg: 34.1, max: 40.0) [2023-10-08 01:44:46,211][50642] Avg episode reward: [(0, '18.740'), (1, '16.610')] [2023-10-08 01:44:46,220][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000046112_47218688.pth... [2023-10-08 01:44:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000045536_46628864.pth... [2023-10-08 01:44:46,251][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000044512_45580288.pth [2023-10-08 01:44:46,259][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000043936_44990464.pth [2023-10-08 01:44:47,858][52060] Updated weights for policy 0, policy_version 45540 (0.0008) [2023-10-08 01:44:47,991][52059] Updated weights for policy 1, policy_version 46122 (0.0008) [2023-10-08 01:44:48,222][52060] Updated weights for policy 0, policy_version 45550 (0.0007) [2023-10-08 01:44:48,365][52059] Updated weights for policy 1, policy_version 46132 (0.0009) [2023-10-08 01:44:48,589][52060] Updated weights for policy 0, policy_version 45560 (0.0008) [2023-10-08 01:44:48,733][52059] Updated weights for policy 1, policy_version 46142 (0.0011) [2023-10-08 01:44:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 93913088. Throughput: 0: 1692.7, 1: 1710.4. Samples: 23482886. Policy #0 lag: (min: 8.0, avg: 34.1, max: 40.0) [2023-10-08 01:44:51,211][50642] Avg episode reward: [(0, '18.430'), (1, '19.900')] [2023-10-08 01:44:52,640][52060] Updated weights for policy 0, policy_version 45570 (0.0008) [2023-10-08 01:44:52,827][52059] Updated weights for policy 1, policy_version 46152 (0.0008) [2023-10-08 01:44:53,014][52060] Updated weights for policy 0, policy_version 45580 (0.0009) [2023-10-08 01:44:53,186][52059] Updated weights for policy 1, policy_version 46162 (0.0010) [2023-10-08 01:44:53,383][52060] Updated weights for policy 0, policy_version 45590 (0.0007) [2023-10-08 01:44:53,546][52059] Updated weights for policy 1, policy_version 46172 (0.0007) [2023-10-08 01:44:53,744][52060] Updated weights for policy 0, policy_version 45600 (0.0007) [2023-10-08 01:44:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 93978624. Throughput: 0: 1702.0, 1: 1717.5. Samples: 23503998. Policy #0 lag: (min: 8.0, avg: 34.1, max: 40.0) [2023-10-08 01:44:56,211][50642] Avg episode reward: [(0, '21.520'), (1, '19.920')] [2023-10-08 01:44:57,440][52059] Updated weights for policy 1, policy_version 46182 (0.0008) [2023-10-08 01:44:57,659][52060] Updated weights for policy 0, policy_version 45610 (0.0010) [2023-10-08 01:44:57,809][52059] Updated weights for policy 1, policy_version 46192 (0.0009) [2023-10-08 01:44:58,024][52060] Updated weights for policy 0, policy_version 45620 (0.0007) [2023-10-08 01:44:58,176][52059] Updated weights for policy 1, policy_version 46202 (0.0009) [2023-10-08 01:44:58,386][52060] Updated weights for policy 0, policy_version 45630 (0.0009) [2023-10-08 01:45:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13773.7). Total num frames: 94044160. Throughput: 0: 1714.4, 1: 1736.0. Samples: 23525034. Policy #0 lag: (min: 8.0, avg: 34.1, max: 40.0) [2023-10-08 01:45:01,211][50642] Avg episode reward: [(0, '20.540'), (1, '16.600')] [2023-10-08 01:45:02,099][52059] Updated weights for policy 1, policy_version 46212 (0.0007) [2023-10-08 01:45:02,442][52060] Updated weights for policy 0, policy_version 45640 (0.0008) [2023-10-08 01:45:02,463][52059] Updated weights for policy 1, policy_version 46222 (0.0007) [2023-10-08 01:45:02,803][52060] Updated weights for policy 0, policy_version 45650 (0.0007) [2023-10-08 01:45:02,833][52059] Updated weights for policy 1, policy_version 46232 (0.0007) [2023-10-08 01:45:03,176][52060] Updated weights for policy 0, policy_version 45660 (0.0007) [2023-10-08 01:45:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 94109696. Throughput: 0: 1691.1, 1: 1704.4. Samples: 23534480. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) [2023-10-08 01:45:06,211][50642] Avg episode reward: [(0, '19.490'), (1, '18.780')] [2023-10-08 01:45:06,746][52059] Updated weights for policy 1, policy_version 46242 (0.0008) [2023-10-08 01:45:07,097][52059] Updated weights for policy 1, policy_version 46252 (0.0008) [2023-10-08 01:45:07,120][52060] Updated weights for policy 0, policy_version 45670 (0.0008) [2023-10-08 01:45:07,462][52059] Updated weights for policy 1, policy_version 46262 (0.0007) [2023-10-08 01:45:07,498][52060] Updated weights for policy 0, policy_version 45680 (0.0007) [2023-10-08 01:45:07,830][52059] Updated weights for policy 1, policy_version 46272 (0.0007) [2023-10-08 01:45:07,873][52060] Updated weights for policy 0, policy_version 45690 (0.0007) [2023-10-08 01:45:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 94175232. Throughput: 0: 1718.8, 1: 1730.8. Samples: 23555696. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) [2023-10-08 01:45:11,211][50642] Avg episode reward: [(0, '20.010'), (1, '21.050')] [2023-10-08 01:45:11,773][52060] Updated weights for policy 0, policy_version 45700 (0.0007) [2023-10-08 01:45:11,865][52059] Updated weights for policy 1, policy_version 46282 (0.0008) [2023-10-08 01:45:12,136][52060] Updated weights for policy 0, policy_version 45710 (0.0008) [2023-10-08 01:45:12,231][52059] Updated weights for policy 1, policy_version 46292 (0.0007) [2023-10-08 01:45:12,503][52060] Updated weights for policy 0, policy_version 45720 (0.0008) [2023-10-08 01:45:12,590][52059] Updated weights for policy 1, policy_version 46302 (0.0009) [2023-10-08 01:45:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 94240768. Throughput: 0: 1722.3, 1: 1725.6. Samples: 23577080. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) [2023-10-08 01:45:16,211][50642] Avg episode reward: [(0, '22.360'), (1, '18.470')] [2023-10-08 01:45:16,292][52060] Updated weights for policy 0, policy_version 45730 (0.0008) [2023-10-08 01:45:16,582][52059] Updated weights for policy 1, policy_version 46312 (0.0008) [2023-10-08 01:45:16,656][52060] Updated weights for policy 0, policy_version 45740 (0.0007) [2023-10-08 01:45:16,950][52059] Updated weights for policy 1, policy_version 46322 (0.0009) [2023-10-08 01:45:17,036][52060] Updated weights for policy 0, policy_version 45750 (0.0007) [2023-10-08 01:45:17,305][52059] Updated weights for policy 1, policy_version 46332 (0.0008) [2023-10-08 01:45:17,400][52060] Updated weights for policy 0, policy_version 45760 (0.0007) [2023-10-08 01:45:21,133][52059] Updated weights for policy 1, policy_version 46342 (0.0007) [2023-10-08 01:45:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 94306304. Throughput: 0: 1706.4, 1: 1715.2. Samples: 23586588. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) [2023-10-08 01:45:21,211][50642] Avg episode reward: [(0, '19.520'), (1, '19.000')] [2023-10-08 01:45:21,411][52060] Updated weights for policy 0, policy_version 45770 (0.0007) [2023-10-08 01:45:21,509][52059] Updated weights for policy 1, policy_version 46352 (0.0008) [2023-10-08 01:45:21,771][52060] Updated weights for policy 0, policy_version 45780 (0.0009) [2023-10-08 01:45:21,870][52059] Updated weights for policy 1, policy_version 46362 (0.0007) [2023-10-08 01:45:22,134][52060] Updated weights for policy 0, policy_version 45790 (0.0010) [2023-10-08 01:45:25,818][52059] Updated weights for policy 1, policy_version 46372 (0.0007) [2023-10-08 01:45:26,180][52059] Updated weights for policy 1, policy_version 46382 (0.0009) [2023-10-08 01:45:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 94371840. Throughput: 0: 1717.8, 1: 1723.2. Samples: 23607388. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) [2023-10-08 01:45:26,211][50642] Avg episode reward: [(0, '21.290'), (1, '23.000')] [2023-10-08 01:45:26,281][52060] Updated weights for policy 0, policy_version 45800 (0.0009) [2023-10-08 01:45:26,537][52059] Updated weights for policy 1, policy_version 46392 (0.0007) [2023-10-08 01:45:26,645][52060] Updated weights for policy 0, policy_version 45810 (0.0008) [2023-10-08 01:45:27,025][52060] Updated weights for policy 0, policy_version 45820 (0.0009) [2023-10-08 01:45:30,587][52059] Updated weights for policy 1, policy_version 46402 (0.0009) [2023-10-08 01:45:30,989][52059] Updated weights for policy 1, policy_version 46412 (0.0008) [2023-10-08 01:45:31,209][52060] Updated weights for policy 0, policy_version 45830 (0.0009) [2023-10-08 01:45:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 94437376. Throughput: 0: 1722.9, 1: 1715.6. Samples: 23628116. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) [2023-10-08 01:45:31,211][50642] Avg episode reward: [(0, '21.710'), (1, '20.390')] [2023-10-08 01:45:31,352][52059] Updated weights for policy 1, policy_version 46422 (0.0008) [2023-10-08 01:45:31,578][52060] Updated weights for policy 0, policy_version 45840 (0.0009) [2023-10-08 01:45:31,716][52059] Updated weights for policy 1, policy_version 46432 (0.0009) [2023-10-08 01:45:31,941][52060] Updated weights for policy 0, policy_version 45850 (0.0008) [2023-10-08 01:45:35,528][52059] Updated weights for policy 1, policy_version 46442 (0.0008) [2023-10-08 01:45:35,816][52060] Updated weights for policy 0, policy_version 45860 (0.0008) [2023-10-08 01:45:35,894][52059] Updated weights for policy 1, policy_version 46452 (0.0007) [2023-10-08 01:45:36,185][52060] Updated weights for policy 0, policy_version 45870 (0.0008) [2023-10-08 01:45:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 94502912. Throughput: 0: 1716.4, 1: 1727.6. Samples: 23637870. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) [2023-10-08 01:45:36,211][50642] Avg episode reward: [(0, '21.760'), (1, '19.910')] [2023-10-08 01:45:36,270][52059] Updated weights for policy 1, policy_version 46462 (0.0009) [2023-10-08 01:45:36,550][52060] Updated weights for policy 0, policy_version 45880 (0.0008) [2023-10-08 01:45:40,168][52059] Updated weights for policy 1, policy_version 46472 (0.0009) [2023-10-08 01:45:40,477][52060] Updated weights for policy 0, policy_version 45890 (0.0008) [2023-10-08 01:45:40,535][52059] Updated weights for policy 1, policy_version 46482 (0.0009) [2023-10-08 01:45:40,855][52060] Updated weights for policy 0, policy_version 45900 (0.0008) [2023-10-08 01:45:40,890][52059] Updated weights for policy 1, policy_version 46492 (0.0008) [2023-10-08 01:45:41,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 94601216. Throughput: 0: 1719.3, 1: 1730.3. Samples: 23659230. Policy #0 lag: (min: 2.0, avg: 10.3, max: 34.0) [2023-10-08 01:45:41,211][50642] Avg episode reward: [(0, '21.000'), (1, '21.710')] [2023-10-08 01:45:41,220][52060] Updated weights for policy 0, policy_version 45910 (0.0010) [2023-10-08 01:45:41,586][52060] Updated weights for policy 0, policy_version 45920 (0.0011) [2023-10-08 01:45:44,778][52059] Updated weights for policy 1, policy_version 46502 (0.0009) [2023-10-08 01:45:45,151][52059] Updated weights for policy 1, policy_version 46512 (0.0008) [2023-10-08 01:45:45,505][52059] Updated weights for policy 1, policy_version 46522 (0.0008) [2023-10-08 01:45:45,663][52060] Updated weights for policy 0, policy_version 45930 (0.0007) [2023-10-08 01:45:46,026][52060] Updated weights for policy 0, policy_version 45940 (0.0007) [2023-10-08 01:45:46,210][50642] Fps is (10 sec: 16383.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 94666752. Throughput: 0: 1704.9, 1: 1705.4. Samples: 23678496. Policy #0 lag: (min: 2.0, avg: 10.3, max: 34.0) [2023-10-08 01:45:46,211][50642] Avg episode reward: [(0, '19.990'), (1, '21.400')] [2023-10-08 01:45:46,401][52060] Updated weights for policy 0, policy_version 45950 (0.0007) [2023-10-08 01:45:49,408][52059] Updated weights for policy 1, policy_version 46532 (0.0007) [2023-10-08 01:45:49,766][52059] Updated weights for policy 1, policy_version 46542 (0.0007) [2023-10-08 01:45:50,136][52059] Updated weights for policy 1, policy_version 46552 (0.0008) [2023-10-08 01:45:50,313][52060] Updated weights for policy 0, policy_version 45960 (0.0009) [2023-10-08 01:45:50,672][52060] Updated weights for policy 0, policy_version 45970 (0.0009) [2023-10-08 01:45:51,048][52060] Updated weights for policy 0, policy_version 45980 (0.0007) [2023-10-08 01:45:51,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 94765056. Throughput: 0: 1719.1, 1: 1737.8. Samples: 23690042. Policy #0 lag: (min: 2.0, avg: 10.3, max: 34.0) [2023-10-08 01:45:51,211][50642] Avg episode reward: [(0, '21.720'), (1, '19.550')] [2023-10-08 01:45:54,142][52059] Updated weights for policy 1, policy_version 46562 (0.0009) [2023-10-08 01:45:54,514][52059] Updated weights for policy 1, policy_version 46572 (0.0011) [2023-10-08 01:45:54,870][52060] Updated weights for policy 0, policy_version 45990 (0.0008) [2023-10-08 01:45:54,880][52059] Updated weights for policy 1, policy_version 46582 (0.0007) [2023-10-08 01:45:55,239][52060] Updated weights for policy 0, policy_version 46000 (0.0007) [2023-10-08 01:45:55,252][52059] Updated weights for policy 1, policy_version 46592 (0.0008) [2023-10-08 01:45:55,612][52060] Updated weights for policy 0, policy_version 46010 (0.0008) [2023-10-08 01:45:56,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 94830592. Throughput: 0: 1716.0, 1: 1719.9. Samples: 23710310. Policy #0 lag: (min: 2.0, avg: 10.3, max: 34.0) [2023-10-08 01:45:56,211][50642] Avg episode reward: [(0, '20.370'), (1, '19.960')] [2023-10-08 01:45:59,257][52059] Updated weights for policy 1, policy_version 46602 (0.0011) [2023-10-08 01:45:59,625][52059] Updated weights for policy 1, policy_version 46612 (0.0007) [2023-10-08 01:45:59,718][52060] Updated weights for policy 0, policy_version 46020 (0.0010) [2023-10-08 01:45:59,999][52059] Updated weights for policy 1, policy_version 46622 (0.0008) [2023-10-08 01:46:00,087][52060] Updated weights for policy 0, policy_version 46030 (0.0008) [2023-10-08 01:46:00,464][52060] Updated weights for policy 0, policy_version 46040 (0.0009) [2023-10-08 01:46:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 94896128. Throughput: 0: 1680.7, 1: 1711.5. Samples: 23729732. Policy #0 lag: (min: 2.0, avg: 10.3, max: 34.0) [2023-10-08 01:46:01,211][50642] Avg episode reward: [(0, '20.190'), (1, '20.960')] [2023-10-08 01:46:03,852][52059] Updated weights for policy 1, policy_version 46632 (0.0007) [2023-10-08 01:46:04,215][52059] Updated weights for policy 1, policy_version 46642 (0.0008) [2023-10-08 01:46:04,587][52060] Updated weights for policy 0, policy_version 46050 (0.0009) [2023-10-08 01:46:04,587][52059] Updated weights for policy 1, policy_version 46652 (0.0007) [2023-10-08 01:46:04,971][52060] Updated weights for policy 0, policy_version 46060 (0.0011) [2023-10-08 01:46:05,338][52060] Updated weights for policy 0, policy_version 46070 (0.0011) [2023-10-08 01:46:05,713][52060] Updated weights for policy 0, policy_version 46080 (0.0010) [2023-10-08 01:46:06,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 94961664. Throughput: 0: 1701.2, 1: 1734.3. Samples: 23741184. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:46:06,211][50642] Avg episode reward: [(0, '20.390'), (1, '20.010')] [2023-10-08 01:46:08,588][52059] Updated weights for policy 1, policy_version 46662 (0.0009) [2023-10-08 01:46:08,947][52059] Updated weights for policy 1, policy_version 46672 (0.0007) [2023-10-08 01:46:09,312][52059] Updated weights for policy 1, policy_version 46682 (0.0009) [2023-10-08 01:46:09,624][52060] Updated weights for policy 0, policy_version 46090 (0.0007) [2023-10-08 01:46:09,992][52060] Updated weights for policy 0, policy_version 46100 (0.0009) [2023-10-08 01:46:10,367][52060] Updated weights for policy 0, policy_version 46110 (0.0011) [2023-10-08 01:46:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 95027200. Throughput: 0: 1697.1, 1: 1715.9. Samples: 23760974. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:46:11,211][50642] Avg episode reward: [(0, '20.420'), (1, '19.140')] [2023-10-08 01:46:13,094][52059] Updated weights for policy 1, policy_version 46692 (0.0008) [2023-10-08 01:46:13,455][52059] Updated weights for policy 1, policy_version 46702 (0.0008) [2023-10-08 01:46:13,833][52059] Updated weights for policy 1, policy_version 46712 (0.0008) [2023-10-08 01:46:14,369][52060] Updated weights for policy 0, policy_version 46120 (0.0007) [2023-10-08 01:46:14,731][52060] Updated weights for policy 0, policy_version 46130 (0.0007) [2023-10-08 01:46:15,100][52060] Updated weights for policy 0, policy_version 46140 (0.0007) [2023-10-08 01:46:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 95092736. Throughput: 0: 1680.0, 1: 1729.9. Samples: 23781558. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:46:16,211][50642] Avg episode reward: [(0, '20.150'), (1, '20.580')] [2023-10-08 01:46:17,843][52059] Updated weights for policy 1, policy_version 46722 (0.0009) [2023-10-08 01:46:18,250][52059] Updated weights for policy 1, policy_version 46732 (0.0008) [2023-10-08 01:46:18,612][52059] Updated weights for policy 1, policy_version 46742 (0.0009) [2023-10-08 01:46:18,972][52059] Updated weights for policy 1, policy_version 46752 (0.0008) [2023-10-08 01:46:19,265][52060] Updated weights for policy 0, policy_version 46150 (0.0007) [2023-10-08 01:46:19,635][52060] Updated weights for policy 0, policy_version 46160 (0.0010) [2023-10-08 01:46:20,004][52060] Updated weights for policy 0, policy_version 46170 (0.0008) [2023-10-08 01:46:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 95158272. Throughput: 0: 1710.6, 1: 1723.2. Samples: 23792390. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:46:21,211][50642] Avg episode reward: [(0, '20.590'), (1, '20.540')] [2023-10-08 01:46:22,945][52059] Updated weights for policy 1, policy_version 46762 (0.0009) [2023-10-08 01:46:23,313][52059] Updated weights for policy 1, policy_version 46772 (0.0010) [2023-10-08 01:46:23,677][52059] Updated weights for policy 1, policy_version 46782 (0.0009) [2023-10-08 01:46:24,023][52060] Updated weights for policy 0, policy_version 46180 (0.0010) [2023-10-08 01:46:24,393][52060] Updated weights for policy 0, policy_version 46190 (0.0010) [2023-10-08 01:46:24,755][52060] Updated weights for policy 0, policy_version 46200 (0.0010) [2023-10-08 01:46:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 95223808. Throughput: 0: 1685.6, 1: 1716.6. Samples: 23812326. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:46:26,211][50642] Avg episode reward: [(0, '19.730'), (1, '18.890')] [2023-10-08 01:46:27,439][52059] Updated weights for policy 1, policy_version 46792 (0.0007) [2023-10-08 01:46:27,808][52059] Updated weights for policy 1, policy_version 46802 (0.0007) [2023-10-08 01:46:28,174][52059] Updated weights for policy 1, policy_version 46812 (0.0008) [2023-10-08 01:46:28,700][52060] Updated weights for policy 0, policy_version 46210 (0.0011) [2023-10-08 01:46:29,076][52060] Updated weights for policy 0, policy_version 46220 (0.0008) [2023-10-08 01:46:29,450][52060] Updated weights for policy 0, policy_version 46230 (0.0010) [2023-10-08 01:46:29,831][52060] Updated weights for policy 0, policy_version 46240 (0.0008) [2023-10-08 01:46:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 95289344. Throughput: 0: 1696.6, 1: 1749.2. Samples: 23833558. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:46:31,211][50642] Avg episode reward: [(0, '20.850'), (1, '17.510')] [2023-10-08 01:46:32,140][52059] Updated weights for policy 1, policy_version 46822 (0.0008) [2023-10-08 01:46:32,516][52059] Updated weights for policy 1, policy_version 46832 (0.0007) [2023-10-08 01:46:32,876][52059] Updated weights for policy 1, policy_version 46842 (0.0007) [2023-10-08 01:46:33,862][52060] Updated weights for policy 0, policy_version 46250 (0.0009) [2023-10-08 01:46:34,237][52060] Updated weights for policy 0, policy_version 46260 (0.0009) [2023-10-08 01:46:34,601][52060] Updated weights for policy 0, policy_version 46270 (0.0007) [2023-10-08 01:46:36,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 95354880. Throughput: 0: 1699.1, 1: 1714.8. Samples: 23843666. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:46:36,211][50642] Avg episode reward: [(0, '20.740'), (1, '22.710')] [2023-10-08 01:46:36,786][52059] Updated weights for policy 1, policy_version 46852 (0.0007) [2023-10-08 01:46:37,150][52059] Updated weights for policy 1, policy_version 46862 (0.0009) [2023-10-08 01:46:37,522][52059] Updated weights for policy 1, policy_version 46872 (0.0009) [2023-10-08 01:46:38,708][52060] Updated weights for policy 0, policy_version 46280 (0.0009) [2023-10-08 01:46:39,078][52060] Updated weights for policy 0, policy_version 46290 (0.0009) [2023-10-08 01:46:39,449][52060] Updated weights for policy 0, policy_version 46300 (0.0007) [2023-10-08 01:46:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 95420416. Throughput: 0: 1676.0, 1: 1734.5. Samples: 23863782. Policy #0 lag: (min: 8.0, avg: 30.7, max: 40.0) [2023-10-08 01:46:41,211][50642] Avg episode reward: [(0, '20.610'), (1, '19.920')] [2023-10-08 01:46:41,440][52059] Updated weights for policy 1, policy_version 46882 (0.0009) [2023-10-08 01:46:41,801][52059] Updated weights for policy 1, policy_version 46892 (0.0011) [2023-10-08 01:46:42,162][52059] Updated weights for policy 1, policy_version 46902 (0.0011) [2023-10-08 01:46:42,525][52059] Updated weights for policy 1, policy_version 46912 (0.0007) [2023-10-08 01:46:43,345][52060] Updated weights for policy 0, policy_version 46310 (0.0007) [2023-10-08 01:46:43,705][52060] Updated weights for policy 0, policy_version 46320 (0.0009) [2023-10-08 01:46:44,085][52060] Updated weights for policy 0, policy_version 46330 (0.0008) [2023-10-08 01:46:46,211][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 95485952. Throughput: 0: 1704.6, 1: 1746.7. Samples: 23885042. Policy #0 lag: (min: 8.0, avg: 30.7, max: 40.0) [2023-10-08 01:46:46,212][50642] Avg episode reward: [(0, '20.650'), (1, '19.390')] [2023-10-08 01:46:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000046336_47448064.pth... [2023-10-08 01:46:46,261][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000044736_45809664.pth [2023-10-08 01:46:46,339][52059] Updated weights for policy 1, policy_version 46922 (0.0009) [2023-10-08 01:46:46,700][52059] Updated weights for policy 1, policy_version 46932 (0.0010) [2023-10-08 01:46:47,074][52059] Updated weights for policy 1, policy_version 46942 (0.0010) [2023-10-08 01:46:47,146][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000046944_48070656.pth... [2023-10-08 01:46:47,175][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000045312_46399488.pth [2023-10-08 01:46:48,019][52060] Updated weights for policy 0, policy_version 46340 (0.0009) [2023-10-08 01:46:48,390][52060] Updated weights for policy 0, policy_version 46350 (0.0009) [2023-10-08 01:46:48,760][52060] Updated weights for policy 0, policy_version 46360 (0.0008) [2023-10-08 01:46:51,051][52059] Updated weights for policy 1, policy_version 46952 (0.0010) [2023-10-08 01:46:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 95551488. Throughput: 0: 1689.0, 1: 1722.4. Samples: 23894698. Policy #0 lag: (min: 8.0, avg: 30.7, max: 40.0) [2023-10-08 01:46:51,211][50642] Avg episode reward: [(0, '20.770'), (1, '22.740')] [2023-10-08 01:46:51,424][52059] Updated weights for policy 1, policy_version 46962 (0.0008) [2023-10-08 01:46:51,801][52059] Updated weights for policy 1, policy_version 46972 (0.0009) [2023-10-08 01:46:52,658][52060] Updated weights for policy 0, policy_version 46370 (0.0008) [2023-10-08 01:46:53,026][52060] Updated weights for policy 0, policy_version 46380 (0.0009) [2023-10-08 01:46:53,397][52060] Updated weights for policy 0, policy_version 46390 (0.0008) [2023-10-08 01:46:53,756][52060] Updated weights for policy 0, policy_version 46400 (0.0009) [2023-10-08 01:46:55,700][52059] Updated weights for policy 1, policy_version 46982 (0.0008) [2023-10-08 01:46:56,065][52059] Updated weights for policy 1, policy_version 46992 (0.0007) [2023-10-08 01:46:56,210][50642] Fps is (10 sec: 13107.7, 60 sec: 13107.3, 300 sec: 13773.7). Total num frames: 95617024. Throughput: 0: 1694.4, 1: 1748.6. Samples: 23915908. Policy #0 lag: (min: 8.0, avg: 30.7, max: 40.0) [2023-10-08 01:46:56,211][50642] Avg episode reward: [(0, '20.000'), (1, '23.110')] [2023-10-08 01:46:56,431][52059] Updated weights for policy 1, policy_version 47002 (0.0008) [2023-10-08 01:46:57,788][52060] Updated weights for policy 0, policy_version 46410 (0.0008) [2023-10-08 01:46:58,164][52060] Updated weights for policy 0, policy_version 46420 (0.0007) [2023-10-08 01:46:58,542][52060] Updated weights for policy 0, policy_version 46430 (0.0011) [2023-10-08 01:47:00,544][52059] Updated weights for policy 1, policy_version 47012 (0.0009) [2023-10-08 01:47:00,901][52059] Updated weights for policy 1, policy_version 47022 (0.0007) [2023-10-08 01:47:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 95682560. Throughput: 0: 1709.3, 1: 1736.3. Samples: 23936612. Policy #0 lag: (min: 8.0, avg: 30.7, max: 40.0) [2023-10-08 01:47:01,211][50642] Avg episode reward: [(0, '19.380'), (1, '19.370')] [2023-10-08 01:47:01,266][52059] Updated weights for policy 1, policy_version 47032 (0.0008) [2023-10-08 01:47:02,519][52060] Updated weights for policy 0, policy_version 46440 (0.0010) [2023-10-08 01:47:02,898][52060] Updated weights for policy 0, policy_version 46450 (0.0009) [2023-10-08 01:47:03,266][52060] Updated weights for policy 0, policy_version 46460 (0.0011) [2023-10-08 01:47:05,173][52059] Updated weights for policy 1, policy_version 47042 (0.0009) [2023-10-08 01:47:05,568][52059] Updated weights for policy 1, policy_version 47052 (0.0010) [2023-10-08 01:47:05,928][52059] Updated weights for policy 1, policy_version 47062 (0.0010) [2023-10-08 01:47:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 95748096. Throughput: 0: 1678.1, 1: 1747.5. Samples: 23946540. Policy #0 lag: (min: 8.0, avg: 30.7, max: 40.0) [2023-10-08 01:47:06,211][50642] Avg episode reward: [(0, '21.300'), (1, '20.180')] [2023-10-08 01:47:06,291][52059] Updated weights for policy 1, policy_version 47072 (0.0010) [2023-10-08 01:47:07,329][52060] Updated weights for policy 0, policy_version 46470 (0.0010) [2023-10-08 01:47:07,715][52060] Updated weights for policy 0, policy_version 46480 (0.0008) [2023-10-08 01:47:08,090][52060] Updated weights for policy 0, policy_version 46490 (0.0008) [2023-10-08 01:47:10,091][52059] Updated weights for policy 1, policy_version 47082 (0.0009) [2023-10-08 01:47:10,447][52059] Updated weights for policy 1, policy_version 47092 (0.0009) [2023-10-08 01:47:10,830][52059] Updated weights for policy 1, policy_version 47102 (0.0009) [2023-10-08 01:47:11,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 95846400. Throughput: 0: 1707.0, 1: 1745.5. Samples: 23967690. Policy #0 lag: (min: 8.0, avg: 30.7, max: 40.0) [2023-10-08 01:47:11,211][50642] Avg episode reward: [(0, '20.460'), (1, '24.480')] [2023-10-08 01:47:11,213][51710] Saving new best policy, reward=24.480! [2023-10-08 01:47:12,035][52060] Updated weights for policy 0, policy_version 46500 (0.0009) [2023-10-08 01:47:12,400][52060] Updated weights for policy 0, policy_version 46510 (0.0010) [2023-10-08 01:47:12,781][52060] Updated weights for policy 0, policy_version 46520 (0.0011) [2023-10-08 01:47:14,838][52059] Updated weights for policy 1, policy_version 47112 (0.0008) [2023-10-08 01:47:15,214][52059] Updated weights for policy 1, policy_version 47122 (0.0007) [2023-10-08 01:47:15,572][52059] Updated weights for policy 1, policy_version 47132 (0.0008) [2023-10-08 01:47:16,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 95911936. Throughput: 0: 1712.6, 1: 1715.9. Samples: 23987840. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:47:16,211][50642] Avg episode reward: [(0, '18.650'), (1, '20.270')] [2023-10-08 01:47:16,842][52060] Updated weights for policy 0, policy_version 46530 (0.0009) [2023-10-08 01:47:17,219][52060] Updated weights for policy 0, policy_version 46540 (0.0007) [2023-10-08 01:47:17,588][52060] Updated weights for policy 0, policy_version 46550 (0.0009) [2023-10-08 01:47:17,954][52060] Updated weights for policy 0, policy_version 46560 (0.0010) [2023-10-08 01:47:19,331][52059] Updated weights for policy 1, policy_version 47142 (0.0009) [2023-10-08 01:47:19,696][52059] Updated weights for policy 1, policy_version 47152 (0.0009) [2023-10-08 01:47:20,068][52059] Updated weights for policy 1, policy_version 47162 (0.0008) [2023-10-08 01:47:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 95977472. Throughput: 0: 1693.8, 1: 1753.4. Samples: 23998790. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:47:21,211][50642] Avg episode reward: [(0, '19.890'), (1, '19.240')] [2023-10-08 01:47:21,940][52060] Updated weights for policy 0, policy_version 46570 (0.0008) [2023-10-08 01:47:22,306][52060] Updated weights for policy 0, policy_version 46580 (0.0008) [2023-10-08 01:47:22,677][52060] Updated weights for policy 0, policy_version 46590 (0.0007) [2023-10-08 01:47:24,010][52059] Updated weights for policy 1, policy_version 47172 (0.0008) [2023-10-08 01:47:24,372][52059] Updated weights for policy 1, policy_version 47182 (0.0008) [2023-10-08 01:47:24,735][52059] Updated weights for policy 1, policy_version 47192 (0.0007) [2023-10-08 01:47:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 96043008. Throughput: 0: 1719.3, 1: 1730.1. Samples: 24019004. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:47:26,211][50642] Avg episode reward: [(0, '20.920'), (1, '23.670')] [2023-10-08 01:47:26,710][52060] Updated weights for policy 0, policy_version 46600 (0.0007) [2023-10-08 01:47:27,082][52060] Updated weights for policy 0, policy_version 46610 (0.0007) [2023-10-08 01:47:27,458][52060] Updated weights for policy 0, policy_version 46620 (0.0007) [2023-10-08 01:47:28,652][52059] Updated weights for policy 1, policy_version 47202 (0.0008) [2023-10-08 01:47:29,022][52059] Updated weights for policy 1, policy_version 47212 (0.0007) [2023-10-08 01:47:29,385][52059] Updated weights for policy 1, policy_version 47222 (0.0007) [2023-10-08 01:47:29,748][52059] Updated weights for policy 1, policy_version 47232 (0.0009) [2023-10-08 01:47:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 96108544. Throughput: 0: 1721.1, 1: 1718.5. Samples: 24039822. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:47:31,211][50642] Avg episode reward: [(0, '19.810'), (1, '20.950')] [2023-10-08 01:47:31,452][52060] Updated weights for policy 0, policy_version 46630 (0.0007) [2023-10-08 01:47:31,811][52060] Updated weights for policy 0, policy_version 46640 (0.0010) [2023-10-08 01:47:32,181][52060] Updated weights for policy 0, policy_version 46650 (0.0007) [2023-10-08 01:47:33,804][52059] Updated weights for policy 1, policy_version 47242 (0.0011) [2023-10-08 01:47:34,170][52059] Updated weights for policy 1, policy_version 47252 (0.0010) [2023-10-08 01:47:34,537][52059] Updated weights for policy 1, policy_version 47262 (0.0011) [2023-10-08 01:47:36,030][52060] Updated weights for policy 0, policy_version 46660 (0.0009) [2023-10-08 01:47:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 96174080. Throughput: 0: 1709.7, 1: 1741.5. Samples: 24050000. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:47:36,211][50642] Avg episode reward: [(0, '19.380'), (1, '18.700')] [2023-10-08 01:47:36,405][52060] Updated weights for policy 0, policy_version 46670 (0.0010) [2023-10-08 01:47:36,768][52060] Updated weights for policy 0, policy_version 46680 (0.0009) [2023-10-08 01:47:38,529][52059] Updated weights for policy 1, policy_version 47272 (0.0012) [2023-10-08 01:47:38,895][52059] Updated weights for policy 1, policy_version 47282 (0.0012) [2023-10-08 01:47:39,260][52059] Updated weights for policy 1, policy_version 47292 (0.0010) [2023-10-08 01:47:40,879][52060] Updated weights for policy 0, policy_version 46690 (0.0009) [2023-10-08 01:47:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 96239616. Throughput: 0: 1716.2, 1: 1717.5. Samples: 24070422. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:47:41,211][50642] Avg episode reward: [(0, '21.240'), (1, '23.030')] [2023-10-08 01:47:41,257][52060] Updated weights for policy 0, policy_version 46700 (0.0009) [2023-10-08 01:47:41,626][52060] Updated weights for policy 0, policy_version 46710 (0.0008) [2023-10-08 01:47:41,993][52060] Updated weights for policy 0, policy_version 46720 (0.0008) [2023-10-08 01:47:43,220][52059] Updated weights for policy 1, policy_version 47302 (0.0009) [2023-10-08 01:47:43,589][52059] Updated weights for policy 1, policy_version 47312 (0.0009) [2023-10-08 01:47:43,954][52059] Updated weights for policy 1, policy_version 47322 (0.0007) [2023-10-08 01:47:45,894][52060] Updated weights for policy 0, policy_version 46730 (0.0009) [2023-10-08 01:47:46,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 96305152. Throughput: 0: 1709.6, 1: 1730.4. Samples: 24091412. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:47:46,211][50642] Avg episode reward: [(0, '20.260'), (1, '22.690')] [2023-10-08 01:47:46,259][52060] Updated weights for policy 0, policy_version 46740 (0.0007) [2023-10-08 01:47:46,632][52060] Updated weights for policy 0, policy_version 46750 (0.0007) [2023-10-08 01:47:47,756][52059] Updated weights for policy 1, policy_version 47332 (0.0007) [2023-10-08 01:47:48,123][52059] Updated weights for policy 1, policy_version 47342 (0.0009) [2023-10-08 01:47:48,479][52059] Updated weights for policy 1, policy_version 47352 (0.0008) [2023-10-08 01:47:50,560][52060] Updated weights for policy 0, policy_version 46760 (0.0007) [2023-10-08 01:47:50,934][52060] Updated weights for policy 0, policy_version 46770 (0.0008) [2023-10-08 01:47:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 96370688. Throughput: 0: 1723.3, 1: 1714.7. Samples: 24101248. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:47:51,211][50642] Avg episode reward: [(0, '20.460'), (1, '17.840')] [2023-10-08 01:47:51,299][52060] Updated weights for policy 0, policy_version 46780 (0.0008) [2023-10-08 01:47:52,486][52059] Updated weights for policy 1, policy_version 47362 (0.0008) [2023-10-08 01:47:52,849][52059] Updated weights for policy 1, policy_version 47372 (0.0007) [2023-10-08 01:47:53,225][52059] Updated weights for policy 1, policy_version 47382 (0.0008) [2023-10-08 01:47:53,596][52059] Updated weights for policy 1, policy_version 47392 (0.0010) [2023-10-08 01:47:55,317][52060] Updated weights for policy 0, policy_version 46790 (0.0008) [2023-10-08 01:47:55,701][52060] Updated weights for policy 0, policy_version 46800 (0.0009) [2023-10-08 01:47:56,069][52060] Updated weights for policy 0, policy_version 46810 (0.0008) [2023-10-08 01:47:56,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 96436224. Throughput: 0: 1719.9, 1: 1715.3. Samples: 24122276. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:47:56,211][50642] Avg episode reward: [(0, '22.120'), (1, '19.830')] [2023-10-08 01:47:57,591][52059] Updated weights for policy 1, policy_version 47402 (0.0010) [2023-10-08 01:47:57,960][52059] Updated weights for policy 1, policy_version 47412 (0.0009) [2023-10-08 01:47:58,325][52059] Updated weights for policy 1, policy_version 47422 (0.0009) [2023-10-08 01:47:59,917][52060] Updated weights for policy 0, policy_version 46820 (0.0008) [2023-10-08 01:48:00,285][52060] Updated weights for policy 0, policy_version 46830 (0.0010) [2023-10-08 01:48:00,658][52060] Updated weights for policy 0, policy_version 46840 (0.0010) [2023-10-08 01:48:01,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 96534528. Throughput: 0: 1693.3, 1: 1744.6. Samples: 24142546. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:48:01,211][50642] Avg episode reward: [(0, '20.950'), (1, '24.640')] [2023-10-08 01:48:01,223][51710] Saving new best policy, reward=24.640! [2023-10-08 01:48:02,228][52059] Updated weights for policy 1, policy_version 47432 (0.0007) [2023-10-08 01:48:02,582][52059] Updated weights for policy 1, policy_version 47442 (0.0007) [2023-10-08 01:48:02,949][52059] Updated weights for policy 1, policy_version 47452 (0.0007) [2023-10-08 01:48:04,591][52060] Updated weights for policy 0, policy_version 46850 (0.0008) [2023-10-08 01:48:04,974][52060] Updated weights for policy 0, policy_version 46860 (0.0007) [2023-10-08 01:48:05,339][52060] Updated weights for policy 0, policy_version 46870 (0.0008) [2023-10-08 01:48:05,708][52060] Updated weights for policy 0, policy_version 46880 (0.0008) [2023-10-08 01:48:06,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 96600064. Throughput: 0: 1719.8, 1: 1708.0. Samples: 24153038. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:48:06,211][50642] Avg episode reward: [(0, '20.150'), (1, '20.200')] [2023-10-08 01:48:06,956][52059] Updated weights for policy 1, policy_version 47462 (0.0007) [2023-10-08 01:48:07,321][52059] Updated weights for policy 1, policy_version 47472 (0.0008) [2023-10-08 01:48:07,686][52059] Updated weights for policy 1, policy_version 47482 (0.0007) [2023-10-08 01:48:09,623][52060] Updated weights for policy 0, policy_version 46890 (0.0009) [2023-10-08 01:48:09,995][52060] Updated weights for policy 0, policy_version 46900 (0.0008) [2023-10-08 01:48:10,357][52060] Updated weights for policy 0, policy_version 46910 (0.0007) [2023-10-08 01:48:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 96665600. Throughput: 0: 1711.0, 1: 1727.4. Samples: 24173732. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:48:11,211][50642] Avg episode reward: [(0, '22.530'), (1, '18.840')] [2023-10-08 01:48:11,607][52059] Updated weights for policy 1, policy_version 47492 (0.0008) [2023-10-08 01:48:11,977][52059] Updated weights for policy 1, policy_version 47502 (0.0007) [2023-10-08 01:48:12,331][52059] Updated weights for policy 1, policy_version 47512 (0.0007) [2023-10-08 01:48:14,445][52060] Updated weights for policy 0, policy_version 46920 (0.0008) [2023-10-08 01:48:14,806][52060] Updated weights for policy 0, policy_version 46930 (0.0009) [2023-10-08 01:48:15,170][52060] Updated weights for policy 0, policy_version 46940 (0.0007) [2023-10-08 01:48:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 96731136. Throughput: 0: 1694.2, 1: 1737.6. Samples: 24194252. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) [2023-10-08 01:48:16,211][50642] Avg episode reward: [(0, '22.090'), (1, '22.490')] [2023-10-08 01:48:16,284][52059] Updated weights for policy 1, policy_version 47522 (0.0007) [2023-10-08 01:48:16,649][52059] Updated weights for policy 1, policy_version 47532 (0.0010) [2023-10-08 01:48:17,014][52059] Updated weights for policy 1, policy_version 47542 (0.0009) [2023-10-08 01:48:17,374][52059] Updated weights for policy 1, policy_version 47552 (0.0009) [2023-10-08 01:48:19,234][52060] Updated weights for policy 0, policy_version 46950 (0.0009) [2023-10-08 01:48:19,599][52060] Updated weights for policy 0, policy_version 46960 (0.0007) [2023-10-08 01:48:19,964][52060] Updated weights for policy 0, policy_version 46970 (0.0008) [2023-10-08 01:48:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 96796672. Throughput: 0: 1728.2, 1: 1717.0. Samples: 24205036. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:48:21,211][50642] Avg episode reward: [(0, '20.240'), (1, '23.220')] [2023-10-08 01:48:21,283][52059] Updated weights for policy 1, policy_version 47562 (0.0008) [2023-10-08 01:48:21,646][52059] Updated weights for policy 1, policy_version 47572 (0.0010) [2023-10-08 01:48:22,011][52059] Updated weights for policy 1, policy_version 47582 (0.0010) [2023-10-08 01:48:23,843][52060] Updated weights for policy 0, policy_version 46980 (0.0010) [2023-10-08 01:48:24,208][52060] Updated weights for policy 0, policy_version 46990 (0.0008) [2023-10-08 01:48:24,577][52060] Updated weights for policy 0, policy_version 47000 (0.0009) [2023-10-08 01:48:25,775][52059] Updated weights for policy 1, policy_version 47592 (0.0008) [2023-10-08 01:48:26,146][52059] Updated weights for policy 1, policy_version 47602 (0.0011) [2023-10-08 01:48:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 96862208. Throughput: 0: 1705.3, 1: 1744.0. Samples: 24225640. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:48:26,211][50642] Avg episode reward: [(0, '20.420'), (1, '19.420')] [2023-10-08 01:48:26,511][52059] Updated weights for policy 1, policy_version 47612 (0.0008) [2023-10-08 01:48:28,338][52060] Updated weights for policy 0, policy_version 47010 (0.0008) [2023-10-08 01:48:28,711][52060] Updated weights for policy 0, policy_version 47020 (0.0009) [2023-10-08 01:48:29,070][52060] Updated weights for policy 0, policy_version 47030 (0.0009) [2023-10-08 01:48:29,441][52060] Updated weights for policy 0, policy_version 47040 (0.0008) [2023-10-08 01:48:30,355][52059] Updated weights for policy 1, policy_version 47622 (0.0009) [2023-10-08 01:48:30,724][52059] Updated weights for policy 1, policy_version 47632 (0.0009) [2023-10-08 01:48:31,086][52059] Updated weights for policy 1, policy_version 47642 (0.0007) [2023-10-08 01:48:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 96927744. Throughput: 0: 1709.5, 1: 1731.3. Samples: 24246246. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:48:31,211][50642] Avg episode reward: [(0, '20.980'), (1, '21.960')] [2023-10-08 01:48:33,622][52060] Updated weights for policy 0, policy_version 47050 (0.0009) [2023-10-08 01:48:33,986][52060] Updated weights for policy 0, policy_version 47060 (0.0008) [2023-10-08 01:48:34,357][52060] Updated weights for policy 0, policy_version 47070 (0.0009) [2023-10-08 01:48:35,057][52059] Updated weights for policy 1, policy_version 47652 (0.0007) [2023-10-08 01:48:35,433][52059] Updated weights for policy 1, policy_version 47662 (0.0009) [2023-10-08 01:48:35,795][52059] Updated weights for policy 1, policy_version 47672 (0.0010) [2023-10-08 01:48:36,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 97026048. Throughput: 0: 1710.3, 1: 1746.2. Samples: 24256792. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:48:36,211][50642] Avg episode reward: [(0, '20.260'), (1, '22.870')] [2023-10-08 01:48:38,466][52060] Updated weights for policy 0, policy_version 47080 (0.0010) [2023-10-08 01:48:38,836][52060] Updated weights for policy 0, policy_version 47090 (0.0008) [2023-10-08 01:48:39,205][52060] Updated weights for policy 0, policy_version 47100 (0.0007) [2023-10-08 01:48:39,645][52059] Updated weights for policy 1, policy_version 47682 (0.0009) [2023-10-08 01:48:40,013][52059] Updated weights for policy 1, policy_version 47692 (0.0009) [2023-10-08 01:48:40,377][52059] Updated weights for policy 1, policy_version 47702 (0.0007) [2023-10-08 01:48:40,732][52059] Updated weights for policy 1, policy_version 47712 (0.0007) [2023-10-08 01:48:41,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 97091584. Throughput: 0: 1691.8, 1: 1750.3. Samples: 24277168. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:48:41,211][50642] Avg episode reward: [(0, '20.230'), (1, '19.240')] [2023-10-08 01:48:43,198][52060] Updated weights for policy 0, policy_version 47110 (0.0007) [2023-10-08 01:48:43,581][52060] Updated weights for policy 0, policy_version 47120 (0.0008) [2023-10-08 01:48:43,949][52060] Updated weights for policy 0, policy_version 47130 (0.0009) [2023-10-08 01:48:44,636][52059] Updated weights for policy 1, policy_version 47722 (0.0009) [2023-10-08 01:48:45,005][52059] Updated weights for policy 1, policy_version 47732 (0.0009) [2023-10-08 01:48:45,369][52059] Updated weights for policy 1, policy_version 47742 (0.0009) [2023-10-08 01:48:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 97157120. Throughput: 0: 1715.7, 1: 1725.8. Samples: 24297414. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:48:46,211][50642] Avg episode reward: [(0, '20.250'), (1, '21.720')] [2023-10-08 01:48:46,222][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000047744_48889856.pth... [2023-10-08 01:48:46,222][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000047136_48267264.pth... [2023-10-08 01:48:46,253][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000046112_47218688.pth [2023-10-08 01:48:46,257][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000045536_46628864.pth [2023-10-08 01:48:47,904][52060] Updated weights for policy 0, policy_version 47140 (0.0010) [2023-10-08 01:48:48,272][52060] Updated weights for policy 0, policy_version 47150 (0.0007) [2023-10-08 01:48:48,635][52060] Updated weights for policy 0, policy_version 47160 (0.0009) [2023-10-08 01:48:49,330][52059] Updated weights for policy 1, policy_version 47752 (0.0011) [2023-10-08 01:48:49,706][52059] Updated weights for policy 1, policy_version 47762 (0.0009) [2023-10-08 01:48:50,068][52059] Updated weights for policy 1, policy_version 47772 (0.0007) [2023-10-08 01:48:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 97222656. Throughput: 0: 1691.7, 1: 1753.9. Samples: 24308092. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:48:51,211][50642] Avg episode reward: [(0, '21.140'), (1, '21.660')] [2023-10-08 01:48:52,733][52060] Updated weights for policy 0, policy_version 47170 (0.0008) [2023-10-08 01:48:53,098][52060] Updated weights for policy 0, policy_version 47180 (0.0007) [2023-10-08 01:48:53,461][52060] Updated weights for policy 0, policy_version 47190 (0.0009) [2023-10-08 01:48:53,838][52060] Updated weights for policy 0, policy_version 47200 (0.0010) [2023-10-08 01:48:54,089][52059] Updated weights for policy 1, policy_version 47782 (0.0008) [2023-10-08 01:48:54,458][52059] Updated weights for policy 1, policy_version 47792 (0.0011) [2023-10-08 01:48:54,808][52059] Updated weights for policy 1, policy_version 47802 (0.0010) [2023-10-08 01:48:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13662.6). Total num frames: 97288192. Throughput: 0: 1696.7, 1: 1728.9. Samples: 24327886. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:48:56,211][50642] Avg episode reward: [(0, '20.800'), (1, '20.050')] [2023-10-08 01:48:57,804][52060] Updated weights for policy 0, policy_version 47210 (0.0010) [2023-10-08 01:48:58,174][52060] Updated weights for policy 0, policy_version 47220 (0.0008) [2023-10-08 01:48:58,551][52060] Updated weights for policy 0, policy_version 47230 (0.0008) [2023-10-08 01:48:58,796][52059] Updated weights for policy 1, policy_version 47812 (0.0009) [2023-10-08 01:48:59,162][52059] Updated weights for policy 1, policy_version 47822 (0.0009) [2023-10-08 01:48:59,520][52059] Updated weights for policy 1, policy_version 47832 (0.0009) [2023-10-08 01:49:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 97353728. Throughput: 0: 1718.8, 1: 1724.7. Samples: 24349212. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:49:01,211][50642] Avg episode reward: [(0, '20.620'), (1, '20.240')] [2023-10-08 01:49:02,422][52060] Updated weights for policy 0, policy_version 47240 (0.0008) [2023-10-08 01:49:02,792][52060] Updated weights for policy 0, policy_version 47250 (0.0009) [2023-10-08 01:49:03,169][52060] Updated weights for policy 0, policy_version 47260 (0.0008) [2023-10-08 01:49:03,418][52059] Updated weights for policy 1, policy_version 47842 (0.0009) [2023-10-08 01:49:03,788][52059] Updated weights for policy 1, policy_version 47852 (0.0009) [2023-10-08 01:49:04,153][52059] Updated weights for policy 1, policy_version 47862 (0.0008) [2023-10-08 01:49:04,524][52059] Updated weights for policy 1, policy_version 47872 (0.0009) [2023-10-08 01:49:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 97419264. Throughput: 0: 1687.9, 1: 1744.7. Samples: 24359504. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:49:06,211][50642] Avg episode reward: [(0, '20.310'), (1, '21.910')] [2023-10-08 01:49:06,978][52060] Updated weights for policy 0, policy_version 47270 (0.0008) [2023-10-08 01:49:07,341][52060] Updated weights for policy 0, policy_version 47280 (0.0008) [2023-10-08 01:49:07,722][52060] Updated weights for policy 0, policy_version 47290 (0.0009) [2023-10-08 01:49:08,301][52059] Updated weights for policy 1, policy_version 47882 (0.0010) [2023-10-08 01:49:08,664][52059] Updated weights for policy 1, policy_version 47892 (0.0012) [2023-10-08 01:49:09,030][52059] Updated weights for policy 1, policy_version 47902 (0.0008) [2023-10-08 01:49:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 97484800. Throughput: 0: 1713.7, 1: 1724.6. Samples: 24380366. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:49:11,211][50642] Avg episode reward: [(0, '20.950'), (1, '20.930')] [2023-10-08 01:49:11,694][52060] Updated weights for policy 0, policy_version 47300 (0.0009) [2023-10-08 01:49:12,064][52060] Updated weights for policy 0, policy_version 47310 (0.0008) [2023-10-08 01:49:12,426][52060] Updated weights for policy 0, policy_version 47320 (0.0009) [2023-10-08 01:49:12,925][52059] Updated weights for policy 1, policy_version 47912 (0.0009) [2023-10-08 01:49:13,299][52059] Updated weights for policy 1, policy_version 47922 (0.0008) [2023-10-08 01:49:13,666][52059] Updated weights for policy 1, policy_version 47932 (0.0007) [2023-10-08 01:49:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 97550336. Throughput: 0: 1720.5, 1: 1736.3. Samples: 24401802. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:49:16,211][50642] Avg episode reward: [(0, '20.250'), (1, '19.400')] [2023-10-08 01:49:16,283][52060] Updated weights for policy 0, policy_version 47330 (0.0008) [2023-10-08 01:49:16,647][52060] Updated weights for policy 0, policy_version 47340 (0.0010) [2023-10-08 01:49:17,017][52060] Updated weights for policy 0, policy_version 47350 (0.0010) [2023-10-08 01:49:17,370][52060] Updated weights for policy 0, policy_version 47360 (0.0010) [2023-10-08 01:49:17,581][52059] Updated weights for policy 1, policy_version 47942 (0.0008) [2023-10-08 01:49:17,945][52059] Updated weights for policy 1, policy_version 47952 (0.0009) [2023-10-08 01:49:18,307][52059] Updated weights for policy 1, policy_version 47962 (0.0009) [2023-10-08 01:49:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 97615872. Throughput: 0: 1708.0, 1: 1723.3. Samples: 24411204. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:49:21,211][50642] Avg episode reward: [(0, '20.840'), (1, '21.630')] [2023-10-08 01:49:21,393][52060] Updated weights for policy 0, policy_version 47370 (0.0007) [2023-10-08 01:49:21,759][52060] Updated weights for policy 0, policy_version 47380 (0.0010) [2023-10-08 01:49:22,131][52060] Updated weights for policy 0, policy_version 47390 (0.0008) [2023-10-08 01:49:22,317][52059] Updated weights for policy 1, policy_version 47972 (0.0008) [2023-10-08 01:49:22,693][52059] Updated weights for policy 1, policy_version 47982 (0.0007) [2023-10-08 01:49:23,049][52059] Updated weights for policy 1, policy_version 47992 (0.0007) [2023-10-08 01:49:26,074][52060] Updated weights for policy 0, policy_version 47400 (0.0008) [2023-10-08 01:49:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 97681408. Throughput: 0: 1726.1, 1: 1725.7. Samples: 24432500. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:49:26,211][50642] Avg episode reward: [(0, '21.800'), (1, '21.970')] [2023-10-08 01:49:26,449][52060] Updated weights for policy 0, policy_version 47410 (0.0008) [2023-10-08 01:49:26,821][52060] Updated weights for policy 0, policy_version 47420 (0.0008) [2023-10-08 01:49:26,975][52059] Updated weights for policy 1, policy_version 48002 (0.0009) [2023-10-08 01:49:27,343][52059] Updated weights for policy 1, policy_version 48012 (0.0009) [2023-10-08 01:49:27,717][52059] Updated weights for policy 1, policy_version 48022 (0.0010) [2023-10-08 01:49:28,082][52059] Updated weights for policy 1, policy_version 48032 (0.0007) [2023-10-08 01:49:30,930][52060] Updated weights for policy 0, policy_version 47430 (0.0008) [2023-10-08 01:49:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 97746944. Throughput: 0: 1724.5, 1: 1749.9. Samples: 24453764. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:49:31,211][50642] Avg episode reward: [(0, '21.680'), (1, '21.210')] [2023-10-08 01:49:31,313][52060] Updated weights for policy 0, policy_version 47440 (0.0009) [2023-10-08 01:49:31,680][52060] Updated weights for policy 0, policy_version 47450 (0.0007) [2023-10-08 01:49:31,898][52059] Updated weights for policy 1, policy_version 48042 (0.0008) [2023-10-08 01:49:32,266][52059] Updated weights for policy 1, policy_version 48052 (0.0008) [2023-10-08 01:49:32,625][52059] Updated weights for policy 1, policy_version 48062 (0.0008) [2023-10-08 01:49:35,718][52060] Updated weights for policy 0, policy_version 47460 (0.0008) [2023-10-08 01:49:36,088][52060] Updated weights for policy 0, policy_version 47470 (0.0008) [2023-10-08 01:49:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 97812480. Throughput: 0: 1724.8, 1: 1723.6. Samples: 24463268. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:49:36,211][50642] Avg episode reward: [(0, '21.290'), (1, '21.140')] [2023-10-08 01:49:36,443][52060] Updated weights for policy 0, policy_version 47480 (0.0008) [2023-10-08 01:49:36,556][52059] Updated weights for policy 1, policy_version 48072 (0.0009) [2023-10-08 01:49:36,926][52059] Updated weights for policy 1, policy_version 48082 (0.0009) [2023-10-08 01:49:37,298][52059] Updated weights for policy 1, policy_version 48092 (0.0008) [2023-10-08 01:49:40,423][52060] Updated weights for policy 0, policy_version 47490 (0.0008) [2023-10-08 01:49:40,802][52060] Updated weights for policy 0, policy_version 47500 (0.0008) [2023-10-08 01:49:41,165][52060] Updated weights for policy 0, policy_version 47510 (0.0008) [2023-10-08 01:49:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 97878016. Throughput: 0: 1727.2, 1: 1749.9. Samples: 24484352. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:49:41,211][50642] Avg episode reward: [(0, '21.380'), (1, '22.000')] [2023-10-08 01:49:41,299][52059] Updated weights for policy 1, policy_version 48102 (0.0009) [2023-10-08 01:49:41,530][52060] Updated weights for policy 0, policy_version 47520 (0.0009) [2023-10-08 01:49:41,673][52059] Updated weights for policy 1, policy_version 48112 (0.0009) [2023-10-08 01:49:42,031][52059] Updated weights for policy 1, policy_version 48122 (0.0010) [2023-10-08 01:49:45,375][52060] Updated weights for policy 0, policy_version 47530 (0.0008) [2023-10-08 01:49:45,744][52060] Updated weights for policy 0, policy_version 47540 (0.0008) [2023-10-08 01:49:45,939][52059] Updated weights for policy 1, policy_version 48132 (0.0009) [2023-10-08 01:49:46,105][52060] Updated weights for policy 0, policy_version 47550 (0.0008) [2023-10-08 01:49:46,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 97976320. Throughput: 0: 1703.3, 1: 1758.2. Samples: 24504978. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:49:46,211][50642] Avg episode reward: [(0, '20.800'), (1, '21.330')] [2023-10-08 01:49:46,298][52059] Updated weights for policy 1, policy_version 48142 (0.0009) [2023-10-08 01:49:46,670][52059] Updated weights for policy 1, policy_version 48152 (0.0009) [2023-10-08 01:49:50,065][52060] Updated weights for policy 0, policy_version 47560 (0.0008) [2023-10-08 01:49:50,431][52060] Updated weights for policy 0, policy_version 47570 (0.0008) [2023-10-08 01:49:50,540][52059] Updated weights for policy 1, policy_version 48162 (0.0008) [2023-10-08 01:49:50,803][52060] Updated weights for policy 0, policy_version 47580 (0.0008) [2023-10-08 01:49:50,899][52059] Updated weights for policy 1, policy_version 48172 (0.0008) [2023-10-08 01:49:51,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 98041856. Throughput: 0: 1721.3, 1: 1738.4. Samples: 24515194. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:49:51,211][50642] Avg episode reward: [(0, '21.560'), (1, '21.460')] [2023-10-08 01:49:51,261][52059] Updated weights for policy 1, policy_version 48182 (0.0008) [2023-10-08 01:49:51,630][52059] Updated weights for policy 1, policy_version 48192 (0.0008) [2023-10-08 01:49:54,880][52060] Updated weights for policy 0, policy_version 47590 (0.0008) [2023-10-08 01:49:55,250][52060] Updated weights for policy 0, policy_version 47600 (0.0007) [2023-10-08 01:49:55,551][52059] Updated weights for policy 1, policy_version 48202 (0.0008) [2023-10-08 01:49:55,616][52060] Updated weights for policy 0, policy_version 47610 (0.0008) [2023-10-08 01:49:55,925][52059] Updated weights for policy 1, policy_version 48212 (0.0010) [2023-10-08 01:49:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 98107392. Throughput: 0: 1713.7, 1: 1752.4. Samples: 24536344. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:49:56,211][50642] Avg episode reward: [(0, '19.600'), (1, '23.190')] [2023-10-08 01:49:56,278][52059] Updated weights for policy 1, policy_version 48222 (0.0009) [2023-10-08 01:49:59,669][52060] Updated weights for policy 0, policy_version 47620 (0.0007) [2023-10-08 01:50:00,035][52060] Updated weights for policy 0, policy_version 47630 (0.0007) [2023-10-08 01:50:00,138][52059] Updated weights for policy 1, policy_version 48232 (0.0007) [2023-10-08 01:50:00,402][52060] Updated weights for policy 0, policy_version 47640 (0.0007) [2023-10-08 01:50:00,496][52059] Updated weights for policy 1, policy_version 48242 (0.0007) [2023-10-08 01:50:00,856][52059] Updated weights for policy 1, policy_version 48252 (0.0008) [2023-10-08 01:50:01,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 98205696. Throughput: 0: 1681.2, 1: 1729.9. Samples: 24555304. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:50:01,211][50642] Avg episode reward: [(0, '19.370'), (1, '22.380')] [2023-10-08 01:50:04,377][52060] Updated weights for policy 0, policy_version 47650 (0.0009) [2023-10-08 01:50:04,686][52059] Updated weights for policy 1, policy_version 48262 (0.0008) [2023-10-08 01:50:04,760][52060] Updated weights for policy 0, policy_version 47660 (0.0007) [2023-10-08 01:50:05,054][52059] Updated weights for policy 1, policy_version 48272 (0.0008) [2023-10-08 01:50:05,130][52060] Updated weights for policy 0, policy_version 47670 (0.0009) [2023-10-08 01:50:05,415][52059] Updated weights for policy 1, policy_version 48282 (0.0008) [2023-10-08 01:50:05,495][52060] Updated weights for policy 0, policy_version 47680 (0.0007) [2023-10-08 01:50:06,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 98271232. Throughput: 0: 1713.9, 1: 1752.3. Samples: 24567180. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:50:06,211][50642] Avg episode reward: [(0, '21.820'), (1, '21.500')] [2023-10-08 01:50:09,427][52059] Updated weights for policy 1, policy_version 48292 (0.0009) [2023-10-08 01:50:09,539][52060] Updated weights for policy 0, policy_version 47690 (0.0007) [2023-10-08 01:50:09,789][52059] Updated weights for policy 1, policy_version 48302 (0.0009) [2023-10-08 01:50:09,898][52060] Updated weights for policy 0, policy_version 47700 (0.0007) [2023-10-08 01:50:10,155][52059] Updated weights for policy 1, policy_version 48312 (0.0010) [2023-10-08 01:50:10,268][52060] Updated weights for policy 0, policy_version 47710 (0.0008) [2023-10-08 01:50:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 98336768. Throughput: 0: 1700.2, 1: 1739.4. Samples: 24587282. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:50:11,211][50642] Avg episode reward: [(0, '20.670'), (1, '22.170')] [2023-10-08 01:50:14,185][52060] Updated weights for policy 0, policy_version 47720 (0.0008) [2023-10-08 01:50:14,197][52059] Updated weights for policy 1, policy_version 48322 (0.0009) [2023-10-08 01:50:14,562][52060] Updated weights for policy 0, policy_version 47730 (0.0009) [2023-10-08 01:50:14,564][52059] Updated weights for policy 1, policy_version 48332 (0.0008) [2023-10-08 01:50:14,923][52060] Updated weights for policy 0, policy_version 47740 (0.0009) [2023-10-08 01:50:14,926][52059] Updated weights for policy 1, policy_version 48342 (0.0007) [2023-10-08 01:50:15,282][52059] Updated weights for policy 1, policy_version 48352 (0.0008) [2023-10-08 01:50:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 98402304. Throughput: 0: 1690.9, 1: 1717.8. Samples: 24607156. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:50:16,211][50642] Avg episode reward: [(0, '19.630'), (1, '22.160')] [2023-10-08 01:50:19,131][52060] Updated weights for policy 0, policy_version 47750 (0.0007) [2023-10-08 01:50:19,298][52059] Updated weights for policy 1, policy_version 48362 (0.0010) [2023-10-08 01:50:19,513][52060] Updated weights for policy 0, policy_version 47760 (0.0009) [2023-10-08 01:50:19,655][52059] Updated weights for policy 1, policy_version 48372 (0.0008) [2023-10-08 01:50:19,884][52060] Updated weights for policy 0, policy_version 47770 (0.0007) [2023-10-08 01:50:20,023][52059] Updated weights for policy 1, policy_version 48382 (0.0007) [2023-10-08 01:50:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 98467840. Throughput: 0: 1714.0, 1: 1746.8. Samples: 24619002. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:50:21,211][50642] Avg episode reward: [(0, '21.240'), (1, '22.900')] [2023-10-08 01:50:23,942][52059] Updated weights for policy 1, policy_version 48392 (0.0007) [2023-10-08 01:50:23,992][52060] Updated weights for policy 0, policy_version 47780 (0.0007) [2023-10-08 01:50:24,305][52059] Updated weights for policy 1, policy_version 48402 (0.0007) [2023-10-08 01:50:24,366][52060] Updated weights for policy 0, policy_version 47790 (0.0008) [2023-10-08 01:50:24,673][52059] Updated weights for policy 1, policy_version 48412 (0.0007) [2023-10-08 01:50:24,729][52060] Updated weights for policy 0, policy_version 47800 (0.0008) [2023-10-08 01:50:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 98533376. Throughput: 0: 1687.7, 1: 1718.1. Samples: 24637614. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:50:26,211][50642] Avg episode reward: [(0, '21.600'), (1, '21.330')] [2023-10-08 01:50:28,520][52059] Updated weights for policy 1, policy_version 48422 (0.0009) [2023-10-08 01:50:28,640][52060] Updated weights for policy 0, policy_version 47810 (0.0007) [2023-10-08 01:50:28,882][52059] Updated weights for policy 1, policy_version 48432 (0.0008) [2023-10-08 01:50:29,015][52060] Updated weights for policy 0, policy_version 47820 (0.0007) [2023-10-08 01:50:29,247][52059] Updated weights for policy 1, policy_version 48442 (0.0007) [2023-10-08 01:50:29,381][52060] Updated weights for policy 0, policy_version 47830 (0.0007) [2023-10-08 01:50:29,748][52060] Updated weights for policy 0, policy_version 47840 (0.0007) [2023-10-08 01:50:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 98598912. Throughput: 0: 1700.4, 1: 1717.5. Samples: 24658780. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:50:31,211][50642] Avg episode reward: [(0, '19.700'), (1, '23.890')] [2023-10-08 01:50:33,214][52059] Updated weights for policy 1, policy_version 48452 (0.0008) [2023-10-08 01:50:33,579][52059] Updated weights for policy 1, policy_version 48462 (0.0008) [2023-10-08 01:50:33,804][52060] Updated weights for policy 0, policy_version 47850 (0.0007) [2023-10-08 01:50:33,944][52059] Updated weights for policy 1, policy_version 48472 (0.0009) [2023-10-08 01:50:34,168][52060] Updated weights for policy 0, policy_version 47860 (0.0009) [2023-10-08 01:50:34,536][52060] Updated weights for policy 0, policy_version 47870 (0.0011) [2023-10-08 01:50:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 98664448. Throughput: 0: 1700.1, 1: 1726.8. Samples: 24669406. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:50:36,211][50642] Avg episode reward: [(0, '21.660'), (1, '22.450')] [2023-10-08 01:50:37,774][52059] Updated weights for policy 1, policy_version 48482 (0.0007) [2023-10-08 01:50:38,127][52059] Updated weights for policy 1, policy_version 48492 (0.0007) [2023-10-08 01:50:38,496][52059] Updated weights for policy 1, policy_version 48502 (0.0007) [2023-10-08 01:50:38,581][52060] Updated weights for policy 0, policy_version 47880 (0.0008) [2023-10-08 01:50:38,859][52059] Updated weights for policy 1, policy_version 48512 (0.0007) [2023-10-08 01:50:38,955][52060] Updated weights for policy 0, policy_version 47890 (0.0009) [2023-10-08 01:50:39,316][52060] Updated weights for policy 0, policy_version 47900 (0.0008) [2023-10-08 01:50:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 98729984. Throughput: 0: 1678.3, 1: 1718.4. Samples: 24689194. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:50:41,211][50642] Avg episode reward: [(0, '22.200'), (1, '21.770')] [2023-10-08 01:50:42,829][52059] Updated weights for policy 1, policy_version 48522 (0.0010) [2023-10-08 01:50:43,192][52059] Updated weights for policy 1, policy_version 48532 (0.0009) [2023-10-08 01:50:43,202][52060] Updated weights for policy 0, policy_version 47910 (0.0007) [2023-10-08 01:50:43,553][52059] Updated weights for policy 1, policy_version 48542 (0.0009) [2023-10-08 01:50:43,564][52060] Updated weights for policy 0, policy_version 47920 (0.0008) [2023-10-08 01:50:43,925][52060] Updated weights for policy 0, policy_version 47930 (0.0008) [2023-10-08 01:50:46,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 98795520. Throughput: 0: 1707.7, 1: 1735.7. Samples: 24710258. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:50:46,211][50642] Avg episode reward: [(0, '20.630'), (1, '22.260')] [2023-10-08 01:50:46,221][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000047936_49086464.pth... [2023-10-08 01:50:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000048544_49709056.pth... [2023-10-08 01:50:46,250][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000046336_47448064.pth [2023-10-08 01:50:46,254][51605] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p0/milestones/checkpoint_000047936_49086464.pth [2023-10-08 01:50:46,262][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000046944_48070656.pth [2023-10-08 01:50:46,268][51710] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p1/milestones/checkpoint_000048544_49709056.pth [2023-10-08 01:50:47,500][52059] Updated weights for policy 1, policy_version 48552 (0.0007) [2023-10-08 01:50:47,858][52059] Updated weights for policy 1, policy_version 48562 (0.0008) [2023-10-08 01:50:47,920][52060] Updated weights for policy 0, policy_version 47940 (0.0009) [2023-10-08 01:50:48,229][52059] Updated weights for policy 1, policy_version 48572 (0.0010) [2023-10-08 01:50:48,285][52060] Updated weights for policy 0, policy_version 47950 (0.0010) [2023-10-08 01:50:48,656][52060] Updated weights for policy 0, policy_version 47960 (0.0008) [2023-10-08 01:50:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 98861056. Throughput: 0: 1679.6, 1: 1711.6. Samples: 24719788. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:50:51,211][50642] Avg episode reward: [(0, '19.370'), (1, '21.360')] [2023-10-08 01:50:52,248][52059] Updated weights for policy 1, policy_version 48582 (0.0010) [2023-10-08 01:50:52,607][52059] Updated weights for policy 1, policy_version 48592 (0.0007) [2023-10-08 01:50:52,720][52060] Updated weights for policy 0, policy_version 47970 (0.0007) [2023-10-08 01:50:52,964][52059] Updated weights for policy 1, policy_version 48602 (0.0007) [2023-10-08 01:50:53,089][52060] Updated weights for policy 0, policy_version 47980 (0.0009) [2023-10-08 01:50:53,461][52060] Updated weights for policy 0, policy_version 47990 (0.0009) [2023-10-08 01:50:53,834][52060] Updated weights for policy 0, policy_version 48000 (0.0010) [2023-10-08 01:50:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 98926592. Throughput: 0: 1688.3, 1: 1722.8. Samples: 24740786. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:50:56,211][50642] Avg episode reward: [(0, '20.150'), (1, '22.770')] [2023-10-08 01:50:56,854][52059] Updated weights for policy 1, policy_version 48612 (0.0007) [2023-10-08 01:50:57,211][52059] Updated weights for policy 1, policy_version 48622 (0.0007) [2023-10-08 01:50:57,577][52059] Updated weights for policy 1, policy_version 48632 (0.0007) [2023-10-08 01:50:57,793][52060] Updated weights for policy 0, policy_version 48010 (0.0009) [2023-10-08 01:50:58,164][52060] Updated weights for policy 0, policy_version 48020 (0.0007) [2023-10-08 01:50:58,533][52060] Updated weights for policy 0, policy_version 48030 (0.0007) [2023-10-08 01:51:01,211][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 98992128. Throughput: 0: 1699.1, 1: 1735.8. Samples: 24761726. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:51:01,212][50642] Avg episode reward: [(0, '20.760'), (1, '23.120')] [2023-10-08 01:51:01,633][52059] Updated weights for policy 1, policy_version 48642 (0.0009) [2023-10-08 01:51:02,001][52059] Updated weights for policy 1, policy_version 48652 (0.0009) [2023-10-08 01:51:02,376][52059] Updated weights for policy 1, policy_version 48662 (0.0008) [2023-10-08 01:51:02,505][52060] Updated weights for policy 0, policy_version 48040 (0.0009) [2023-10-08 01:51:02,741][52059] Updated weights for policy 1, policy_version 48672 (0.0008) [2023-10-08 01:51:02,862][52060] Updated weights for policy 0, policy_version 48050 (0.0009) [2023-10-08 01:51:03,234][52060] Updated weights for policy 0, policy_version 48060 (0.0009) [2023-10-08 01:51:06,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 99057664. Throughput: 0: 1672.4, 1: 1709.3. Samples: 24771180. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:51:06,211][50642] Avg episode reward: [(0, '19.170'), (1, '21.200')] [2023-10-08 01:51:06,631][52059] Updated weights for policy 1, policy_version 48682 (0.0007) [2023-10-08 01:51:06,992][52059] Updated weights for policy 1, policy_version 48692 (0.0009) [2023-10-08 01:51:07,363][52059] Updated weights for policy 1, policy_version 48702 (0.0007) [2023-10-08 01:51:07,489][52060] Updated weights for policy 0, policy_version 48070 (0.0008) [2023-10-08 01:51:07,875][52060] Updated weights for policy 0, policy_version 48080 (0.0011) [2023-10-08 01:51:08,243][52060] Updated weights for policy 0, policy_version 48090 (0.0008) [2023-10-08 01:51:11,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 99123200. Throughput: 0: 1696.0, 1: 1743.7. Samples: 24792402. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:51:11,211][50642] Avg episode reward: [(0, '19.900'), (1, '20.470')] [2023-10-08 01:51:11,231][52059] Updated weights for policy 1, policy_version 48712 (0.0009) [2023-10-08 01:51:11,590][52059] Updated weights for policy 1, policy_version 48722 (0.0009) [2023-10-08 01:51:11,951][52059] Updated weights for policy 1, policy_version 48732 (0.0007) [2023-10-08 01:51:12,158][52060] Updated weights for policy 0, policy_version 48100 (0.0008) [2023-10-08 01:51:12,533][52060] Updated weights for policy 0, policy_version 48110 (0.0009) [2023-10-08 01:51:12,898][52060] Updated weights for policy 0, policy_version 48120 (0.0009) [2023-10-08 01:51:15,662][52059] Updated weights for policy 1, policy_version 48742 (0.0008) [2023-10-08 01:51:16,043][52059] Updated weights for policy 1, policy_version 48752 (0.0008) [2023-10-08 01:51:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 99188736. Throughput: 0: 1703.3, 1: 1734.9. Samples: 24813502. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:51:16,211][50642] Avg episode reward: [(0, '20.960'), (1, '22.460')] [2023-10-08 01:51:16,404][52059] Updated weights for policy 1, policy_version 48762 (0.0007) [2023-10-08 01:51:17,016][52060] Updated weights for policy 0, policy_version 48130 (0.0008) [2023-10-08 01:51:17,382][52060] Updated weights for policy 0, policy_version 48140 (0.0010) [2023-10-08 01:51:17,751][52060] Updated weights for policy 0, policy_version 48150 (0.0009) [2023-10-08 01:51:18,119][52060] Updated weights for policy 0, policy_version 48160 (0.0009) [2023-10-08 01:51:20,336][52059] Updated weights for policy 1, policy_version 48772 (0.0008) [2023-10-08 01:51:20,699][52059] Updated weights for policy 1, policy_version 48782 (0.0008) [2023-10-08 01:51:21,072][52059] Updated weights for policy 1, policy_version 48792 (0.0009) [2023-10-08 01:51:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 99254272. Throughput: 0: 1685.9, 1: 1737.3. Samples: 24823450. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:51:21,211][50642] Avg episode reward: [(0, '19.630'), (1, '19.340')] [2023-10-08 01:51:22,125][52060] Updated weights for policy 0, policy_version 48170 (0.0008) [2023-10-08 01:51:22,495][52060] Updated weights for policy 0, policy_version 48180 (0.0008) [2023-10-08 01:51:22,860][52060] Updated weights for policy 0, policy_version 48190 (0.0007) [2023-10-08 01:51:25,089][52059] Updated weights for policy 1, policy_version 48802 (0.0010) [2023-10-08 01:51:25,455][52059] Updated weights for policy 1, policy_version 48812 (0.0009) [2023-10-08 01:51:25,822][52059] Updated weights for policy 1, policy_version 48822 (0.0008) [2023-10-08 01:51:26,183][52059] Updated weights for policy 1, policy_version 48832 (0.0009) [2023-10-08 01:51:26,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 99352576. Throughput: 0: 1712.4, 1: 1748.1. Samples: 24844918. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:51:26,211][50642] Avg episode reward: [(0, '20.940'), (1, '20.270')] [2023-10-08 01:51:26,968][52060] Updated weights for policy 0, policy_version 48200 (0.0009) [2023-10-08 01:51:27,346][52060] Updated weights for policy 0, policy_version 48210 (0.0009) [2023-10-08 01:51:27,722][52060] Updated weights for policy 0, policy_version 48220 (0.0008) [2023-10-08 01:51:30,057][52059] Updated weights for policy 1, policy_version 48842 (0.0008) [2023-10-08 01:51:30,422][52059] Updated weights for policy 1, policy_version 48852 (0.0009) [2023-10-08 01:51:30,790][52059] Updated weights for policy 1, policy_version 48862 (0.0009) [2023-10-08 01:51:31,210][50642] Fps is (10 sec: 16384.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 99418112. Throughput: 0: 1713.6, 1: 1730.4. Samples: 24865234. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:51:31,211][50642] Avg episode reward: [(0, '21.040'), (1, '21.800')] [2023-10-08 01:51:31,593][52060] Updated weights for policy 0, policy_version 48230 (0.0007) [2023-10-08 01:51:31,960][52060] Updated weights for policy 0, policy_version 48240 (0.0008) [2023-10-08 01:51:32,333][52060] Updated weights for policy 0, policy_version 48250 (0.0007) [2023-10-08 01:51:34,638][52059] Updated weights for policy 1, policy_version 48872 (0.0010) [2023-10-08 01:51:34,998][52059] Updated weights for policy 1, policy_version 48882 (0.0007) [2023-10-08 01:51:35,362][52059] Updated weights for policy 1, policy_version 48892 (0.0007) [2023-10-08 01:51:36,081][52060] Updated weights for policy 0, policy_version 48260 (0.0008) [2023-10-08 01:51:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 99483648. Throughput: 0: 1710.0, 1: 1758.8. Samples: 24875880. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:51:36,211][50642] Avg episode reward: [(0, '19.750'), (1, '22.730')] [2023-10-08 01:51:36,450][52060] Updated weights for policy 0, policy_version 48270 (0.0009) [2023-10-08 01:51:36,819][52060] Updated weights for policy 0, policy_version 48280 (0.0009) [2023-10-08 01:51:39,245][52059] Updated weights for policy 1, policy_version 48902 (0.0009) [2023-10-08 01:51:39,606][52059] Updated weights for policy 1, policy_version 48912 (0.0008) [2023-10-08 01:51:39,977][52059] Updated weights for policy 1, policy_version 48922 (0.0007) [2023-10-08 01:51:40,604][52060] Updated weights for policy 0, policy_version 48290 (0.0009) [2023-10-08 01:51:40,977][52060] Updated weights for policy 0, policy_version 48300 (0.0008) [2023-10-08 01:51:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 99549184. Throughput: 0: 1724.5, 1: 1743.9. Samples: 24896866. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 01:51:41,211][50642] Avg episode reward: [(0, '19.590'), (1, '21.580')] [2023-10-08 01:51:41,352][52060] Updated weights for policy 0, policy_version 48310 (0.0008) [2023-10-08 01:51:41,714][52060] Updated weights for policy 0, policy_version 48320 (0.0010) [2023-10-08 01:51:43,710][52059] Updated weights for policy 1, policy_version 48932 (0.0008) [2023-10-08 01:51:44,076][52059] Updated weights for policy 1, policy_version 48942 (0.0008) [2023-10-08 01:51:44,443][52059] Updated weights for policy 1, policy_version 48952 (0.0009) [2023-10-08 01:51:45,600][52060] Updated weights for policy 0, policy_version 48330 (0.0008) [2023-10-08 01:51:45,972][52060] Updated weights for policy 0, policy_version 48340 (0.0008) [2023-10-08 01:51:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 99614720. Throughput: 0: 1717.2, 1: 1739.5. Samples: 24917278. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:51:46,211][50642] Avg episode reward: [(0, '21.120'), (1, '22.510')] [2023-10-08 01:51:46,330][52060] Updated weights for policy 0, policy_version 48350 (0.0010) [2023-10-08 01:51:48,356][52059] Updated weights for policy 1, policy_version 48962 (0.0007) [2023-10-08 01:51:48,717][52059] Updated weights for policy 1, policy_version 48972 (0.0007) [2023-10-08 01:51:49,081][52059] Updated weights for policy 1, policy_version 48982 (0.0008) [2023-10-08 01:51:49,451][52059] Updated weights for policy 1, policy_version 48992 (0.0007) [2023-10-08 01:51:50,307][52060] Updated weights for policy 0, policy_version 48360 (0.0009) [2023-10-08 01:51:50,679][52060] Updated weights for policy 0, policy_version 48370 (0.0011) [2023-10-08 01:51:51,037][52060] Updated weights for policy 0, policy_version 48380 (0.0008) [2023-10-08 01:51:51,210][50642] Fps is (10 sec: 16383.6, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 99713024. Throughput: 0: 1733.1, 1: 1752.7. Samples: 24928044. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:51:51,211][50642] Avg episode reward: [(0, '19.480'), (1, '23.640')] [2023-10-08 01:51:53,416][52059] Updated weights for policy 1, policy_version 49002 (0.0008) [2023-10-08 01:51:53,775][52059] Updated weights for policy 1, policy_version 49012 (0.0008) [2023-10-08 01:51:54,150][52059] Updated weights for policy 1, policy_version 49022 (0.0008) [2023-10-08 01:51:55,088][52060] Updated weights for policy 0, policy_version 48390 (0.0009) [2023-10-08 01:51:55,459][52060] Updated weights for policy 0, policy_version 48400 (0.0010) [2023-10-08 01:51:55,830][52060] Updated weights for policy 0, policy_version 48410 (0.0009) [2023-10-08 01:51:56,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 99778560. Throughput: 0: 1738.1, 1: 1736.9. Samples: 24948778. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:51:56,211][50642] Avg episode reward: [(0, '19.810'), (1, '20.640')] [2023-10-08 01:51:58,082][52059] Updated weights for policy 1, policy_version 49032 (0.0009) [2023-10-08 01:51:58,441][52059] Updated weights for policy 1, policy_version 49042 (0.0007) [2023-10-08 01:51:58,822][52059] Updated weights for policy 1, policy_version 49052 (0.0008) [2023-10-08 01:51:59,898][52060] Updated weights for policy 0, policy_version 48420 (0.0009) [2023-10-08 01:52:00,268][52060] Updated weights for policy 0, policy_version 48430 (0.0010) [2023-10-08 01:52:00,640][52060] Updated weights for policy 0, policy_version 48440 (0.0010) [2023-10-08 01:52:01,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.6, 300 sec: 13884.7). Total num frames: 99844096. Throughput: 0: 1706.6, 1: 1746.5. Samples: 24968892. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:52:01,211][50642] Avg episode reward: [(0, '21.520'), (1, '20.120')] [2023-10-08 01:52:02,755][52059] Updated weights for policy 1, policy_version 49062 (0.0007) [2023-10-08 01:52:03,125][52059] Updated weights for policy 1, policy_version 49072 (0.0009) [2023-10-08 01:52:03,490][52059] Updated weights for policy 1, policy_version 49082 (0.0008) [2023-10-08 01:52:04,625][52060] Updated weights for policy 0, policy_version 48450 (0.0009) [2023-10-08 01:52:04,990][52060] Updated weights for policy 0, policy_version 48460 (0.0007) [2023-10-08 01:52:05,363][52060] Updated weights for policy 0, policy_version 48470 (0.0010) [2023-10-08 01:52:05,732][52060] Updated weights for policy 0, policy_version 48480 (0.0010) [2023-10-08 01:52:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 99909632. Throughput: 0: 1734.0, 1: 1730.8. Samples: 24979368. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:52:06,211][50642] Avg episode reward: [(0, '20.220'), (1, '23.960')] [2023-10-08 01:52:07,428][52059] Updated weights for policy 1, policy_version 49092 (0.0010) [2023-10-08 01:52:07,786][52059] Updated weights for policy 1, policy_version 49102 (0.0008) [2023-10-08 01:52:08,160][52059] Updated weights for policy 1, policy_version 49112 (0.0007) [2023-10-08 01:52:09,719][52060] Updated weights for policy 0, policy_version 48490 (0.0008) [2023-10-08 01:52:10,089][52060] Updated weights for policy 0, policy_version 48500 (0.0008) [2023-10-08 01:52:10,452][52060] Updated weights for policy 0, policy_version 48510 (0.0007) [2023-10-08 01:52:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 99975168. Throughput: 0: 1719.7, 1: 1728.4. Samples: 25000086. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:52:11,211][50642] Avg episode reward: [(0, '19.680'), (1, '22.380')] [2023-10-08 01:52:11,978][52059] Updated weights for policy 1, policy_version 49122 (0.0009) [2023-10-08 01:52:12,342][52059] Updated weights for policy 1, policy_version 49132 (0.0009) [2023-10-08 01:52:12,713][52059] Updated weights for policy 1, policy_version 49142 (0.0007) [2023-10-08 01:52:13,073][52059] Updated weights for policy 1, policy_version 49152 (0.0008) [2023-10-08 01:52:14,505][52060] Updated weights for policy 0, policy_version 48520 (0.0009) [2023-10-08 01:52:14,870][52060] Updated weights for policy 0, policy_version 48530 (0.0011) [2023-10-08 01:52:15,244][52060] Updated weights for policy 0, policy_version 48540 (0.0010) [2023-10-08 01:52:16,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 100040704. Throughput: 0: 1700.9, 1: 1751.4. Samples: 25020586. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 01:52:16,211][50642] Avg episode reward: [(0, '21.420'), (1, '22.180')] [2023-10-08 01:52:17,134][52059] Updated weights for policy 1, policy_version 49162 (0.0008) [2023-10-08 01:52:17,496][52059] Updated weights for policy 1, policy_version 49172 (0.0008) [2023-10-08 01:52:17,872][52059] Updated weights for policy 1, policy_version 49182 (0.0008) [2023-10-08 01:52:19,406][52060] Updated weights for policy 0, policy_version 48550 (0.0010) [2023-10-08 01:52:19,770][52060] Updated weights for policy 0, policy_version 48560 (0.0008) [2023-10-08 01:52:20,144][52060] Updated weights for policy 0, policy_version 48570 (0.0011) [2023-10-08 01:52:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 100106240. Throughput: 0: 1733.2, 1: 1721.0. Samples: 25031320. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 01:52:21,211][50642] Avg episode reward: [(0, '19.990'), (1, '23.900')] [2023-10-08 01:52:21,731][52059] Updated weights for policy 1, policy_version 49192 (0.0009) [2023-10-08 01:52:22,096][52059] Updated weights for policy 1, policy_version 49202 (0.0007) [2023-10-08 01:52:22,464][52059] Updated weights for policy 1, policy_version 49212 (0.0008) [2023-10-08 01:52:24,039][52060] Updated weights for policy 0, policy_version 48580 (0.0010) [2023-10-08 01:52:24,411][52060] Updated weights for policy 0, policy_version 48590 (0.0009) [2023-10-08 01:52:24,784][52060] Updated weights for policy 0, policy_version 48600 (0.0007) [2023-10-08 01:52:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 100171776. Throughput: 0: 1699.4, 1: 1737.6. Samples: 25051530. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 01:52:26,211][50642] Avg episode reward: [(0, '19.350'), (1, '23.060')] [2023-10-08 01:52:26,453][52059] Updated weights for policy 1, policy_version 49222 (0.0010) [2023-10-08 01:52:26,821][52059] Updated weights for policy 1, policy_version 49232 (0.0010) [2023-10-08 01:52:27,182][52059] Updated weights for policy 1, policy_version 49242 (0.0010) [2023-10-08 01:52:28,450][52060] Updated weights for policy 0, policy_version 48610 (0.0007) [2023-10-08 01:52:28,823][52060] Updated weights for policy 0, policy_version 48620 (0.0007) [2023-10-08 01:52:29,192][52060] Updated weights for policy 0, policy_version 48630 (0.0007) [2023-10-08 01:52:29,553][52060] Updated weights for policy 0, policy_version 48640 (0.0008) [2023-10-08 01:52:31,057][52059] Updated weights for policy 1, policy_version 49252 (0.0009) [2023-10-08 01:52:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 100237312. Throughput: 0: 1707.2, 1: 1749.5. Samples: 25072834. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 01:52:31,211][50642] Avg episode reward: [(0, '21.790'), (1, '20.720')] [2023-10-08 01:52:31,426][52059] Updated weights for policy 1, policy_version 49262 (0.0008) [2023-10-08 01:52:31,790][52059] Updated weights for policy 1, policy_version 49272 (0.0011) [2023-10-08 01:52:33,484][52060] Updated weights for policy 0, policy_version 48650 (0.0010) [2023-10-08 01:52:33,850][52060] Updated weights for policy 0, policy_version 48660 (0.0008) [2023-10-08 01:52:34,212][52060] Updated weights for policy 0, policy_version 48670 (0.0008) [2023-10-08 01:52:35,686][52059] Updated weights for policy 1, policy_version 49282 (0.0007) [2023-10-08 01:52:36,058][52059] Updated weights for policy 1, policy_version 49292 (0.0008) [2023-10-08 01:52:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 100302848. Throughput: 0: 1704.0, 1: 1735.5. Samples: 25082820. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 01:52:36,211][50642] Avg episode reward: [(0, '21.190'), (1, '19.470')] [2023-10-08 01:52:36,430][52059] Updated weights for policy 1, policy_version 49302 (0.0009) [2023-10-08 01:52:36,793][52059] Updated weights for policy 1, policy_version 49312 (0.0009) [2023-10-08 01:52:38,344][52060] Updated weights for policy 0, policy_version 48680 (0.0009) [2023-10-08 01:52:38,717][52060] Updated weights for policy 0, policy_version 48690 (0.0007) [2023-10-08 01:52:39,082][52060] Updated weights for policy 0, policy_version 48700 (0.0010) [2023-10-08 01:52:40,641][52059] Updated weights for policy 1, policy_version 49322 (0.0009) [2023-10-08 01:52:41,002][52059] Updated weights for policy 1, policy_version 49332 (0.0008) [2023-10-08 01:52:41,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 100368384. Throughput: 0: 1693.0, 1: 1745.3. Samples: 25103502. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 01:52:41,211][50642] Avg episode reward: [(0, '19.210'), (1, '22.750')] [2023-10-08 01:52:41,368][52059] Updated weights for policy 1, policy_version 49342 (0.0010) [2023-10-08 01:52:43,071][52060] Updated weights for policy 0, policy_version 48710 (0.0009) [2023-10-08 01:52:43,450][52060] Updated weights for policy 0, policy_version 48720 (0.0008) [2023-10-08 01:52:43,812][52060] Updated weights for policy 0, policy_version 48730 (0.0009) [2023-10-08 01:52:45,409][52059] Updated weights for policy 1, policy_version 49352 (0.0010) [2023-10-08 01:52:45,766][52059] Updated weights for policy 1, policy_version 49362 (0.0010) [2023-10-08 01:52:46,135][52059] Updated weights for policy 1, policy_version 49372 (0.0007) [2023-10-08 01:52:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 100433920. Throughput: 0: 1725.1, 1: 1724.0. Samples: 25124100. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 01:52:46,211][50642] Avg episode reward: [(0, '21.460'), (1, '19.970')] [2023-10-08 01:52:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000048736_49905664.pth... [2023-10-08 01:52:46,253][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000047136_48267264.pth [2023-10-08 01:52:46,287][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000049376_50561024.pth... [2023-10-08 01:52:46,325][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000047744_48889856.pth [2023-10-08 01:52:47,739][52060] Updated weights for policy 0, policy_version 48740 (0.0008) [2023-10-08 01:52:48,118][52060] Updated weights for policy 0, policy_version 48750 (0.0008) [2023-10-08 01:52:48,482][52060] Updated weights for policy 0, policy_version 48760 (0.0012) [2023-10-08 01:52:49,812][52059] Updated weights for policy 1, policy_version 49382 (0.0007) [2023-10-08 01:52:50,168][52059] Updated weights for policy 1, policy_version 49392 (0.0007) [2023-10-08 01:52:50,536][52059] Updated weights for policy 1, policy_version 49402 (0.0008) [2023-10-08 01:52:51,210][50642] Fps is (10 sec: 16383.7, 60 sec: 13653.4, 300 sec: 13884.8). Total num frames: 100532224. Throughput: 0: 1696.9, 1: 1751.8. Samples: 25134562. Policy #0 lag: (min: 29.0, avg: 32.5, max: 61.0) [2023-10-08 01:52:51,211][50642] Avg episode reward: [(0, '20.840'), (1, '20.850')] [2023-10-08 01:52:52,448][52060] Updated weights for policy 0, policy_version 48770 (0.0009) [2023-10-08 01:52:52,817][52060] Updated weights for policy 0, policy_version 48780 (0.0009) [2023-10-08 01:52:53,186][52060] Updated weights for policy 0, policy_version 48790 (0.0008) [2023-10-08 01:52:53,553][52060] Updated weights for policy 0, policy_version 48800 (0.0009) [2023-10-08 01:52:54,548][52059] Updated weights for policy 1, policy_version 49412 (0.0008) [2023-10-08 01:52:54,909][52059] Updated weights for policy 1, policy_version 49422 (0.0007) [2023-10-08 01:52:55,268][52059] Updated weights for policy 1, policy_version 49432 (0.0007) [2023-10-08 01:52:56,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 100597760. Throughput: 0: 1710.9, 1: 1743.1. Samples: 25155516. Policy #0 lag: (min: 29.0, avg: 32.5, max: 61.0) [2023-10-08 01:52:56,211][50642] Avg episode reward: [(0, '19.270'), (1, '22.840')] [2023-10-08 01:52:57,380][52060] Updated weights for policy 0, policy_version 48810 (0.0009) [2023-10-08 01:52:57,754][52060] Updated weights for policy 0, policy_version 48820 (0.0008) [2023-10-08 01:52:58,127][52060] Updated weights for policy 0, policy_version 48830 (0.0007) [2023-10-08 01:52:59,364][52059] Updated weights for policy 1, policy_version 49442 (0.0008) [2023-10-08 01:52:59,727][52059] Updated weights for policy 1, policy_version 49452 (0.0007) [2023-10-08 01:53:00,098][52059] Updated weights for policy 1, policy_version 49462 (0.0008) [2023-10-08 01:53:00,459][52059] Updated weights for policy 1, policy_version 49472 (0.0009) [2023-10-08 01:53:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 100663296. Throughput: 0: 1736.1, 1: 1721.4. Samples: 25176174. Policy #0 lag: (min: 29.0, avg: 32.5, max: 61.0) [2023-10-08 01:53:01,211][50642] Avg episode reward: [(0, '20.210'), (1, '22.560')] [2023-10-08 01:53:02,063][52060] Updated weights for policy 0, policy_version 48840 (0.0007) [2023-10-08 01:53:02,444][52060] Updated weights for policy 0, policy_version 48850 (0.0008) [2023-10-08 01:53:02,814][52060] Updated weights for policy 0, policy_version 48860 (0.0009) [2023-10-08 01:53:04,246][52059] Updated weights for policy 1, policy_version 49482 (0.0010) [2023-10-08 01:53:04,613][52059] Updated weights for policy 1, policy_version 49492 (0.0009) [2023-10-08 01:53:04,980][52059] Updated weights for policy 1, policy_version 49502 (0.0008) [2023-10-08 01:53:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 100728832. Throughput: 0: 1705.2, 1: 1756.4. Samples: 25187088. Policy #0 lag: (min: 29.0, avg: 32.5, max: 61.0) [2023-10-08 01:53:06,211][50642] Avg episode reward: [(0, '20.050'), (1, '20.930')] [2023-10-08 01:53:06,835][52060] Updated weights for policy 0, policy_version 48870 (0.0008) [2023-10-08 01:53:07,201][52060] Updated weights for policy 0, policy_version 48880 (0.0008) [2023-10-08 01:53:07,571][52060] Updated weights for policy 0, policy_version 48890 (0.0008) [2023-10-08 01:53:08,717][52059] Updated weights for policy 1, policy_version 49512 (0.0008) [2023-10-08 01:53:09,076][52059] Updated weights for policy 1, policy_version 49522 (0.0008) [2023-10-08 01:53:09,442][52059] Updated weights for policy 1, policy_version 49532 (0.0008) [2023-10-08 01:53:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 100794368. Throughput: 0: 1734.3, 1: 1731.7. Samples: 25207500. Policy #0 lag: (min: 29.0, avg: 32.5, max: 61.0) [2023-10-08 01:53:11,211][50642] Avg episode reward: [(0, '19.500'), (1, '21.570')] [2023-10-08 01:53:11,506][52060] Updated weights for policy 0, policy_version 48900 (0.0009) [2023-10-08 01:53:11,872][52060] Updated weights for policy 0, policy_version 48910 (0.0009) [2023-10-08 01:53:12,246][52060] Updated weights for policy 0, policy_version 48920 (0.0010) [2023-10-08 01:53:13,297][52059] Updated weights for policy 1, policy_version 49542 (0.0007) [2023-10-08 01:53:13,651][52059] Updated weights for policy 1, policy_version 49552 (0.0008) [2023-10-08 01:53:14,021][52059] Updated weights for policy 1, policy_version 49562 (0.0010) [2023-10-08 01:53:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 100859904. Throughput: 0: 1733.6, 1: 1735.3. Samples: 25228934. Policy #0 lag: (min: 29.0, avg: 32.5, max: 61.0) [2023-10-08 01:53:16,211][50642] Avg episode reward: [(0, '21.570'), (1, '21.380')] [2023-10-08 01:53:16,261][52060] Updated weights for policy 0, policy_version 48930 (0.0009) [2023-10-08 01:53:16,631][52060] Updated weights for policy 0, policy_version 48940 (0.0007) [2023-10-08 01:53:17,009][52060] Updated weights for policy 0, policy_version 48950 (0.0009) [2023-10-08 01:53:17,367][52060] Updated weights for policy 0, policy_version 48960 (0.0010) [2023-10-08 01:53:17,858][52059] Updated weights for policy 1, policy_version 49572 (0.0009) [2023-10-08 01:53:18,226][52059] Updated weights for policy 1, policy_version 49582 (0.0008) [2023-10-08 01:53:18,583][52059] Updated weights for policy 1, policy_version 49592 (0.0009) [2023-10-08 01:53:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 100925440. Throughput: 0: 1723.1, 1: 1739.1. Samples: 25238622. Policy #0 lag: (min: 29.0, avg: 32.5, max: 61.0) [2023-10-08 01:53:21,211][50642] Avg episode reward: [(0, '21.020'), (1, '19.410')] [2023-10-08 01:53:21,433][52060] Updated weights for policy 0, policy_version 48970 (0.0010) [2023-10-08 01:53:21,798][52060] Updated weights for policy 0, policy_version 48980 (0.0008) [2023-10-08 01:53:22,166][52060] Updated weights for policy 0, policy_version 48990 (0.0010) [2023-10-08 01:53:22,495][52059] Updated weights for policy 1, policy_version 49602 (0.0008) [2023-10-08 01:53:22,851][52059] Updated weights for policy 1, policy_version 49612 (0.0010) [2023-10-08 01:53:23,220][52059] Updated weights for policy 1, policy_version 49622 (0.0009) [2023-10-08 01:53:23,587][52059] Updated weights for policy 1, policy_version 49632 (0.0010) [2023-10-08 01:53:26,053][52060] Updated weights for policy 0, policy_version 49000 (0.0008) [2023-10-08 01:53:26,211][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 100990976. Throughput: 0: 1732.9, 1: 1746.1. Samples: 25260056. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:53:26,212][50642] Avg episode reward: [(0, '20.040'), (1, '22.390')] [2023-10-08 01:53:26,424][52060] Updated weights for policy 0, policy_version 49010 (0.0009) [2023-10-08 01:53:26,784][52060] Updated weights for policy 0, policy_version 49020 (0.0009) [2023-10-08 01:53:27,433][52059] Updated weights for policy 1, policy_version 49642 (0.0007) [2023-10-08 01:53:27,794][52059] Updated weights for policy 1, policy_version 49652 (0.0008) [2023-10-08 01:53:28,149][52059] Updated weights for policy 1, policy_version 49662 (0.0008) [2023-10-08 01:53:30,766][52060] Updated weights for policy 0, policy_version 49030 (0.0009) [2023-10-08 01:53:31,143][52060] Updated weights for policy 0, policy_version 49040 (0.0010) [2023-10-08 01:53:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 101056512. Throughput: 0: 1720.5, 1: 1763.1. Samples: 25280866. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:53:31,211][50642] Avg episode reward: [(0, '21.070'), (1, '20.160')] [2023-10-08 01:53:31,512][52060] Updated weights for policy 0, policy_version 49050 (0.0009) [2023-10-08 01:53:32,205][52059] Updated weights for policy 1, policy_version 49672 (0.0011) [2023-10-08 01:53:32,566][52059] Updated weights for policy 1, policy_version 49682 (0.0010) [2023-10-08 01:53:32,930][52059] Updated weights for policy 1, policy_version 49692 (0.0007) [2023-10-08 01:53:35,581][52060] Updated weights for policy 0, policy_version 49060 (0.0009) [2023-10-08 01:53:35,950][52060] Updated weights for policy 0, policy_version 49070 (0.0008) [2023-10-08 01:53:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 101122048. Throughput: 0: 1727.2, 1: 1735.7. Samples: 25290396. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:53:36,212][50642] Avg episode reward: [(0, '20.310'), (1, '20.420')] [2023-10-08 01:53:36,328][52060] Updated weights for policy 0, policy_version 49080 (0.0008) [2023-10-08 01:53:36,859][52059] Updated weights for policy 1, policy_version 49702 (0.0008) [2023-10-08 01:53:37,225][52059] Updated weights for policy 1, policy_version 49712 (0.0008) [2023-10-08 01:53:37,580][52059] Updated weights for policy 1, policy_version 49722 (0.0009) [2023-10-08 01:53:40,173][52060] Updated weights for policy 0, policy_version 49090 (0.0009) [2023-10-08 01:53:40,545][52060] Updated weights for policy 0, policy_version 49100 (0.0010) [2023-10-08 01:53:40,914][52060] Updated weights for policy 0, policy_version 49110 (0.0009) [2023-10-08 01:53:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 101187584. Throughput: 0: 1727.9, 1: 1747.1. Samples: 25311892. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:53:41,211][50642] Avg episode reward: [(0, '19.380'), (1, '21.020')] [2023-10-08 01:53:41,284][52060] Updated weights for policy 0, policy_version 49120 (0.0007) [2023-10-08 01:53:41,492][52059] Updated weights for policy 1, policy_version 49732 (0.0009) [2023-10-08 01:53:41,851][52059] Updated weights for policy 1, policy_version 49742 (0.0010) [2023-10-08 01:53:42,226][52059] Updated weights for policy 1, policy_version 49752 (0.0009) [2023-10-08 01:53:45,151][52060] Updated weights for policy 0, policy_version 49130 (0.0007) [2023-10-08 01:53:45,510][52060] Updated weights for policy 0, policy_version 49140 (0.0009) [2023-10-08 01:53:45,888][52060] Updated weights for policy 0, policy_version 49150 (0.0009) [2023-10-08 01:53:46,210][50642] Fps is (10 sec: 16384.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 101285888. Throughput: 0: 1696.2, 1: 1766.2. Samples: 25331982. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:53:46,211][50642] Avg episode reward: [(0, '18.000'), (1, '22.540')] [2023-10-08 01:53:46,237][52059] Updated weights for policy 1, policy_version 49762 (0.0009) [2023-10-08 01:53:46,605][52059] Updated weights for policy 1, policy_version 49772 (0.0009) [2023-10-08 01:53:46,966][52059] Updated weights for policy 1, policy_version 49782 (0.0009) [2023-10-08 01:53:47,327][52059] Updated weights for policy 1, policy_version 49792 (0.0007) [2023-10-08 01:53:49,812][52060] Updated weights for policy 0, policy_version 49160 (0.0010) [2023-10-08 01:53:50,186][52060] Updated weights for policy 0, policy_version 49170 (0.0009) [2023-10-08 01:53:50,564][52060] Updated weights for policy 0, policy_version 49180 (0.0008) [2023-10-08 01:53:51,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 101351424. Throughput: 0: 1719.7, 1: 1728.3. Samples: 25342248. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:53:51,211][50642] Avg episode reward: [(0, '18.050'), (1, '20.660')] [2023-10-08 01:53:51,494][52059] Updated weights for policy 1, policy_version 49802 (0.0009) [2023-10-08 01:53:51,865][52059] Updated weights for policy 1, policy_version 49812 (0.0008) [2023-10-08 01:53:52,240][52059] Updated weights for policy 1, policy_version 49822 (0.0009) [2023-10-08 01:53:54,550][52060] Updated weights for policy 0, policy_version 49190 (0.0009) [2023-10-08 01:53:54,919][52060] Updated weights for policy 0, policy_version 49200 (0.0007) [2023-10-08 01:53:55,290][52060] Updated weights for policy 0, policy_version 49210 (0.0010) [2023-10-08 01:53:56,129][52059] Updated weights for policy 1, policy_version 49832 (0.0007) [2023-10-08 01:53:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 101416960. Throughput: 0: 1707.0, 1: 1751.3. Samples: 25363124. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:53:56,211][50642] Avg episode reward: [(0, '16.590'), (1, '19.310')] [2023-10-08 01:53:56,492][52059] Updated weights for policy 1, policy_version 49842 (0.0007) [2023-10-08 01:53:56,852][52059] Updated weights for policy 1, policy_version 49852 (0.0007) [2023-10-08 01:53:59,425][52060] Updated weights for policy 0, policy_version 49220 (0.0010) [2023-10-08 01:53:59,789][52060] Updated weights for policy 0, policy_version 49230 (0.0007) [2023-10-08 01:54:00,162][52060] Updated weights for policy 0, policy_version 49240 (0.0009) [2023-10-08 01:54:00,802][52059] Updated weights for policy 1, policy_version 49862 (0.0008) [2023-10-08 01:54:01,157][52059] Updated weights for policy 1, policy_version 49872 (0.0008) [2023-10-08 01:54:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 101482496. Throughput: 0: 1694.6, 1: 1740.9. Samples: 25383532. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:54:01,211][50642] Avg episode reward: [(0, '16.900'), (1, '22.320')] [2023-10-08 01:54:01,532][52059] Updated weights for policy 1, policy_version 49882 (0.0008) [2023-10-08 01:54:04,197][52060] Updated weights for policy 0, policy_version 49250 (0.0008) [2023-10-08 01:54:04,570][52060] Updated weights for policy 0, policy_version 49260 (0.0008) [2023-10-08 01:54:04,937][52060] Updated weights for policy 0, policy_version 49270 (0.0007) [2023-10-08 01:54:05,302][52060] Updated weights for policy 0, policy_version 49280 (0.0008) [2023-10-08 01:54:05,409][52059] Updated weights for policy 1, policy_version 49892 (0.0009) [2023-10-08 01:54:05,780][52059] Updated weights for policy 1, policy_version 49902 (0.0010) [2023-10-08 01:54:06,132][52059] Updated weights for policy 1, policy_version 49912 (0.0007) [2023-10-08 01:54:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 101548032. Throughput: 0: 1721.1, 1: 1737.4. Samples: 25394254. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:54:06,211][50642] Avg episode reward: [(0, '17.770'), (1, '20.350')] [2023-10-08 01:54:09,313][52060] Updated weights for policy 0, policy_version 49290 (0.0007) [2023-10-08 01:54:09,680][52060] Updated weights for policy 0, policy_version 49300 (0.0007) [2023-10-08 01:54:10,046][52059] Updated weights for policy 1, policy_version 49922 (0.0007) [2023-10-08 01:54:10,049][52060] Updated weights for policy 0, policy_version 49310 (0.0008) [2023-10-08 01:54:10,406][52059] Updated weights for policy 1, policy_version 49932 (0.0009) [2023-10-08 01:54:10,771][52059] Updated weights for policy 1, policy_version 49942 (0.0009) [2023-10-08 01:54:11,142][52059] Updated weights for policy 1, policy_version 49952 (0.0011) [2023-10-08 01:54:11,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 101646336. Throughput: 0: 1700.9, 1: 1737.8. Samples: 25414796. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:54:11,211][50642] Avg episode reward: [(0, '19.450'), (1, '21.760')] [2023-10-08 01:54:13,723][52060] Updated weights for policy 0, policy_version 49320 (0.0007) [2023-10-08 01:54:14,096][52060] Updated weights for policy 0, policy_version 49330 (0.0008) [2023-10-08 01:54:14,468][52060] Updated weights for policy 0, policy_version 49340 (0.0008) [2023-10-08 01:54:15,091][52059] Updated weights for policy 1, policy_version 49962 (0.0008) [2023-10-08 01:54:15,457][52059] Updated weights for policy 1, policy_version 49972 (0.0009) [2023-10-08 01:54:15,814][52059] Updated weights for policy 1, policy_version 49982 (0.0009) [2023-10-08 01:54:16,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 101711872. Throughput: 0: 1705.3, 1: 1710.4. Samples: 25434570. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:54:16,211][50642] Avg episode reward: [(0, '19.800'), (1, '22.650')] [2023-10-08 01:54:18,401][52060] Updated weights for policy 0, policy_version 49350 (0.0007) [2023-10-08 01:54:18,788][52060] Updated weights for policy 0, policy_version 49360 (0.0007) [2023-10-08 01:54:19,150][52060] Updated weights for policy 0, policy_version 49370 (0.0009) [2023-10-08 01:54:19,732][52059] Updated weights for policy 1, policy_version 49992 (0.0009) [2023-10-08 01:54:20,118][52059] Updated weights for policy 1, policy_version 50002 (0.0008) [2023-10-08 01:54:20,475][52059] Updated weights for policy 1, policy_version 50012 (0.0008) [2023-10-08 01:54:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 101777408. Throughput: 0: 1712.9, 1: 1741.7. Samples: 25445854. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:54:21,211][50642] Avg episode reward: [(0, '21.070'), (1, '23.400')] [2023-10-08 01:54:23,268][52060] Updated weights for policy 0, policy_version 49380 (0.0008) [2023-10-08 01:54:23,634][52060] Updated weights for policy 0, policy_version 49390 (0.0007) [2023-10-08 01:54:24,003][52060] Updated weights for policy 0, policy_version 49400 (0.0008) [2023-10-08 01:54:24,403][52059] Updated weights for policy 1, policy_version 50022 (0.0007) [2023-10-08 01:54:24,768][52059] Updated weights for policy 1, policy_version 50032 (0.0007) [2023-10-08 01:54:25,129][52059] Updated weights for policy 1, policy_version 50042 (0.0008) [2023-10-08 01:54:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 101842944. Throughput: 0: 1701.0, 1: 1721.1. Samples: 25465886. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:54:26,211][50642] Avg episode reward: [(0, '21.170'), (1, '22.480')] [2023-10-08 01:54:27,946][52060] Updated weights for policy 0, policy_version 49410 (0.0008) [2023-10-08 01:54:28,317][52060] Updated weights for policy 0, policy_version 49420 (0.0009) [2023-10-08 01:54:28,682][52060] Updated weights for policy 0, policy_version 49430 (0.0008) [2023-10-08 01:54:28,949][52059] Updated weights for policy 1, policy_version 50052 (0.0007) [2023-10-08 01:54:29,050][52060] Updated weights for policy 0, policy_version 49440 (0.0008) [2023-10-08 01:54:29,312][52059] Updated weights for policy 1, policy_version 50062 (0.0008) [2023-10-08 01:54:29,687][52059] Updated weights for policy 1, policy_version 50072 (0.0009) [2023-10-08 01:54:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 101908480. Throughput: 0: 1723.8, 1: 1714.2. Samples: 25486692. Policy #0 lag: (min: 31.0, avg: 37.0, max: 63.0) [2023-10-08 01:54:31,211][50642] Avg episode reward: [(0, '19.200'), (1, '22.530')] [2023-10-08 01:54:32,893][52060] Updated weights for policy 0, policy_version 49450 (0.0007) [2023-10-08 01:54:33,263][52060] Updated weights for policy 0, policy_version 49460 (0.0007) [2023-10-08 01:54:33,624][52060] Updated weights for policy 0, policy_version 49470 (0.0007) [2023-10-08 01:54:33,662][52059] Updated weights for policy 1, policy_version 50082 (0.0007) [2023-10-08 01:54:34,027][52059] Updated weights for policy 1, policy_version 50092 (0.0008) [2023-10-08 01:54:34,385][52059] Updated weights for policy 1, policy_version 50102 (0.0008) [2023-10-08 01:54:34,749][52059] Updated weights for policy 1, policy_version 50112 (0.0008) [2023-10-08 01:54:36,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.6, 300 sec: 13884.7). Total num frames: 101974016. Throughput: 0: 1701.6, 1: 1740.1. Samples: 25497124. Policy #0 lag: (min: 31.0, avg: 37.0, max: 63.0) [2023-10-08 01:54:36,211][50642] Avg episode reward: [(0, '21.160'), (1, '22.630')] [2023-10-08 01:54:37,573][52060] Updated weights for policy 0, policy_version 49480 (0.0010) [2023-10-08 01:54:37,939][52060] Updated weights for policy 0, policy_version 49490 (0.0010) [2023-10-08 01:54:38,312][52060] Updated weights for policy 0, policy_version 49500 (0.0008) [2023-10-08 01:54:38,523][52059] Updated weights for policy 1, policy_version 50122 (0.0008) [2023-10-08 01:54:38,883][52059] Updated weights for policy 1, policy_version 50132 (0.0008) [2023-10-08 01:54:39,245][52059] Updated weights for policy 1, policy_version 50142 (0.0010) [2023-10-08 01:54:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 102039552. Throughput: 0: 1716.0, 1: 1720.3. Samples: 25517756. Policy #0 lag: (min: 31.0, avg: 37.0, max: 63.0) [2023-10-08 01:54:41,211][50642] Avg episode reward: [(0, '18.890'), (1, '22.740')] [2023-10-08 01:54:42,213][52060] Updated weights for policy 0, policy_version 49510 (0.0007) [2023-10-08 01:54:42,585][52060] Updated weights for policy 0, policy_version 49520 (0.0008) [2023-10-08 01:54:42,956][52060] Updated weights for policy 0, policy_version 49530 (0.0009) [2023-10-08 01:54:43,154][52059] Updated weights for policy 1, policy_version 50152 (0.0009) [2023-10-08 01:54:43,513][52059] Updated weights for policy 1, policy_version 50162 (0.0008) [2023-10-08 01:54:43,882][52059] Updated weights for policy 1, policy_version 50172 (0.0009) [2023-10-08 01:54:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 102105088. Throughput: 0: 1732.3, 1: 1727.5. Samples: 25539224. Policy #0 lag: (min: 31.0, avg: 37.0, max: 63.0) [2023-10-08 01:54:46,211][50642] Avg episode reward: [(0, '18.850'), (1, '21.810')] [2023-10-08 01:54:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000050176_51380224.pth... [2023-10-08 01:54:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000049536_50724864.pth... [2023-10-08 01:54:46,259][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000047936_49086464.pth [2023-10-08 01:54:46,260][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000048544_49709056.pth [2023-10-08 01:54:46,831][52060] Updated weights for policy 0, policy_version 49540 (0.0007) [2023-10-08 01:54:47,195][52060] Updated weights for policy 0, policy_version 49550 (0.0008) [2023-10-08 01:54:47,558][52060] Updated weights for policy 0, policy_version 49560 (0.0008) [2023-10-08 01:54:47,916][52059] Updated weights for policy 1, policy_version 50182 (0.0009) [2023-10-08 01:54:48,287][52059] Updated weights for policy 1, policy_version 50192 (0.0011) [2023-10-08 01:54:48,650][52059] Updated weights for policy 1, policy_version 50202 (0.0012) [2023-10-08 01:54:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 102170624. Throughput: 0: 1707.2, 1: 1727.0. Samples: 25548790. Policy #0 lag: (min: 31.0, avg: 37.0, max: 63.0) [2023-10-08 01:54:51,211][50642] Avg episode reward: [(0, '21.400'), (1, '23.110')] [2023-10-08 01:54:51,703][52060] Updated weights for policy 0, policy_version 49570 (0.0010) [2023-10-08 01:54:52,073][52060] Updated weights for policy 0, policy_version 49580 (0.0009) [2023-10-08 01:54:52,448][52060] Updated weights for policy 0, policy_version 49590 (0.0009) [2023-10-08 01:54:52,617][52059] Updated weights for policy 1, policy_version 50212 (0.0009) [2023-10-08 01:54:52,812][52060] Updated weights for policy 0, policy_version 49600 (0.0008) [2023-10-08 01:54:52,987][52059] Updated weights for policy 1, policy_version 50222 (0.0008) [2023-10-08 01:54:53,344][52059] Updated weights for policy 1, policy_version 50232 (0.0009) [2023-10-08 01:54:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 102236160. Throughput: 0: 1722.0, 1: 1723.6. Samples: 25569848. Policy #0 lag: (min: 31.0, avg: 37.0, max: 63.0) [2023-10-08 01:54:56,211][50642] Avg episode reward: [(0, '19.750'), (1, '23.190')] [2023-10-08 01:54:56,777][52060] Updated weights for policy 0, policy_version 49610 (0.0009) [2023-10-08 01:54:57,154][52060] Updated weights for policy 0, policy_version 49620 (0.0009) [2023-10-08 01:54:57,197][52059] Updated weights for policy 1, policy_version 50242 (0.0009) [2023-10-08 01:54:57,522][52060] Updated weights for policy 0, policy_version 49630 (0.0007) [2023-10-08 01:54:57,561][52059] Updated weights for policy 1, policy_version 50252 (0.0008) [2023-10-08 01:54:57,920][52059] Updated weights for policy 1, policy_version 50262 (0.0008) [2023-10-08 01:54:58,284][52059] Updated weights for policy 1, policy_version 50272 (0.0009) [2023-10-08 01:55:01,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 102301696. Throughput: 0: 1726.4, 1: 1749.2. Samples: 25590974. Policy #0 lag: (min: 31.0, avg: 37.0, max: 63.0) [2023-10-08 01:55:01,211][50642] Avg episode reward: [(0, '19.580'), (1, '23.300')] [2023-10-08 01:55:01,674][52060] Updated weights for policy 0, policy_version 49640 (0.0007) [2023-10-08 01:55:02,036][52060] Updated weights for policy 0, policy_version 49650 (0.0007) [2023-10-08 01:55:02,266][52059] Updated weights for policy 1, policy_version 50282 (0.0007) [2023-10-08 01:55:02,414][52060] Updated weights for policy 0, policy_version 49660 (0.0007) [2023-10-08 01:55:02,629][52059] Updated weights for policy 1, policy_version 50292 (0.0008) [2023-10-08 01:55:03,003][52059] Updated weights for policy 1, policy_version 50302 (0.0010) [2023-10-08 01:55:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 102367232. Throughput: 0: 1712.2, 1: 1719.7. Samples: 25600288. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:55:06,211][50642] Avg episode reward: [(0, '21.860'), (1, '19.660')] [2023-10-08 01:55:06,427][52060] Updated weights for policy 0, policy_version 49670 (0.0007) [2023-10-08 01:55:06,815][52060] Updated weights for policy 0, policy_version 49680 (0.0007) [2023-10-08 01:55:07,034][52059] Updated weights for policy 1, policy_version 50312 (0.0007) [2023-10-08 01:55:07,190][52060] Updated weights for policy 0, policy_version 49690 (0.0007) [2023-10-08 01:55:07,396][52059] Updated weights for policy 1, policy_version 50322 (0.0007) [2023-10-08 01:55:07,759][52059] Updated weights for policy 1, policy_version 50332 (0.0007) [2023-10-08 01:55:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 102432768. Throughput: 0: 1719.6, 1: 1734.4. Samples: 25621316. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:55:11,211][50642] Avg episode reward: [(0, '21.130'), (1, '21.660')] [2023-10-08 01:55:11,272][52060] Updated weights for policy 0, policy_version 49700 (0.0008) [2023-10-08 01:55:11,578][52059] Updated weights for policy 1, policy_version 50342 (0.0009) [2023-10-08 01:55:11,641][52060] Updated weights for policy 0, policy_version 49710 (0.0010) [2023-10-08 01:55:11,949][52059] Updated weights for policy 1, policy_version 50352 (0.0007) [2023-10-08 01:55:12,017][52060] Updated weights for policy 0, policy_version 49720 (0.0010) [2023-10-08 01:55:12,314][52059] Updated weights for policy 1, policy_version 50362 (0.0009) [2023-10-08 01:55:15,932][52060] Updated weights for policy 0, policy_version 49730 (0.0008) [2023-10-08 01:55:16,201][52059] Updated weights for policy 1, policy_version 50372 (0.0009) [2023-10-08 01:55:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 102498304. Throughput: 0: 1720.4, 1: 1742.6. Samples: 25642526. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:55:16,211][50642] Avg episode reward: [(0, '18.990'), (1, '16.800')] [2023-10-08 01:55:16,307][52060] Updated weights for policy 0, policy_version 49740 (0.0008) [2023-10-08 01:55:16,564][52059] Updated weights for policy 1, policy_version 50382 (0.0007) [2023-10-08 01:55:16,672][52060] Updated weights for policy 0, policy_version 49750 (0.0010) [2023-10-08 01:55:16,933][52059] Updated weights for policy 1, policy_version 50392 (0.0007) [2023-10-08 01:55:17,049][52060] Updated weights for policy 0, policy_version 49760 (0.0008) [2023-10-08 01:55:20,900][52059] Updated weights for policy 1, policy_version 50402 (0.0009) [2023-10-08 01:55:21,055][52060] Updated weights for policy 0, policy_version 49770 (0.0009) [2023-10-08 01:55:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 102563840. Throughput: 0: 1716.4, 1: 1720.1. Samples: 25651766. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:55:21,211][50642] Avg episode reward: [(0, '22.070'), (1, '16.120')] [2023-10-08 01:55:21,265][52059] Updated weights for policy 1, policy_version 50412 (0.0008) [2023-10-08 01:55:21,422][52060] Updated weights for policy 0, policy_version 49780 (0.0007) [2023-10-08 01:55:21,636][52059] Updated weights for policy 1, policy_version 50422 (0.0007) [2023-10-08 01:55:21,795][52060] Updated weights for policy 0, policy_version 49790 (0.0008) [2023-10-08 01:55:21,998][52059] Updated weights for policy 1, policy_version 50432 (0.0010) [2023-10-08 01:55:25,681][52060] Updated weights for policy 0, policy_version 49800 (0.0008) [2023-10-08 01:55:25,909][52059] Updated weights for policy 1, policy_version 50442 (0.0009) [2023-10-08 01:55:26,053][52060] Updated weights for policy 0, policy_version 49810 (0.0010) [2023-10-08 01:55:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 102629376. Throughput: 0: 1711.2, 1: 1739.0. Samples: 25673014. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:55:26,211][50642] Avg episode reward: [(0, '21.890'), (1, '18.680')] [2023-10-08 01:55:26,284][52059] Updated weights for policy 1, policy_version 50452 (0.0009) [2023-10-08 01:55:26,417][52060] Updated weights for policy 0, policy_version 49820 (0.0009) [2023-10-08 01:55:26,651][52059] Updated weights for policy 1, policy_version 50462 (0.0008) [2023-10-08 01:55:30,278][52060] Updated weights for policy 0, policy_version 49830 (0.0010) [2023-10-08 01:55:30,640][52060] Updated weights for policy 0, policy_version 49840 (0.0009) [2023-10-08 01:55:30,711][52059] Updated weights for policy 1, policy_version 50472 (0.0009) [2023-10-08 01:55:31,018][52060] Updated weights for policy 0, policy_version 49850 (0.0009) [2023-10-08 01:55:31,075][52059] Updated weights for policy 1, policy_version 50482 (0.0008) [2023-10-08 01:55:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 102694912. Throughput: 0: 1694.1, 1: 1721.3. Samples: 25692920. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 01:55:31,211][50642] Avg episode reward: [(0, '19.290'), (1, '17.060')] [2023-10-08 01:55:31,434][52059] Updated weights for policy 1, policy_version 50492 (0.0007) [2023-10-08 01:55:34,961][52060] Updated weights for policy 0, policy_version 49860 (0.0009) [2023-10-08 01:55:35,190][52059] Updated weights for policy 1, policy_version 50502 (0.0008) [2023-10-08 01:55:35,330][52060] Updated weights for policy 0, policy_version 49870 (0.0008) [2023-10-08 01:55:35,550][52059] Updated weights for policy 1, policy_version 50512 (0.0009) [2023-10-08 01:55:35,700][52060] Updated weights for policy 0, policy_version 49880 (0.0009) [2023-10-08 01:55:35,907][52059] Updated weights for policy 1, policy_version 50522 (0.0008) [2023-10-08 01:55:36,210][50642] Fps is (10 sec: 19660.4, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 102825984. Throughput: 0: 1709.7, 1: 1735.6. Samples: 25703830. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) [2023-10-08 01:55:36,211][50642] Avg episode reward: [(0, '20.440'), (1, '16.620')] [2023-10-08 01:55:39,768][52060] Updated weights for policy 0, policy_version 49890 (0.0008) [2023-10-08 01:55:39,889][52059] Updated weights for policy 1, policy_version 50532 (0.0010) [2023-10-08 01:55:40,132][52060] Updated weights for policy 0, policy_version 49900 (0.0007) [2023-10-08 01:55:40,243][52059] Updated weights for policy 1, policy_version 50542 (0.0008) [2023-10-08 01:55:40,494][52060] Updated weights for policy 0, policy_version 49910 (0.0007) [2023-10-08 01:55:40,610][52059] Updated weights for policy 1, policy_version 50552 (0.0008) [2023-10-08 01:55:40,867][52060] Updated weights for policy 0, policy_version 49920 (0.0007) [2023-10-08 01:55:41,210][50642] Fps is (10 sec: 19660.6, 60 sec: 14199.4, 300 sec: 13884.8). Total num frames: 102891520. Throughput: 0: 1713.7, 1: 1730.0. Samples: 25724818. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) [2023-10-08 01:55:41,211][50642] Avg episode reward: [(0, '22.810'), (1, '18.140')] [2023-10-08 01:55:44,859][52059] Updated weights for policy 1, policy_version 50562 (0.0008) [2023-10-08 01:55:44,922][52060] Updated weights for policy 0, policy_version 49930 (0.0009) [2023-10-08 01:55:45,221][52059] Updated weights for policy 1, policy_version 50572 (0.0007) [2023-10-08 01:55:45,291][52060] Updated weights for policy 0, policy_version 49940 (0.0010) [2023-10-08 01:55:45,587][52059] Updated weights for policy 1, policy_version 50582 (0.0009) [2023-10-08 01:55:45,658][52060] Updated weights for policy 0, policy_version 49950 (0.0008) [2023-10-08 01:55:45,945][52059] Updated weights for policy 1, policy_version 50592 (0.0008) [2023-10-08 01:55:46,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 102957056. Throughput: 0: 1689.3, 1: 1705.5. Samples: 25743740. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) [2023-10-08 01:55:46,211][50642] Avg episode reward: [(0, '18.770'), (1, '21.220')] [2023-10-08 01:55:49,631][52060] Updated weights for policy 0, policy_version 49960 (0.0008) [2023-10-08 01:55:49,883][52059] Updated weights for policy 1, policy_version 50602 (0.0009) [2023-10-08 01:55:50,005][52060] Updated weights for policy 0, policy_version 49970 (0.0007) [2023-10-08 01:55:50,249][52059] Updated weights for policy 1, policy_version 50612 (0.0009) [2023-10-08 01:55:50,363][52060] Updated weights for policy 0, policy_version 49980 (0.0008) [2023-10-08 01:55:50,618][52059] Updated weights for policy 1, policy_version 50622 (0.0008) [2023-10-08 01:55:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 103022592. Throughput: 0: 1719.2, 1: 1732.8. Samples: 25755632. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) [2023-10-08 01:55:51,211][50642] Avg episode reward: [(0, '19.530'), (1, '18.530')] [2023-10-08 01:55:54,386][52060] Updated weights for policy 0, policy_version 49990 (0.0008) [2023-10-08 01:55:54,615][52059] Updated weights for policy 1, policy_version 50632 (0.0008) [2023-10-08 01:55:54,764][52060] Updated weights for policy 0, policy_version 50000 (0.0007) [2023-10-08 01:55:54,987][52059] Updated weights for policy 1, policy_version 50642 (0.0007) [2023-10-08 01:55:55,134][52060] Updated weights for policy 0, policy_version 50010 (0.0008) [2023-10-08 01:55:55,353][52059] Updated weights for policy 1, policy_version 50652 (0.0007) [2023-10-08 01:55:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 103088128. Throughput: 0: 1709.2, 1: 1722.7. Samples: 25775752. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) [2023-10-08 01:55:56,211][50642] Avg episode reward: [(0, '22.980'), (1, '17.360')] [2023-10-08 01:55:58,931][52060] Updated weights for policy 0, policy_version 50020 (0.0008) [2023-10-08 01:55:59,299][52060] Updated weights for policy 0, policy_version 50030 (0.0007) [2023-10-08 01:55:59,378][52059] Updated weights for policy 1, policy_version 50662 (0.0007) [2023-10-08 01:55:59,667][52060] Updated weights for policy 0, policy_version 50040 (0.0008) [2023-10-08 01:55:59,743][52059] Updated weights for policy 1, policy_version 50672 (0.0009) [2023-10-08 01:56:00,107][52059] Updated weights for policy 1, policy_version 50682 (0.0008) [2023-10-08 01:56:01,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 103153664. Throughput: 0: 1697.8, 1: 1703.1. Samples: 25795568. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) [2023-10-08 01:56:01,211][50642] Avg episode reward: [(0, '21.280'), (1, '23.740')] [2023-10-08 01:56:03,647][52060] Updated weights for policy 0, policy_version 50050 (0.0011) [2023-10-08 01:56:03,926][52059] Updated weights for policy 1, policy_version 50692 (0.0008) [2023-10-08 01:56:04,017][52060] Updated weights for policy 0, policy_version 50060 (0.0009) [2023-10-08 01:56:04,287][52059] Updated weights for policy 1, policy_version 50702 (0.0007) [2023-10-08 01:56:04,379][52060] Updated weights for policy 0, policy_version 50070 (0.0007) [2023-10-08 01:56:04,645][52059] Updated weights for policy 1, policy_version 50712 (0.0009) [2023-10-08 01:56:04,746][52060] Updated weights for policy 0, policy_version 50080 (0.0007) [2023-10-08 01:56:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 103219200. Throughput: 0: 1722.6, 1: 1734.6. Samples: 25807340. Policy #0 lag: (min: 24.0, avg: 50.0, max: 56.0) [2023-10-08 01:56:06,211][50642] Avg episode reward: [(0, '18.000'), (1, '21.620')] [2023-10-08 01:56:08,535][52060] Updated weights for policy 0, policy_version 50090 (0.0008) [2023-10-08 01:56:08,733][52059] Updated weights for policy 1, policy_version 50722 (0.0007) [2023-10-08 01:56:08,900][52060] Updated weights for policy 0, policy_version 50100 (0.0007) [2023-10-08 01:56:09,094][52059] Updated weights for policy 1, policy_version 50732 (0.0009) [2023-10-08 01:56:09,268][52060] Updated weights for policy 0, policy_version 50110 (0.0008) [2023-10-08 01:56:09,466][52059] Updated weights for policy 1, policy_version 50742 (0.0008) [2023-10-08 01:56:09,826][52059] Updated weights for policy 1, policy_version 50752 (0.0009) [2023-10-08 01:56:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 103284736. Throughput: 0: 1701.8, 1: 1707.3. Samples: 25826426. Policy #0 lag: (min: 31.0, avg: 35.6, max: 63.0) [2023-10-08 01:56:11,211][50642] Avg episode reward: [(0, '22.020'), (1, '20.290')] [2023-10-08 01:56:13,216][52060] Updated weights for policy 0, policy_version 50120 (0.0007) [2023-10-08 01:56:13,576][52060] Updated weights for policy 0, policy_version 50130 (0.0007) [2023-10-08 01:56:13,861][52059] Updated weights for policy 1, policy_version 50762 (0.0008) [2023-10-08 01:56:13,945][52060] Updated weights for policy 0, policy_version 50140 (0.0007) [2023-10-08 01:56:14,224][52059] Updated weights for policy 1, policy_version 50772 (0.0008) [2023-10-08 01:56:14,596][52059] Updated weights for policy 1, policy_version 50782 (0.0008) [2023-10-08 01:56:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 103350272. Throughput: 0: 1716.3, 1: 1721.8. Samples: 25847634. Policy #0 lag: (min: 31.0, avg: 35.6, max: 63.0) [2023-10-08 01:56:16,211][50642] Avg episode reward: [(0, '22.870'), (1, '21.540')] [2023-10-08 01:56:17,995][52060] Updated weights for policy 0, policy_version 50150 (0.0007) [2023-10-08 01:56:18,369][52060] Updated weights for policy 0, policy_version 50160 (0.0007) [2023-10-08 01:56:18,485][52059] Updated weights for policy 1, policy_version 50792 (0.0008) [2023-10-08 01:56:18,728][52060] Updated weights for policy 0, policy_version 50170 (0.0007) [2023-10-08 01:56:18,838][52059] Updated weights for policy 1, policy_version 50802 (0.0008) [2023-10-08 01:56:19,200][52059] Updated weights for policy 1, policy_version 50812 (0.0008) [2023-10-08 01:56:21,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 103415808. Throughput: 0: 1701.3, 1: 1719.8. Samples: 25857780. Policy #0 lag: (min: 31.0, avg: 35.6, max: 63.0) [2023-10-08 01:56:21,211][50642] Avg episode reward: [(0, '17.770'), (1, '23.220')] [2023-10-08 01:56:22,859][52060] Updated weights for policy 0, policy_version 50180 (0.0007) [2023-10-08 01:56:23,115][52059] Updated weights for policy 1, policy_version 50822 (0.0007) [2023-10-08 01:56:23,225][52060] Updated weights for policy 0, policy_version 50190 (0.0009) [2023-10-08 01:56:23,494][52059] Updated weights for policy 1, policy_version 50832 (0.0008) [2023-10-08 01:56:23,600][52060] Updated weights for policy 0, policy_version 50200 (0.0008) [2023-10-08 01:56:23,858][52059] Updated weights for policy 1, policy_version 50842 (0.0009) [2023-10-08 01:56:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 103481344. Throughput: 0: 1700.2, 1: 1709.2. Samples: 25878244. Policy #0 lag: (min: 31.0, avg: 35.6, max: 63.0) [2023-10-08 01:56:26,211][50642] Avg episode reward: [(0, '20.290'), (1, '19.490')] [2023-10-08 01:56:27,429][52060] Updated weights for policy 0, policy_version 50210 (0.0008) [2023-10-08 01:56:27,808][52060] Updated weights for policy 0, policy_version 50220 (0.0008) [2023-10-08 01:56:27,830][52059] Updated weights for policy 1, policy_version 50852 (0.0008) [2023-10-08 01:56:28,169][52060] Updated weights for policy 0, policy_version 50230 (0.0008) [2023-10-08 01:56:28,183][52059] Updated weights for policy 1, policy_version 50862 (0.0008) [2023-10-08 01:56:28,536][52060] Updated weights for policy 0, policy_version 50240 (0.0008) [2023-10-08 01:56:28,546][52059] Updated weights for policy 1, policy_version 50872 (0.0007) [2023-10-08 01:56:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 103546880. Throughput: 0: 1730.3, 1: 1737.8. Samples: 25899804. Policy #0 lag: (min: 31.0, avg: 35.6, max: 63.0) [2023-10-08 01:56:31,211][50642] Avg episode reward: [(0, '23.370'), (1, '21.850')] [2023-10-08 01:56:32,384][52059] Updated weights for policy 1, policy_version 50882 (0.0009) [2023-10-08 01:56:32,669][52060] Updated weights for policy 0, policy_version 50250 (0.0008) [2023-10-08 01:56:32,746][52059] Updated weights for policy 1, policy_version 50892 (0.0008) [2023-10-08 01:56:33,042][52060] Updated weights for policy 0, policy_version 50260 (0.0007) [2023-10-08 01:56:33,107][52059] Updated weights for policy 1, policy_version 50902 (0.0009) [2023-10-08 01:56:33,409][52060] Updated weights for policy 0, policy_version 50270 (0.0007) [2023-10-08 01:56:33,457][52059] Updated weights for policy 1, policy_version 50912 (0.0008) [2023-10-08 01:56:36,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 103612416. Throughput: 0: 1697.8, 1: 1711.1. Samples: 25909034. Policy #0 lag: (min: 31.0, avg: 35.6, max: 63.0) [2023-10-08 01:56:36,211][50642] Avg episode reward: [(0, '18.250'), (1, '23.550')] [2023-10-08 01:56:37,170][52059] Updated weights for policy 1, policy_version 50922 (0.0007) [2023-10-08 01:56:37,525][52059] Updated weights for policy 1, policy_version 50932 (0.0009) [2023-10-08 01:56:37,538][52060] Updated weights for policy 0, policy_version 50280 (0.0008) [2023-10-08 01:56:37,886][52059] Updated weights for policy 1, policy_version 50942 (0.0008) [2023-10-08 01:56:37,909][52060] Updated weights for policy 0, policy_version 50290 (0.0008) [2023-10-08 01:56:38,274][52060] Updated weights for policy 0, policy_version 50300 (0.0009) [2023-10-08 01:56:41,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 103677952. Throughput: 0: 1712.1, 1: 1725.2. Samples: 25930428. Policy #0 lag: (min: 31.0, avg: 35.6, max: 63.0) [2023-10-08 01:56:41,211][50642] Avg episode reward: [(0, '18.920'), (1, '22.830')] [2023-10-08 01:56:41,991][52059] Updated weights for policy 1, policy_version 50952 (0.0009) [2023-10-08 01:56:42,276][52060] Updated weights for policy 0, policy_version 50310 (0.0009) [2023-10-08 01:56:42,366][52059] Updated weights for policy 1, policy_version 50962 (0.0009) [2023-10-08 01:56:42,648][52060] Updated weights for policy 0, policy_version 50320 (0.0007) [2023-10-08 01:56:42,732][52059] Updated weights for policy 1, policy_version 50972 (0.0007) [2023-10-08 01:56:43,021][52060] Updated weights for policy 0, policy_version 50330 (0.0010) [2023-10-08 01:56:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 103743488. Throughput: 0: 1720.9, 1: 1742.0. Samples: 25951398. Policy #0 lag: (min: 18.0, avg: 19.9, max: 49.0) [2023-10-08 01:56:46,211][50642] Avg episode reward: [(0, '21.810'), (1, '20.510')] [2023-10-08 01:56:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000050336_51544064.pth... [2023-10-08 01:56:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000050976_52199424.pth... [2023-10-08 01:56:46,259][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000049376_50561024.pth [2023-10-08 01:56:46,260][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000048736_49905664.pth [2023-10-08 01:56:46,725][52059] Updated weights for policy 1, policy_version 50982 (0.0008) [2023-10-08 01:56:46,927][52060] Updated weights for policy 0, policy_version 50340 (0.0009) [2023-10-08 01:56:47,082][52059] Updated weights for policy 1, policy_version 50992 (0.0007) [2023-10-08 01:56:47,296][52060] Updated weights for policy 0, policy_version 50350 (0.0007) [2023-10-08 01:56:47,454][52059] Updated weights for policy 1, policy_version 51002 (0.0007) [2023-10-08 01:56:47,661][52060] Updated weights for policy 0, policy_version 50360 (0.0008) [2023-10-08 01:56:51,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 103809024. Throughput: 0: 1698.1, 1: 1709.3. Samples: 25960674. Policy #0 lag: (min: 18.0, avg: 19.9, max: 49.0) [2023-10-08 01:56:51,211][50642] Avg episode reward: [(0, '21.150'), (1, '22.900')] [2023-10-08 01:56:51,340][52059] Updated weights for policy 1, policy_version 51012 (0.0008) [2023-10-08 01:56:51,690][52060] Updated weights for policy 0, policy_version 50370 (0.0009) [2023-10-08 01:56:51,704][52059] Updated weights for policy 1, policy_version 51022 (0.0009) [2023-10-08 01:56:52,058][52060] Updated weights for policy 0, policy_version 50380 (0.0010) [2023-10-08 01:56:52,060][52059] Updated weights for policy 1, policy_version 51032 (0.0008) [2023-10-08 01:56:52,423][52060] Updated weights for policy 0, policy_version 50390 (0.0009) [2023-10-08 01:56:52,788][52060] Updated weights for policy 0, policy_version 50400 (0.0010) [2023-10-08 01:56:56,073][52059] Updated weights for policy 1, policy_version 51042 (0.0008) [2023-10-08 01:56:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 103874560. Throughput: 0: 1718.3, 1: 1739.9. Samples: 25982044. Policy #0 lag: (min: 18.0, avg: 19.9, max: 49.0) [2023-10-08 01:56:56,211][50642] Avg episode reward: [(0, '18.310'), (1, '25.060')] [2023-10-08 01:56:56,442][52059] Updated weights for policy 1, policy_version 51052 (0.0008) [2023-10-08 01:56:56,762][52060] Updated weights for policy 0, policy_version 50410 (0.0008) [2023-10-08 01:56:56,804][52059] Updated weights for policy 1, policy_version 51062 (0.0008) [2023-10-08 01:56:57,125][52060] Updated weights for policy 0, policy_version 50420 (0.0008) [2023-10-08 01:56:57,162][51710] Saving new best policy, reward=25.060! [2023-10-08 01:56:57,164][52059] Updated weights for policy 1, policy_version 51072 (0.0010) [2023-10-08 01:56:57,502][52060] Updated weights for policy 0, policy_version 50430 (0.0009) [2023-10-08 01:57:01,174][52059] Updated weights for policy 1, policy_version 51082 (0.0008) [2023-10-08 01:57:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 103940096. Throughput: 0: 1720.9, 1: 1736.9. Samples: 26003238. Policy #0 lag: (min: 18.0, avg: 19.9, max: 49.0) [2023-10-08 01:57:01,211][50642] Avg episode reward: [(0, '20.650'), (1, '22.570')] [2023-10-08 01:57:01,377][52060] Updated weights for policy 0, policy_version 50440 (0.0008) [2023-10-08 01:57:01,530][52059] Updated weights for policy 1, policy_version 51092 (0.0008) [2023-10-08 01:57:01,746][52060] Updated weights for policy 0, policy_version 50450 (0.0008) [2023-10-08 01:57:01,890][52059] Updated weights for policy 1, policy_version 51102 (0.0008) [2023-10-08 01:57:02,117][52060] Updated weights for policy 0, policy_version 50460 (0.0009) [2023-10-08 01:57:05,922][52059] Updated weights for policy 1, policy_version 51112 (0.0008) [2023-10-08 01:57:05,947][52060] Updated weights for policy 0, policy_version 50470 (0.0008) [2023-10-08 01:57:06,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 104005632. Throughput: 0: 1719.0, 1: 1724.9. Samples: 26012758. Policy #0 lag: (min: 18.0, avg: 19.9, max: 49.0) [2023-10-08 01:57:06,211][50642] Avg episode reward: [(0, '22.640'), (1, '21.780')] [2023-10-08 01:57:06,281][52059] Updated weights for policy 1, policy_version 51122 (0.0009) [2023-10-08 01:57:06,315][52060] Updated weights for policy 0, policy_version 50480 (0.0007) [2023-10-08 01:57:06,641][52059] Updated weights for policy 1, policy_version 51132 (0.0008) [2023-10-08 01:57:06,676][52060] Updated weights for policy 0, policy_version 50490 (0.0008) [2023-10-08 01:57:10,607][52060] Updated weights for policy 0, policy_version 50500 (0.0007) [2023-10-08 01:57:10,678][52059] Updated weights for policy 1, policy_version 51142 (0.0011) [2023-10-08 01:57:10,974][52060] Updated weights for policy 0, policy_version 50510 (0.0009) [2023-10-08 01:57:11,039][52059] Updated weights for policy 1, policy_version 51152 (0.0008) [2023-10-08 01:57:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 104071168. Throughput: 0: 1721.7, 1: 1738.9. Samples: 26033974. Policy #0 lag: (min: 18.0, avg: 19.9, max: 49.0) [2023-10-08 01:57:11,211][50642] Avg episode reward: [(0, '19.340'), (1, '23.350')] [2023-10-08 01:57:11,337][52060] Updated weights for policy 0, policy_version 50520 (0.0010) [2023-10-08 01:57:11,411][52059] Updated weights for policy 1, policy_version 51162 (0.0008) [2023-10-08 01:57:15,268][52059] Updated weights for policy 1, policy_version 51172 (0.0008) [2023-10-08 01:57:15,270][52060] Updated weights for policy 0, policy_version 50530 (0.0009) [2023-10-08 01:57:15,638][52059] Updated weights for policy 1, policy_version 51182 (0.0008) [2023-10-08 01:57:15,644][52060] Updated weights for policy 0, policy_version 50540 (0.0009) [2023-10-08 01:57:16,001][52059] Updated weights for policy 1, policy_version 51192 (0.0007) [2023-10-08 01:57:16,008][52060] Updated weights for policy 0, policy_version 50550 (0.0010) [2023-10-08 01:57:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 104136704. Throughput: 0: 1704.3, 1: 1719.6. Samples: 26053880. Policy #0 lag: (min: 18.0, avg: 19.9, max: 49.0) [2023-10-08 01:57:16,211][50642] Avg episode reward: [(0, '20.360'), (1, '24.210')] [2023-10-08 01:57:16,372][52060] Updated weights for policy 0, policy_version 50560 (0.0008) [2023-10-08 01:57:19,871][52059] Updated weights for policy 1, policy_version 51202 (0.0007) [2023-10-08 01:57:20,236][52059] Updated weights for policy 1, policy_version 51212 (0.0008) [2023-10-08 01:57:20,490][52060] Updated weights for policy 0, policy_version 50570 (0.0009) [2023-10-08 01:57:20,601][52059] Updated weights for policy 1, policy_version 51222 (0.0007) [2023-10-08 01:57:20,869][52060] Updated weights for policy 0, policy_version 50580 (0.0009) [2023-10-08 01:57:20,964][52059] Updated weights for policy 1, policy_version 51232 (0.0007) [2023-10-08 01:57:21,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 104235008. Throughput: 0: 1722.0, 1: 1739.3. Samples: 26064790. Policy #0 lag: (min: 10.0, avg: 15.8, max: 42.0) [2023-10-08 01:57:21,211][50642] Avg episode reward: [(0, '21.530'), (1, '20.350')] [2023-10-08 01:57:21,232][52060] Updated weights for policy 0, policy_version 50590 (0.0009) [2023-10-08 01:57:24,894][52059] Updated weights for policy 1, policy_version 51242 (0.0008) [2023-10-08 01:57:25,197][52060] Updated weights for policy 0, policy_version 50600 (0.0009) [2023-10-08 01:57:25,256][52059] Updated weights for policy 1, policy_version 51252 (0.0007) [2023-10-08 01:57:25,557][52060] Updated weights for policy 0, policy_version 50610 (0.0008) [2023-10-08 01:57:25,629][52059] Updated weights for policy 1, policy_version 51262 (0.0010) [2023-10-08 01:57:25,932][52060] Updated weights for policy 0, policy_version 50620 (0.0007) [2023-10-08 01:57:26,210][50642] Fps is (10 sec: 19660.7, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 104333312. Throughput: 0: 1723.6, 1: 1730.4. Samples: 26085854. Policy #0 lag: (min: 10.0, avg: 15.8, max: 42.0) [2023-10-08 01:57:26,211][50642] Avg episode reward: [(0, '21.140'), (1, '21.830')] [2023-10-08 01:57:29,739][52059] Updated weights for policy 1, policy_version 51272 (0.0009) [2023-10-08 01:57:30,051][52060] Updated weights for policy 0, policy_version 50630 (0.0008) [2023-10-08 01:57:30,118][52059] Updated weights for policy 1, policy_version 51282 (0.0007) [2023-10-08 01:57:30,443][52060] Updated weights for policy 0, policy_version 50640 (0.0007) [2023-10-08 01:57:30,483][52059] Updated weights for policy 1, policy_version 51292 (0.0007) [2023-10-08 01:57:30,806][52060] Updated weights for policy 0, policy_version 50650 (0.0009) [2023-10-08 01:57:31,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 104398848. Throughput: 0: 1699.3, 1: 1711.8. Samples: 26104898. Policy #0 lag: (min: 10.0, avg: 15.8, max: 42.0) [2023-10-08 01:57:31,211][50642] Avg episode reward: [(0, '19.170'), (1, '23.140')] [2023-10-08 01:57:34,392][52059] Updated weights for policy 1, policy_version 51302 (0.0010) [2023-10-08 01:57:34,752][52059] Updated weights for policy 1, policy_version 51312 (0.0011) [2023-10-08 01:57:34,923][52060] Updated weights for policy 0, policy_version 50660 (0.0009) [2023-10-08 01:57:35,114][52059] Updated weights for policy 1, policy_version 51322 (0.0009) [2023-10-08 01:57:35,282][52060] Updated weights for policy 0, policy_version 50670 (0.0007) [2023-10-08 01:57:35,651][52060] Updated weights for policy 0, policy_version 50680 (0.0009) [2023-10-08 01:57:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 104464384. Throughput: 0: 1721.4, 1: 1742.9. Samples: 26116568. Policy #0 lag: (min: 10.0, avg: 15.8, max: 42.0) [2023-10-08 01:57:36,211][50642] Avg episode reward: [(0, '22.220'), (1, '21.540')] [2023-10-08 01:57:39,044][52059] Updated weights for policy 1, policy_version 51332 (0.0008) [2023-10-08 01:57:39,408][52059] Updated weights for policy 1, policy_version 51342 (0.0007) [2023-10-08 01:57:39,619][52060] Updated weights for policy 0, policy_version 50690 (0.0008) [2023-10-08 01:57:39,780][52059] Updated weights for policy 1, policy_version 51352 (0.0007) [2023-10-08 01:57:39,980][52060] Updated weights for policy 0, policy_version 50700 (0.0007) [2023-10-08 01:57:40,350][52060] Updated weights for policy 0, policy_version 50710 (0.0007) [2023-10-08 01:57:40,725][52060] Updated weights for policy 0, policy_version 50720 (0.0010) [2023-10-08 01:57:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 104529920. Throughput: 0: 1715.4, 1: 1713.6. Samples: 26136348. Policy #0 lag: (min: 10.0, avg: 15.8, max: 42.0) [2023-10-08 01:57:41,211][50642] Avg episode reward: [(0, '20.550'), (1, '19.080')] [2023-10-08 01:57:43,821][52059] Updated weights for policy 1, policy_version 51362 (0.0009) [2023-10-08 01:57:44,191][52059] Updated weights for policy 1, policy_version 51372 (0.0007) [2023-10-08 01:57:44,552][52059] Updated weights for policy 1, policy_version 51382 (0.0008) [2023-10-08 01:57:44,719][52060] Updated weights for policy 0, policy_version 50730 (0.0009) [2023-10-08 01:57:44,914][52059] Updated weights for policy 1, policy_version 51392 (0.0009) [2023-10-08 01:57:45,089][52060] Updated weights for policy 0, policy_version 50740 (0.0007) [2023-10-08 01:57:45,462][52060] Updated weights for policy 0, policy_version 50750 (0.0008) [2023-10-08 01:57:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 104595456. Throughput: 0: 1690.1, 1: 1712.7. Samples: 26156366. Policy #0 lag: (min: 10.0, avg: 15.8, max: 42.0) [2023-10-08 01:57:46,211][50642] Avg episode reward: [(0, '18.310'), (1, '22.880')] [2023-10-08 01:57:48,809][52059] Updated weights for policy 1, policy_version 51402 (0.0008) [2023-10-08 01:57:49,176][52059] Updated weights for policy 1, policy_version 51412 (0.0007) [2023-10-08 01:57:49,377][52060] Updated weights for policy 0, policy_version 50760 (0.0007) [2023-10-08 01:57:49,543][52059] Updated weights for policy 1, policy_version 51422 (0.0007) [2023-10-08 01:57:49,743][52060] Updated weights for policy 0, policy_version 50770 (0.0007) [2023-10-08 01:57:50,114][52060] Updated weights for policy 0, policy_version 50780 (0.0007) [2023-10-08 01:57:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 104660992. Throughput: 0: 1719.8, 1: 1729.6. Samples: 26167980. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:57:51,211][50642] Avg episode reward: [(0, '20.490'), (1, '24.130')] [2023-10-08 01:57:53,537][52059] Updated weights for policy 1, policy_version 51432 (0.0008) [2023-10-08 01:57:53,895][52059] Updated weights for policy 1, policy_version 51442 (0.0007) [2023-10-08 01:57:54,035][52060] Updated weights for policy 0, policy_version 50790 (0.0008) [2023-10-08 01:57:54,263][52059] Updated weights for policy 1, policy_version 51452 (0.0009) [2023-10-08 01:57:54,401][52060] Updated weights for policy 0, policy_version 50800 (0.0007) [2023-10-08 01:57:54,769][52060] Updated weights for policy 0, policy_version 50810 (0.0007) [2023-10-08 01:57:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 104726528. Throughput: 0: 1695.9, 1: 1709.1. Samples: 26187198. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:57:56,211][50642] Avg episode reward: [(0, '21.100'), (1, '20.480')] [2023-10-08 01:57:58,239][52059] Updated weights for policy 1, policy_version 51462 (0.0008) [2023-10-08 01:57:58,600][52059] Updated weights for policy 1, policy_version 51472 (0.0009) [2023-10-08 01:57:58,624][52060] Updated weights for policy 0, policy_version 50820 (0.0007) [2023-10-08 01:57:58,967][52059] Updated weights for policy 1, policy_version 51482 (0.0010) [2023-10-08 01:57:59,000][52060] Updated weights for policy 0, policy_version 50830 (0.0009) [2023-10-08 01:57:59,365][52060] Updated weights for policy 0, policy_version 50840 (0.0009) [2023-10-08 01:58:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 104792064. Throughput: 0: 1702.4, 1: 1722.0. Samples: 26207976. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:58:01,211][50642] Avg episode reward: [(0, '18.100'), (1, '20.520')] [2023-10-08 01:58:02,832][52059] Updated weights for policy 1, policy_version 51492 (0.0008) [2023-10-08 01:58:03,202][52059] Updated weights for policy 1, policy_version 51502 (0.0009) [2023-10-08 01:58:03,402][52060] Updated weights for policy 0, policy_version 50850 (0.0007) [2023-10-08 01:58:03,574][52059] Updated weights for policy 1, policy_version 51512 (0.0008) [2023-10-08 01:58:03,768][52060] Updated weights for policy 0, policy_version 50860 (0.0007) [2023-10-08 01:58:04,126][52060] Updated weights for policy 0, policy_version 50870 (0.0008) [2023-10-08 01:58:04,500][52060] Updated weights for policy 0, policy_version 50880 (0.0007) [2023-10-08 01:58:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 104857600. Throughput: 0: 1703.7, 1: 1703.2. Samples: 26218102. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:58:06,211][50642] Avg episode reward: [(0, '20.000'), (1, '23.560')] [2023-10-08 01:58:07,518][52059] Updated weights for policy 1, policy_version 51522 (0.0010) [2023-10-08 01:58:07,879][52059] Updated weights for policy 1, policy_version 51532 (0.0011) [2023-10-08 01:58:08,251][52059] Updated weights for policy 1, policy_version 51542 (0.0008) [2023-10-08 01:58:08,500][52060] Updated weights for policy 0, policy_version 50890 (0.0009) [2023-10-08 01:58:08,604][52059] Updated weights for policy 1, policy_version 51552 (0.0009) [2023-10-08 01:58:08,875][52060] Updated weights for policy 0, policy_version 50900 (0.0008) [2023-10-08 01:58:09,239][52060] Updated weights for policy 0, policy_version 50910 (0.0010) [2023-10-08 01:58:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 104923136. Throughput: 0: 1686.3, 1: 1709.2. Samples: 26238650. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:58:11,211][50642] Avg episode reward: [(0, '22.920'), (1, '21.400')] [2023-10-08 01:58:12,544][52059] Updated weights for policy 1, policy_version 51562 (0.0007) [2023-10-08 01:58:12,914][52059] Updated weights for policy 1, policy_version 51572 (0.0008) [2023-10-08 01:58:13,248][52060] Updated weights for policy 0, policy_version 50920 (0.0008) [2023-10-08 01:58:13,272][52059] Updated weights for policy 1, policy_version 51582 (0.0007) [2023-10-08 01:58:13,615][52060] Updated weights for policy 0, policy_version 50930 (0.0009) [2023-10-08 01:58:13,968][52060] Updated weights for policy 0, policy_version 50940 (0.0010) [2023-10-08 01:58:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 104988672. Throughput: 0: 1715.0, 1: 1731.8. Samples: 26260006. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:58:16,211][50642] Avg episode reward: [(0, '19.680'), (1, '18.750')] [2023-10-08 01:58:17,280][52059] Updated weights for policy 1, policy_version 51592 (0.0009) [2023-10-08 01:58:17,662][52059] Updated weights for policy 1, policy_version 51602 (0.0011) [2023-10-08 01:58:18,026][52059] Updated weights for policy 1, policy_version 51612 (0.0009) [2023-10-08 01:58:18,031][52060] Updated weights for policy 0, policy_version 50950 (0.0008) [2023-10-08 01:58:18,398][52060] Updated weights for policy 0, policy_version 50960 (0.0007) [2023-10-08 01:58:18,764][52060] Updated weights for policy 0, policy_version 50970 (0.0007) [2023-10-08 01:58:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 105054208. Throughput: 0: 1699.5, 1: 1696.3. Samples: 26269378. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 01:58:21,211][50642] Avg episode reward: [(0, '20.950'), (1, '21.620')] [2023-10-08 01:58:21,925][52059] Updated weights for policy 1, policy_version 51622 (0.0009) [2023-10-08 01:58:22,288][52059] Updated weights for policy 1, policy_version 51632 (0.0010) [2023-10-08 01:58:22,657][52059] Updated weights for policy 1, policy_version 51642 (0.0009) [2023-10-08 01:58:22,780][52060] Updated weights for policy 0, policy_version 50980 (0.0009) [2023-10-08 01:58:23,157][52060] Updated weights for policy 0, policy_version 50990 (0.0009) [2023-10-08 01:58:23,536][52060] Updated weights for policy 0, policy_version 51000 (0.0009) [2023-10-08 01:58:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 105119744. Throughput: 0: 1699.9, 1: 1723.7. Samples: 26290410. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) [2023-10-08 01:58:26,211][50642] Avg episode reward: [(0, '22.690'), (1, '26.170')] [2023-10-08 01:58:26,212][51710] Saving new best policy, reward=26.170! [2023-10-08 01:58:26,542][52059] Updated weights for policy 1, policy_version 51652 (0.0008) [2023-10-08 01:58:26,908][52059] Updated weights for policy 1, policy_version 51662 (0.0007) [2023-10-08 01:58:27,280][52059] Updated weights for policy 1, policy_version 51672 (0.0007) [2023-10-08 01:58:27,462][52060] Updated weights for policy 0, policy_version 51010 (0.0009) [2023-10-08 01:58:27,818][52060] Updated weights for policy 0, policy_version 51020 (0.0008) [2023-10-08 01:58:28,184][52060] Updated weights for policy 0, policy_version 51030 (0.0009) [2023-10-08 01:58:28,550][52060] Updated weights for policy 0, policy_version 51040 (0.0008) [2023-10-08 01:58:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 105185280. Throughput: 0: 1723.4, 1: 1721.6. Samples: 26311390. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) [2023-10-08 01:58:31,211][50642] Avg episode reward: [(0, '20.960'), (1, '18.810')] [2023-10-08 01:58:31,286][52059] Updated weights for policy 1, policy_version 51682 (0.0007) [2023-10-08 01:58:31,658][52059] Updated weights for policy 1, policy_version 51692 (0.0012) [2023-10-08 01:58:32,019][52059] Updated weights for policy 1, policy_version 51702 (0.0010) [2023-10-08 01:58:32,384][52059] Updated weights for policy 1, policy_version 51712 (0.0007) [2023-10-08 01:58:32,541][52060] Updated weights for policy 0, policy_version 51050 (0.0010) [2023-10-08 01:58:32,905][52060] Updated weights for policy 0, policy_version 51060 (0.0010) [2023-10-08 01:58:33,275][52060] Updated weights for policy 0, policy_version 51070 (0.0011) [2023-10-08 01:58:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 105250816. Throughput: 0: 1692.3, 1: 1700.8. Samples: 26320668. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) [2023-10-08 01:58:36,211][50642] Avg episode reward: [(0, '18.960'), (1, '18.180')] [2023-10-08 01:58:36,272][52059] Updated weights for policy 1, policy_version 51722 (0.0008) [2023-10-08 01:58:36,631][52059] Updated weights for policy 1, policy_version 51732 (0.0008) [2023-10-08 01:58:37,007][52059] Updated weights for policy 1, policy_version 51742 (0.0008) [2023-10-08 01:58:37,233][52060] Updated weights for policy 0, policy_version 51080 (0.0008) [2023-10-08 01:58:37,596][52060] Updated weights for policy 0, policy_version 51090 (0.0007) [2023-10-08 01:58:37,971][52060] Updated weights for policy 0, policy_version 51100 (0.0008) [2023-10-08 01:58:41,025][52059] Updated weights for policy 1, policy_version 51752 (0.0010) [2023-10-08 01:58:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 105316352. Throughput: 0: 1719.0, 1: 1720.2. Samples: 26341962. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) [2023-10-08 01:58:41,211][50642] Avg episode reward: [(0, '20.300'), (1, '21.450')] [2023-10-08 01:58:41,378][52059] Updated weights for policy 1, policy_version 51762 (0.0008) [2023-10-08 01:58:41,744][52059] Updated weights for policy 1, policy_version 51772 (0.0009) [2023-10-08 01:58:41,965][52060] Updated weights for policy 0, policy_version 51110 (0.0008) [2023-10-08 01:58:42,344][52060] Updated weights for policy 0, policy_version 51120 (0.0009) [2023-10-08 01:58:42,702][52060] Updated weights for policy 0, policy_version 51130 (0.0009) [2023-10-08 01:58:45,747][52059] Updated weights for policy 1, policy_version 51782 (0.0009) [2023-10-08 01:58:46,114][52059] Updated weights for policy 1, policy_version 51792 (0.0007) [2023-10-08 01:58:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 105381888. Throughput: 0: 1724.8, 1: 1715.7. Samples: 26362798. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) [2023-10-08 01:58:46,211][50642] Avg episode reward: [(0, '22.040'), (1, '23.380')] [2023-10-08 01:58:46,217][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000051136_52363264.pth... [2023-10-08 01:58:46,257][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000049536_50724864.pth [2023-10-08 01:58:46,478][52059] Updated weights for policy 1, policy_version 51802 (0.0008) [2023-10-08 01:58:46,618][52060] Updated weights for policy 0, policy_version 51140 (0.0008) [2023-10-08 01:58:46,685][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000051808_53051392.pth... [2023-10-08 01:58:46,713][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000050176_51380224.pth [2023-10-08 01:58:46,986][52060] Updated weights for policy 0, policy_version 51150 (0.0010) [2023-10-08 01:58:47,357][52060] Updated weights for policy 0, policy_version 51160 (0.0010) [2023-10-08 01:58:50,589][52059] Updated weights for policy 1, policy_version 51812 (0.0007) [2023-10-08 01:58:50,947][52059] Updated weights for policy 1, policy_version 51822 (0.0008) [2023-10-08 01:58:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 105447424. Throughput: 0: 1704.7, 1: 1719.8. Samples: 26372206. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) [2023-10-08 01:58:51,211][50642] Avg episode reward: [(0, '18.730'), (1, '19.450')] [2023-10-08 01:58:51,292][52060] Updated weights for policy 0, policy_version 51170 (0.0011) [2023-10-08 01:58:51,316][52059] Updated weights for policy 1, policy_version 51832 (0.0009) [2023-10-08 01:58:51,662][52060] Updated weights for policy 0, policy_version 51180 (0.0008) [2023-10-08 01:58:52,030][52060] Updated weights for policy 0, policy_version 51190 (0.0009) [2023-10-08 01:58:52,398][52060] Updated weights for policy 0, policy_version 51200 (0.0007) [2023-10-08 01:58:55,249][52059] Updated weights for policy 1, policy_version 51842 (0.0009) [2023-10-08 01:58:55,608][52059] Updated weights for policy 1, policy_version 51852 (0.0008) [2023-10-08 01:58:55,972][52059] Updated weights for policy 1, policy_version 51862 (0.0009) [2023-10-08 01:58:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 105512960. Throughput: 0: 1720.8, 1: 1722.6. Samples: 26393602. Policy #0 lag: (min: 13.0, avg: 21.0, max: 45.0) [2023-10-08 01:58:56,211][50642] Avg episode reward: [(0, '20.060'), (1, '20.080')] [2023-10-08 01:58:56,342][52059] Updated weights for policy 1, policy_version 51872 (0.0007) [2023-10-08 01:58:56,395][52060] Updated weights for policy 0, policy_version 51210 (0.0009) [2023-10-08 01:58:56,764][52060] Updated weights for policy 0, policy_version 51220 (0.0009) [2023-10-08 01:58:57,133][52060] Updated weights for policy 0, policy_version 51230 (0.0009) [2023-10-08 01:59:00,254][52059] Updated weights for policy 1, policy_version 51882 (0.0008) [2023-10-08 01:59:00,622][52059] Updated weights for policy 1, policy_version 51892 (0.0007) [2023-10-08 01:59:00,990][52059] Updated weights for policy 1, policy_version 51902 (0.0008) [2023-10-08 01:59:01,028][52060] Updated weights for policy 0, policy_version 51240 (0.0008) [2023-10-08 01:59:01,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 105611264. Throughput: 0: 1720.8, 1: 1702.6. Samples: 26414056. Policy #0 lag: (min: 21.0, avg: 28.2, max: 53.0) [2023-10-08 01:59:01,211][50642] Avg episode reward: [(0, '22.410'), (1, '24.650')] [2023-10-08 01:59:01,389][52060] Updated weights for policy 0, policy_version 51250 (0.0010) [2023-10-08 01:59:01,764][52060] Updated weights for policy 0, policy_version 51260 (0.0010) [2023-10-08 01:59:04,837][52059] Updated weights for policy 1, policy_version 51912 (0.0009) [2023-10-08 01:59:05,208][52059] Updated weights for policy 1, policy_version 51922 (0.0009) [2023-10-08 01:59:05,570][52059] Updated weights for policy 1, policy_version 51932 (0.0007) [2023-10-08 01:59:05,765][52060] Updated weights for policy 0, policy_version 51270 (0.0009) [2023-10-08 01:59:06,145][52060] Updated weights for policy 0, policy_version 51280 (0.0011) [2023-10-08 01:59:06,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 105676800. Throughput: 0: 1719.2, 1: 1737.2. Samples: 26424914. Policy #0 lag: (min: 21.0, avg: 28.2, max: 53.0) [2023-10-08 01:59:06,211][50642] Avg episode reward: [(0, '21.940'), (1, '21.180')] [2023-10-08 01:59:06,526][52060] Updated weights for policy 0, policy_version 51290 (0.0007) [2023-10-08 01:59:09,398][52059] Updated weights for policy 1, policy_version 51942 (0.0009) [2023-10-08 01:59:09,758][52059] Updated weights for policy 1, policy_version 51952 (0.0009) [2023-10-08 01:59:10,129][52059] Updated weights for policy 1, policy_version 51962 (0.0007) [2023-10-08 01:59:10,451][52060] Updated weights for policy 0, policy_version 51300 (0.0008) [2023-10-08 01:59:10,817][52060] Updated weights for policy 0, policy_version 51310 (0.0009) [2023-10-08 01:59:11,179][52060] Updated weights for policy 0, policy_version 51320 (0.0008) [2023-10-08 01:59:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 105742336. Throughput: 0: 1728.5, 1: 1721.1. Samples: 26445642. Policy #0 lag: (min: 21.0, avg: 28.2, max: 53.0) [2023-10-08 01:59:11,211][50642] Avg episode reward: [(0, '20.460'), (1, '19.440')] [2023-10-08 01:59:14,032][52059] Updated weights for policy 1, policy_version 51972 (0.0009) [2023-10-08 01:59:14,409][52059] Updated weights for policy 1, policy_version 51982 (0.0007) [2023-10-08 01:59:14,780][52059] Updated weights for policy 1, policy_version 51992 (0.0009) [2023-10-08 01:59:15,210][52060] Updated weights for policy 0, policy_version 51330 (0.0010) [2023-10-08 01:59:15,578][52060] Updated weights for policy 0, policy_version 51340 (0.0009) [2023-10-08 01:59:15,944][52060] Updated weights for policy 0, policy_version 51350 (0.0009) [2023-10-08 01:59:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 105807872. Throughput: 0: 1714.6, 1: 1715.6. Samples: 26465748. Policy #0 lag: (min: 21.0, avg: 28.2, max: 53.0) [2023-10-08 01:59:16,211][50642] Avg episode reward: [(0, '20.090'), (1, '21.690')] [2023-10-08 01:59:16,312][52060] Updated weights for policy 0, policy_version 51360 (0.0007) [2023-10-08 01:59:18,810][52059] Updated weights for policy 1, policy_version 52002 (0.0007) [2023-10-08 01:59:19,178][52059] Updated weights for policy 1, policy_version 52012 (0.0009) [2023-10-08 01:59:19,553][52059] Updated weights for policy 1, policy_version 52022 (0.0009) [2023-10-08 01:59:19,914][52059] Updated weights for policy 1, policy_version 52032 (0.0007) [2023-10-08 01:59:20,189][52060] Updated weights for policy 0, policy_version 51370 (0.0008) [2023-10-08 01:59:20,552][52060] Updated weights for policy 0, policy_version 51380 (0.0007) [2023-10-08 01:59:20,917][52060] Updated weights for policy 0, policy_version 51390 (0.0008) [2023-10-08 01:59:21,210][50642] Fps is (10 sec: 16383.6, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 105906176. Throughput: 0: 1733.9, 1: 1745.9. Samples: 26477258. Policy #0 lag: (min: 21.0, avg: 28.2, max: 53.0) [2023-10-08 01:59:21,211][50642] Avg episode reward: [(0, '18.670'), (1, '25.790')] [2023-10-08 01:59:23,800][52059] Updated weights for policy 1, policy_version 52042 (0.0007) [2023-10-08 01:59:24,159][52059] Updated weights for policy 1, policy_version 52052 (0.0009) [2023-10-08 01:59:24,530][52059] Updated weights for policy 1, policy_version 52062 (0.0007) [2023-10-08 01:59:24,905][52060] Updated weights for policy 0, policy_version 51400 (0.0008) [2023-10-08 01:59:25,282][52060] Updated weights for policy 0, policy_version 51410 (0.0009) [2023-10-08 01:59:25,647][52060] Updated weights for policy 0, policy_version 51420 (0.0011) [2023-10-08 01:59:26,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 105971712. Throughput: 0: 1727.2, 1: 1722.1. Samples: 26497182. Policy #0 lag: (min: 21.0, avg: 28.2, max: 53.0) [2023-10-08 01:59:26,211][50642] Avg episode reward: [(0, '16.610'), (1, '18.900')] [2023-10-08 01:59:28,426][52059] Updated weights for policy 1, policy_version 52072 (0.0009) [2023-10-08 01:59:28,783][52059] Updated weights for policy 1, policy_version 52082 (0.0007) [2023-10-08 01:59:29,147][52059] Updated weights for policy 1, policy_version 52092 (0.0008) [2023-10-08 01:59:29,571][52060] Updated weights for policy 0, policy_version 51430 (0.0009) [2023-10-08 01:59:29,946][52060] Updated weights for policy 0, policy_version 51440 (0.0008) [2023-10-08 01:59:30,314][52060] Updated weights for policy 0, policy_version 51450 (0.0008) [2023-10-08 01:59:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 106037248. Throughput: 0: 1703.1, 1: 1735.0. Samples: 26517512. Policy #0 lag: (min: 21.0, avg: 22.8, max: 50.0) [2023-10-08 01:59:31,211][50642] Avg episode reward: [(0, '16.350'), (1, '19.610')] [2023-10-08 01:59:33,024][52059] Updated weights for policy 1, policy_version 52102 (0.0009) [2023-10-08 01:59:33,388][52059] Updated weights for policy 1, policy_version 52112 (0.0008) [2023-10-08 01:59:33,758][52059] Updated weights for policy 1, policy_version 52122 (0.0007) [2023-10-08 01:59:34,339][52060] Updated weights for policy 0, policy_version 51460 (0.0009) [2023-10-08 01:59:34,699][52060] Updated weights for policy 0, policy_version 51470 (0.0010) [2023-10-08 01:59:35,074][52060] Updated weights for policy 0, policy_version 51480 (0.0008) [2023-10-08 01:59:36,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 106102784. Throughput: 0: 1736.9, 1: 1734.4. Samples: 26528416. Policy #0 lag: (min: 21.0, avg: 22.8, max: 50.0) [2023-10-08 01:59:36,211][50642] Avg episode reward: [(0, '14.760'), (1, '25.370')] [2023-10-08 01:59:37,496][52059] Updated weights for policy 1, policy_version 52132 (0.0008) [2023-10-08 01:59:37,863][52059] Updated weights for policy 1, policy_version 52142 (0.0007) [2023-10-08 01:59:38,219][52059] Updated weights for policy 1, policy_version 52152 (0.0009) [2023-10-08 01:59:38,956][52060] Updated weights for policy 0, policy_version 51490 (0.0008) [2023-10-08 01:59:39,337][52060] Updated weights for policy 0, policy_version 51500 (0.0008) [2023-10-08 01:59:39,710][52060] Updated weights for policy 0, policy_version 51510 (0.0007) [2023-10-08 01:59:40,078][52060] Updated weights for policy 0, policy_version 51520 (0.0010) [2023-10-08 01:59:41,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 106168320. Throughput: 0: 1714.6, 1: 1733.2. Samples: 26548752. Policy #0 lag: (min: 21.0, avg: 22.8, max: 50.0) [2023-10-08 01:59:41,211][50642] Avg episode reward: [(0, '16.120'), (1, '23.030')] [2023-10-08 01:59:42,280][52059] Updated weights for policy 1, policy_version 52162 (0.0010) [2023-10-08 01:59:42,649][52059] Updated weights for policy 1, policy_version 52172 (0.0008) [2023-10-08 01:59:43,003][52059] Updated weights for policy 1, policy_version 52182 (0.0010) [2023-10-08 01:59:43,365][52059] Updated weights for policy 1, policy_version 52192 (0.0009) [2023-10-08 01:59:44,032][52060] Updated weights for policy 0, policy_version 51530 (0.0011) [2023-10-08 01:59:44,405][52060] Updated weights for policy 0, policy_version 51540 (0.0010) [2023-10-08 01:59:44,769][52060] Updated weights for policy 0, policy_version 51550 (0.0007) [2023-10-08 01:59:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 106233856. Throughput: 0: 1707.8, 1: 1749.7. Samples: 26569644. Policy #0 lag: (min: 21.0, avg: 22.8, max: 50.0) [2023-10-08 01:59:46,211][50642] Avg episode reward: [(0, '15.720'), (1, '20.130')] [2023-10-08 01:59:47,336][52059] Updated weights for policy 1, policy_version 52202 (0.0008) [2023-10-08 01:59:47,704][52059] Updated weights for policy 1, policy_version 52212 (0.0009) [2023-10-08 01:59:48,064][52059] Updated weights for policy 1, policy_version 52222 (0.0008) [2023-10-08 01:59:48,639][52060] Updated weights for policy 0, policy_version 51560 (0.0007) [2023-10-08 01:59:49,015][52060] Updated weights for policy 0, policy_version 51570 (0.0010) [2023-10-08 01:59:49,382][52060] Updated weights for policy 0, policy_version 51580 (0.0010) [2023-10-08 01:59:51,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 106299392. Throughput: 0: 1721.4, 1: 1724.0. Samples: 26579956. Policy #0 lag: (min: 21.0, avg: 22.8, max: 50.0) [2023-10-08 01:59:51,211][50642] Avg episode reward: [(0, '15.890'), (1, '21.150')] [2023-10-08 01:59:51,994][52059] Updated weights for policy 1, policy_version 52232 (0.0009) [2023-10-08 01:59:52,370][52059] Updated weights for policy 1, policy_version 52242 (0.0009) [2023-10-08 01:59:52,734][52059] Updated weights for policy 1, policy_version 52252 (0.0007) [2023-10-08 01:59:53,308][52060] Updated weights for policy 0, policy_version 51590 (0.0008) [2023-10-08 01:59:53,683][52060] Updated weights for policy 0, policy_version 51600 (0.0008) [2023-10-08 01:59:54,048][52060] Updated weights for policy 0, policy_version 51610 (0.0009) [2023-10-08 01:59:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 106364928. Throughput: 0: 1702.5, 1: 1740.3. Samples: 26600568. Policy #0 lag: (min: 21.0, avg: 22.8, max: 50.0) [2023-10-08 01:59:56,211][50642] Avg episode reward: [(0, '17.430'), (1, '24.590')] [2023-10-08 01:59:56,604][52059] Updated weights for policy 1, policy_version 52262 (0.0008) [2023-10-08 01:59:56,971][52059] Updated weights for policy 1, policy_version 52272 (0.0007) [2023-10-08 01:59:57,339][52059] Updated weights for policy 1, policy_version 52282 (0.0007) [2023-10-08 01:59:58,153][52060] Updated weights for policy 0, policy_version 51620 (0.0009) [2023-10-08 01:59:58,565][52060] Updated weights for policy 0, policy_version 51630 (0.0010) [2023-10-08 01:59:58,936][52060] Updated weights for policy 0, policy_version 51640 (0.0009) [2023-10-08 02:00:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 106430464. Throughput: 0: 1710.8, 1: 1757.5. Samples: 26621822. Policy #0 lag: (min: 21.0, avg: 22.8, max: 50.0) [2023-10-08 02:00:01,211][50642] Avg episode reward: [(0, '16.530'), (1, '17.930')] [2023-10-08 02:00:01,303][52059] Updated weights for policy 1, policy_version 52292 (0.0009) [2023-10-08 02:00:01,662][52059] Updated weights for policy 1, policy_version 52302 (0.0008) [2023-10-08 02:00:02,027][52059] Updated weights for policy 1, policy_version 52312 (0.0008) [2023-10-08 02:00:02,922][52060] Updated weights for policy 0, policy_version 51650 (0.0008) [2023-10-08 02:00:03,301][52060] Updated weights for policy 0, policy_version 51660 (0.0008) [2023-10-08 02:00:03,674][52060] Updated weights for policy 0, policy_version 51670 (0.0009) [2023-10-08 02:00:04,040][52060] Updated weights for policy 0, policy_version 51680 (0.0010) [2023-10-08 02:00:05,895][52059] Updated weights for policy 1, policy_version 52322 (0.0008) [2023-10-08 02:00:06,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 106496000. Throughput: 0: 1700.2, 1: 1730.5. Samples: 26631642. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 02:00:06,211][50642] Avg episode reward: [(0, '17.410'), (1, '18.420')] [2023-10-08 02:00:06,248][52059] Updated weights for policy 1, policy_version 52332 (0.0007) [2023-10-08 02:00:06,619][52059] Updated weights for policy 1, policy_version 52342 (0.0007) [2023-10-08 02:00:06,981][52059] Updated weights for policy 1, policy_version 52352 (0.0007) [2023-10-08 02:00:07,904][52060] Updated weights for policy 0, policy_version 51690 (0.0008) [2023-10-08 02:00:08,264][52060] Updated weights for policy 0, policy_version 51700 (0.0007) [2023-10-08 02:00:08,646][52060] Updated weights for policy 0, policy_version 51710 (0.0008) [2023-10-08 02:00:10,608][52059] Updated weights for policy 1, policy_version 52362 (0.0008) [2023-10-08 02:00:10,974][52059] Updated weights for policy 1, policy_version 52372 (0.0008) [2023-10-08 02:00:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 106561536. Throughput: 0: 1698.1, 1: 1765.2. Samples: 26653028. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 02:00:11,211][50642] Avg episode reward: [(0, '21.160'), (1, '22.240')] [2023-10-08 02:00:11,346][52059] Updated weights for policy 1, policy_version 52382 (0.0010) [2023-10-08 02:00:12,631][52060] Updated weights for policy 0, policy_version 51720 (0.0011) [2023-10-08 02:00:13,011][52060] Updated weights for policy 0, policy_version 51730 (0.0009) [2023-10-08 02:00:13,370][52060] Updated weights for policy 0, policy_version 51740 (0.0007) [2023-10-08 02:00:15,245][52059] Updated weights for policy 1, policy_version 52392 (0.0009) [2023-10-08 02:00:15,598][52059] Updated weights for policy 1, policy_version 52402 (0.0009) [2023-10-08 02:00:15,963][52059] Updated weights for policy 1, policy_version 52412 (0.0010) [2023-10-08 02:00:16,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 106659840. Throughput: 0: 1723.0, 1: 1741.2. Samples: 26673402. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 02:00:16,211][50642] Avg episode reward: [(0, '20.520'), (1, '26.120')] [2023-10-08 02:00:17,340][52060] Updated weights for policy 0, policy_version 51750 (0.0009) [2023-10-08 02:00:17,717][52060] Updated weights for policy 0, policy_version 51760 (0.0008) [2023-10-08 02:00:18,084][52060] Updated weights for policy 0, policy_version 51770 (0.0009) [2023-10-08 02:00:19,895][52059] Updated weights for policy 1, policy_version 52422 (0.0009) [2023-10-08 02:00:20,264][52059] Updated weights for policy 1, policy_version 52432 (0.0007) [2023-10-08 02:00:20,618][52059] Updated weights for policy 1, policy_version 52442 (0.0008) [2023-10-08 02:00:21,210][50642] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 106725376. Throughput: 0: 1691.3, 1: 1762.3. Samples: 26683828. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 02:00:21,211][50642] Avg episode reward: [(0, '18.240'), (1, '20.880')] [2023-10-08 02:00:21,993][52060] Updated weights for policy 0, policy_version 51780 (0.0008) [2023-10-08 02:00:22,362][52060] Updated weights for policy 0, policy_version 51790 (0.0007) [2023-10-08 02:00:22,731][52060] Updated weights for policy 0, policy_version 51800 (0.0009) [2023-10-08 02:00:24,536][52059] Updated weights for policy 1, policy_version 52452 (0.0008) [2023-10-08 02:00:24,903][52059] Updated weights for policy 1, policy_version 52462 (0.0008) [2023-10-08 02:00:25,261][52059] Updated weights for policy 1, policy_version 52472 (0.0007) [2023-10-08 02:00:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 106790912. Throughput: 0: 1715.5, 1: 1756.0. Samples: 26704966. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 02:00:26,211][50642] Avg episode reward: [(0, '20.730'), (1, '20.440')] [2023-10-08 02:00:26,750][52060] Updated weights for policy 0, policy_version 51810 (0.0009) [2023-10-08 02:00:27,115][52060] Updated weights for policy 0, policy_version 51820 (0.0008) [2023-10-08 02:00:27,481][52060] Updated weights for policy 0, policy_version 51830 (0.0009) [2023-10-08 02:00:27,848][52060] Updated weights for policy 0, policy_version 51840 (0.0011) [2023-10-08 02:00:29,197][52059] Updated weights for policy 1, policy_version 52482 (0.0009) [2023-10-08 02:00:29,570][52059] Updated weights for policy 1, policy_version 52492 (0.0008) [2023-10-08 02:00:29,942][52059] Updated weights for policy 1, policy_version 52502 (0.0007) [2023-10-08 02:00:30,306][52059] Updated weights for policy 1, policy_version 52512 (0.0007) [2023-10-08 02:00:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 106856448. Throughput: 0: 1718.5, 1: 1741.0. Samples: 26725324. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 02:00:31,211][50642] Avg episode reward: [(0, '21.200'), (1, '25.150')] [2023-10-08 02:00:31,941][52060] Updated weights for policy 0, policy_version 51850 (0.0010) [2023-10-08 02:00:32,303][52060] Updated weights for policy 0, policy_version 51860 (0.0010) [2023-10-08 02:00:32,683][52060] Updated weights for policy 0, policy_version 51870 (0.0008) [2023-10-08 02:00:34,179][52059] Updated weights for policy 1, policy_version 52522 (0.0009) [2023-10-08 02:00:34,546][52059] Updated weights for policy 1, policy_version 52532 (0.0009) [2023-10-08 02:00:34,912][52059] Updated weights for policy 1, policy_version 52542 (0.0008) [2023-10-08 02:00:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 106921984. Throughput: 0: 1700.3, 1: 1768.3. Samples: 26736046. Policy #0 lag: (min: 31.0, avg: 36.0, max: 63.0) [2023-10-08 02:00:36,211][50642] Avg episode reward: [(0, '19.170'), (1, '22.440')] [2023-10-08 02:00:36,659][52060] Updated weights for policy 0, policy_version 51880 (0.0007) [2023-10-08 02:00:37,020][52060] Updated weights for policy 0, policy_version 51890 (0.0007) [2023-10-08 02:00:37,391][52060] Updated weights for policy 0, policy_version 51900 (0.0008) [2023-10-08 02:00:38,723][52059] Updated weights for policy 1, policy_version 52552 (0.0009) [2023-10-08 02:00:39,084][52059] Updated weights for policy 1, policy_version 52562 (0.0007) [2023-10-08 02:00:39,447][52059] Updated weights for policy 1, policy_version 52572 (0.0007) [2023-10-08 02:00:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 106987520. Throughput: 0: 1716.4, 1: 1742.4. Samples: 26756216. Policy #0 lag: (min: 6.0, avg: 6.0, max: 9.0) [2023-10-08 02:00:41,211][50642] Avg episode reward: [(0, '19.370'), (1, '20.770')] [2023-10-08 02:00:41,314][52060] Updated weights for policy 0, policy_version 51910 (0.0010) [2023-10-08 02:00:41,671][52060] Updated weights for policy 0, policy_version 51920 (0.0007) [2023-10-08 02:00:42,047][52060] Updated weights for policy 0, policy_version 51930 (0.0008) [2023-10-08 02:00:43,516][52059] Updated weights for policy 1, policy_version 52582 (0.0008) [2023-10-08 02:00:43,878][52059] Updated weights for policy 1, policy_version 52592 (0.0010) [2023-10-08 02:00:44,251][52059] Updated weights for policy 1, policy_version 52602 (0.0008) [2023-10-08 02:00:46,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 107053056. Throughput: 0: 1714.7, 1: 1725.4. Samples: 26776626. Policy #0 lag: (min: 6.0, avg: 6.0, max: 9.0) [2023-10-08 02:00:46,211][50642] Avg episode reward: [(0, '20.560'), (1, '22.660')] [2023-10-08 02:00:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000052608_53870592.pth... [2023-10-08 02:00:46,261][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000050976_52199424.pth [2023-10-08 02:00:46,300][52060] Updated weights for policy 0, policy_version 51940 (0.0010) [2023-10-08 02:00:46,693][52060] Updated weights for policy 0, policy_version 51950 (0.0010) [2023-10-08 02:00:47,067][52060] Updated weights for policy 0, policy_version 51960 (0.0010) [2023-10-08 02:00:47,358][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000051968_53215232.pth... [2023-10-08 02:00:47,386][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000050336_51544064.pth [2023-10-08 02:00:48,463][52059] Updated weights for policy 1, policy_version 52612 (0.0009) [2023-10-08 02:00:48,829][52059] Updated weights for policy 1, policy_version 52622 (0.0009) [2023-10-08 02:00:49,192][52059] Updated weights for policy 1, policy_version 52632 (0.0007) [2023-10-08 02:00:51,049][52060] Updated weights for policy 0, policy_version 51970 (0.0010) [2023-10-08 02:00:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 107118592. Throughput: 0: 1701.8, 1: 1736.8. Samples: 26786380. Policy #0 lag: (min: 6.0, avg: 6.0, max: 9.0) [2023-10-08 02:00:51,211][50642] Avg episode reward: [(0, '20.180'), (1, '24.020')] [2023-10-08 02:00:51,411][52060] Updated weights for policy 0, policy_version 51980 (0.0007) [2023-10-08 02:00:51,777][52060] Updated weights for policy 0, policy_version 51990 (0.0007) [2023-10-08 02:00:52,145][52060] Updated weights for policy 0, policy_version 52000 (0.0010) [2023-10-08 02:00:53,026][52059] Updated weights for policy 1, policy_version 52642 (0.0008) [2023-10-08 02:00:53,397][52059] Updated weights for policy 1, policy_version 52652 (0.0007) [2023-10-08 02:00:53,754][52059] Updated weights for policy 1, policy_version 52662 (0.0008) [2023-10-08 02:00:54,112][52059] Updated weights for policy 1, policy_version 52672 (0.0007) [2023-10-08 02:00:56,027][52060] Updated weights for policy 0, policy_version 52010 (0.0007) [2023-10-08 02:00:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 107184128. Throughput: 0: 1711.6, 1: 1714.3. Samples: 26807194. Policy #0 lag: (min: 6.0, avg: 6.0, max: 9.0) [2023-10-08 02:00:56,211][50642] Avg episode reward: [(0, '18.600'), (1, '22.260')] [2023-10-08 02:00:56,401][52060] Updated weights for policy 0, policy_version 52020 (0.0007) [2023-10-08 02:00:56,772][52060] Updated weights for policy 0, policy_version 52030 (0.0007) [2023-10-08 02:00:58,011][52059] Updated weights for policy 1, policy_version 52682 (0.0008) [2023-10-08 02:00:58,379][52059] Updated weights for policy 1, policy_version 52692 (0.0008) [2023-10-08 02:00:58,745][52059] Updated weights for policy 1, policy_version 52702 (0.0008) [2023-10-08 02:01:00,571][52060] Updated weights for policy 0, policy_version 52040 (0.0010) [2023-10-08 02:01:00,946][52060] Updated weights for policy 0, policy_version 52050 (0.0011) [2023-10-08 02:01:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 107249664. Throughput: 0: 1704.6, 1: 1735.9. Samples: 26828224. Policy #0 lag: (min: 6.0, avg: 6.0, max: 9.0) [2023-10-08 02:01:01,211][50642] Avg episode reward: [(0, '17.970'), (1, '19.790')] [2023-10-08 02:01:01,309][52060] Updated weights for policy 0, policy_version 52060 (0.0010) [2023-10-08 02:01:02,573][52059] Updated weights for policy 1, policy_version 52712 (0.0009) [2023-10-08 02:01:02,941][52059] Updated weights for policy 1, policy_version 52722 (0.0007) [2023-10-08 02:01:03,297][52059] Updated weights for policy 1, policy_version 52732 (0.0008) [2023-10-08 02:01:05,186][52060] Updated weights for policy 0, policy_version 52070 (0.0007) [2023-10-08 02:01:05,560][52060] Updated weights for policy 0, policy_version 52080 (0.0010) [2023-10-08 02:01:05,926][52060] Updated weights for policy 0, policy_version 52090 (0.0007) [2023-10-08 02:01:06,210][50642] Fps is (10 sec: 16383.4, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 107347968. Throughput: 0: 1723.1, 1: 1708.4. Samples: 26838242. Policy #0 lag: (min: 6.0, avg: 6.0, max: 9.0) [2023-10-08 02:01:06,212][50642] Avg episode reward: [(0, '19.170'), (1, '23.230')] [2023-10-08 02:01:07,226][52059] Updated weights for policy 1, policy_version 52742 (0.0007) [2023-10-08 02:01:07,591][52059] Updated weights for policy 1, policy_version 52752 (0.0007) [2023-10-08 02:01:07,959][52059] Updated weights for policy 1, policy_version 52762 (0.0009) [2023-10-08 02:01:10,062][52060] Updated weights for policy 0, policy_version 52100 (0.0007) [2023-10-08 02:01:10,427][52060] Updated weights for policy 0, policy_version 52110 (0.0007) [2023-10-08 02:01:10,787][52060] Updated weights for policy 0, policy_version 52120 (0.0008) [2023-10-08 02:01:11,210][50642] Fps is (10 sec: 16383.5, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 107413504. Throughput: 0: 1720.7, 1: 1720.8. Samples: 26859836. Policy #0 lag: (min: 22.0, avg: 22.7, max: 39.0) [2023-10-08 02:01:11,211][50642] Avg episode reward: [(0, '19.150'), (1, '21.890')] [2023-10-08 02:01:11,702][52059] Updated weights for policy 1, policy_version 52772 (0.0009) [2023-10-08 02:01:12,076][52059] Updated weights for policy 1, policy_version 52782 (0.0010) [2023-10-08 02:01:12,445][52059] Updated weights for policy 1, policy_version 52792 (0.0009) [2023-10-08 02:01:14,652][52060] Updated weights for policy 0, policy_version 52130 (0.0010) [2023-10-08 02:01:15,012][52060] Updated weights for policy 0, policy_version 52140 (0.0010) [2023-10-08 02:01:15,380][52060] Updated weights for policy 0, policy_version 52150 (0.0011) [2023-10-08 02:01:15,746][52060] Updated weights for policy 0, policy_version 52160 (0.0010) [2023-10-08 02:01:16,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 107479040. Throughput: 0: 1695.8, 1: 1747.1. Samples: 26880256. Policy #0 lag: (min: 22.0, avg: 22.7, max: 39.0) [2023-10-08 02:01:16,211][50642] Avg episode reward: [(0, '18.990'), (1, '22.850')] [2023-10-08 02:01:16,536][52059] Updated weights for policy 1, policy_version 52802 (0.0008) [2023-10-08 02:01:16,905][52059] Updated weights for policy 1, policy_version 52812 (0.0008) [2023-10-08 02:01:17,269][52059] Updated weights for policy 1, policy_version 52822 (0.0008) [2023-10-08 02:01:17,623][52059] Updated weights for policy 1, policy_version 52832 (0.0008) [2023-10-08 02:01:19,765][52060] Updated weights for policy 0, policy_version 52170 (0.0009) [2023-10-08 02:01:20,140][52060] Updated weights for policy 0, policy_version 52180 (0.0009) [2023-10-08 02:01:20,515][52060] Updated weights for policy 0, policy_version 52190 (0.0011) [2023-10-08 02:01:21,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 107544576. Throughput: 0: 1726.1, 1: 1716.2. Samples: 26890952. Policy #0 lag: (min: 22.0, avg: 22.7, max: 39.0) [2023-10-08 02:01:21,211][50642] Avg episode reward: [(0, '19.760'), (1, '23.100')] [2023-10-08 02:01:21,648][52059] Updated weights for policy 1, policy_version 52842 (0.0007) [2023-10-08 02:01:22,023][52059] Updated weights for policy 1, policy_version 52852 (0.0008) [2023-10-08 02:01:22,386][52059] Updated weights for policy 1, policy_version 52862 (0.0007) [2023-10-08 02:01:24,513][52060] Updated weights for policy 0, policy_version 52200 (0.0009) [2023-10-08 02:01:24,887][52060] Updated weights for policy 0, policy_version 52210 (0.0007) [2023-10-08 02:01:25,250][52060] Updated weights for policy 0, policy_version 52220 (0.0009) [2023-10-08 02:01:26,141][52059] Updated weights for policy 1, policy_version 52872 (0.0007) [2023-10-08 02:01:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 107610112. Throughput: 0: 1708.6, 1: 1745.2. Samples: 26911638. Policy #0 lag: (min: 22.0, avg: 22.7, max: 39.0) [2023-10-08 02:01:26,211][50642] Avg episode reward: [(0, '20.340'), (1, '23.120')] [2023-10-08 02:01:26,503][52059] Updated weights for policy 1, policy_version 52882 (0.0007) [2023-10-08 02:01:26,875][52059] Updated weights for policy 1, policy_version 52892 (0.0007) [2023-10-08 02:01:29,269][52060] Updated weights for policy 0, policy_version 52230 (0.0008) [2023-10-08 02:01:29,631][52060] Updated weights for policy 0, policy_version 52240 (0.0008) [2023-10-08 02:01:30,003][52060] Updated weights for policy 0, policy_version 52250 (0.0008) [2023-10-08 02:01:30,770][52059] Updated weights for policy 1, policy_version 52902 (0.0010) [2023-10-08 02:01:31,140][52059] Updated weights for policy 1, policy_version 52912 (0.0010) [2023-10-08 02:01:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 107675648. Throughput: 0: 1705.7, 1: 1750.0. Samples: 26932130. Policy #0 lag: (min: 22.0, avg: 22.7, max: 39.0) [2023-10-08 02:01:31,211][50642] Avg episode reward: [(0, '19.960'), (1, '23.820')] [2023-10-08 02:01:31,502][52059] Updated weights for policy 1, policy_version 52922 (0.0010) [2023-10-08 02:01:33,717][52060] Updated weights for policy 0, policy_version 52260 (0.0008) [2023-10-08 02:01:34,090][52060] Updated weights for policy 0, policy_version 52270 (0.0008) [2023-10-08 02:01:34,464][52060] Updated weights for policy 0, policy_version 52280 (0.0007) [2023-10-08 02:01:35,519][52059] Updated weights for policy 1, policy_version 52932 (0.0010) [2023-10-08 02:01:35,886][52059] Updated weights for policy 1, policy_version 52942 (0.0008) [2023-10-08 02:01:36,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 107741184. Throughput: 0: 1733.2, 1: 1747.1. Samples: 26942994. Policy #0 lag: (min: 22.0, avg: 22.7, max: 39.0) [2023-10-08 02:01:36,211][50642] Avg episode reward: [(0, '19.870'), (1, '23.050')] [2023-10-08 02:01:36,248][52059] Updated weights for policy 1, policy_version 52952 (0.0009) [2023-10-08 02:01:38,459][52060] Updated weights for policy 0, policy_version 52290 (0.0008) [2023-10-08 02:01:38,827][52060] Updated weights for policy 0, policy_version 52300 (0.0007) [2023-10-08 02:01:39,190][52060] Updated weights for policy 0, policy_version 52310 (0.0007) [2023-10-08 02:01:39,550][52060] Updated weights for policy 0, policy_version 52320 (0.0007) [2023-10-08 02:01:40,104][52059] Updated weights for policy 1, policy_version 52962 (0.0011) [2023-10-08 02:01:40,473][52059] Updated weights for policy 1, policy_version 52972 (0.0009) [2023-10-08 02:01:40,836][52059] Updated weights for policy 1, policy_version 52982 (0.0010) [2023-10-08 02:01:41,204][52059] Updated weights for policy 1, policy_version 52992 (0.0011) [2023-10-08 02:01:41,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 107839488. Throughput: 0: 1710.5, 1: 1763.1. Samples: 26963508. Policy #0 lag: (min: 22.0, avg: 22.7, max: 39.0) [2023-10-08 02:01:41,211][50642] Avg episode reward: [(0, '20.730'), (1, '24.480')] [2023-10-08 02:01:43,569][52060] Updated weights for policy 0, policy_version 52330 (0.0008) [2023-10-08 02:01:43,942][52060] Updated weights for policy 0, policy_version 52340 (0.0008) [2023-10-08 02:01:44,305][52060] Updated weights for policy 0, policy_version 52350 (0.0007) [2023-10-08 02:01:45,147][52059] Updated weights for policy 1, policy_version 53002 (0.0010) [2023-10-08 02:01:45,514][52059] Updated weights for policy 1, policy_version 53012 (0.0011) [2023-10-08 02:01:45,887][52059] Updated weights for policy 1, policy_version 53022 (0.0009) [2023-10-08 02:01:46,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 107905024. Throughput: 0: 1717.3, 1: 1733.6. Samples: 26983516. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:01:46,211][50642] Avg episode reward: [(0, '19.670'), (1, '23.120')] [2023-10-08 02:01:48,112][52060] Updated weights for policy 0, policy_version 52360 (0.0007) [2023-10-08 02:01:48,479][52060] Updated weights for policy 0, policy_version 52370 (0.0008) [2023-10-08 02:01:48,848][52060] Updated weights for policy 0, policy_version 52380 (0.0007) [2023-10-08 02:01:49,871][52059] Updated weights for policy 1, policy_version 53032 (0.0010) [2023-10-08 02:01:50,242][52059] Updated weights for policy 1, policy_version 53042 (0.0011) [2023-10-08 02:01:50,606][52059] Updated weights for policy 1, policy_version 53052 (0.0008) [2023-10-08 02:01:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 107970560. Throughput: 0: 1708.2, 1: 1759.6. Samples: 26994292. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:01:51,211][50642] Avg episode reward: [(0, '20.590'), (1, '23.000')] [2023-10-08 02:01:52,807][52060] Updated weights for policy 0, policy_version 52390 (0.0008) [2023-10-08 02:01:53,164][52060] Updated weights for policy 0, policy_version 52400 (0.0008) [2023-10-08 02:01:53,540][52060] Updated weights for policy 0, policy_version 52410 (0.0007) [2023-10-08 02:01:54,555][52059] Updated weights for policy 1, policy_version 53062 (0.0009) [2023-10-08 02:01:54,912][52059] Updated weights for policy 1, policy_version 53072 (0.0008) [2023-10-08 02:01:55,272][52059] Updated weights for policy 1, policy_version 53082 (0.0010) [2023-10-08 02:01:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 108036096. Throughput: 0: 1703.0, 1: 1742.2. Samples: 27014870. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:01:56,211][50642] Avg episode reward: [(0, '20.480'), (1, '25.180')] [2023-10-08 02:01:57,624][52060] Updated weights for policy 0, policy_version 52420 (0.0009) [2023-10-08 02:01:57,999][52060] Updated weights for policy 0, policy_version 52430 (0.0008) [2023-10-08 02:01:58,370][52060] Updated weights for policy 0, policy_version 52440 (0.0009) [2023-10-08 02:01:59,181][52059] Updated weights for policy 1, policy_version 53092 (0.0008) [2023-10-08 02:01:59,543][52059] Updated weights for policy 1, policy_version 53102 (0.0007) [2023-10-08 02:01:59,912][52059] Updated weights for policy 1, policy_version 53112 (0.0007) [2023-10-08 02:02:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 108101632. Throughput: 0: 1732.6, 1: 1722.5. Samples: 27035738. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:02:01,211][50642] Avg episode reward: [(0, '19.860'), (1, '21.830')] [2023-10-08 02:02:02,412][52060] Updated weights for policy 0, policy_version 52450 (0.0009) [2023-10-08 02:02:02,794][52060] Updated weights for policy 0, policy_version 52460 (0.0007) [2023-10-08 02:02:03,166][52060] Updated weights for policy 0, policy_version 52470 (0.0007) [2023-10-08 02:02:03,525][52059] Updated weights for policy 1, policy_version 53122 (0.0007) [2023-10-08 02:02:03,533][52060] Updated weights for policy 0, policy_version 52480 (0.0007) [2023-10-08 02:02:03,892][52059] Updated weights for policy 1, policy_version 53132 (0.0007) [2023-10-08 02:02:04,256][52059] Updated weights for policy 1, policy_version 53142 (0.0008) [2023-10-08 02:02:04,615][52059] Updated weights for policy 1, policy_version 53152 (0.0010) [2023-10-08 02:02:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13884.7). Total num frames: 108167168. Throughput: 0: 1698.8, 1: 1747.2. Samples: 27046022. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:02:06,211][50642] Avg episode reward: [(0, '20.110'), (1, '21.890')] [2023-10-08 02:02:07,582][52060] Updated weights for policy 0, policy_version 52490 (0.0008) [2023-10-08 02:02:07,949][52060] Updated weights for policy 0, policy_version 52500 (0.0008) [2023-10-08 02:02:08,316][52060] Updated weights for policy 0, policy_version 52510 (0.0008) [2023-10-08 02:02:08,526][52059] Updated weights for policy 1, policy_version 53162 (0.0008) [2023-10-08 02:02:08,884][52059] Updated weights for policy 1, policy_version 53172 (0.0007) [2023-10-08 02:02:09,253][52059] Updated weights for policy 1, policy_version 53182 (0.0009) [2023-10-08 02:02:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13884.7). Total num frames: 108232704. Throughput: 0: 1718.6, 1: 1726.7. Samples: 27066678. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:02:11,211][50642] Avg episode reward: [(0, '21.120'), (1, '22.890')] [2023-10-08 02:02:12,178][52060] Updated weights for policy 0, policy_version 52520 (0.0007) [2023-10-08 02:02:12,542][52060] Updated weights for policy 0, policy_version 52530 (0.0007) [2023-10-08 02:02:12,918][52060] Updated weights for policy 0, policy_version 52540 (0.0008) [2023-10-08 02:02:13,333][52059] Updated weights for policy 1, policy_version 53192 (0.0008) [2023-10-08 02:02:13,695][52059] Updated weights for policy 1, policy_version 53202 (0.0008) [2023-10-08 02:02:14,047][52059] Updated weights for policy 1, policy_version 53212 (0.0010) [2023-10-08 02:02:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 108298240. Throughput: 0: 1731.6, 1: 1732.4. Samples: 27088012. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:02:16,211][50642] Avg episode reward: [(0, '19.030'), (1, '25.380')] [2023-10-08 02:02:16,752][52060] Updated weights for policy 0, policy_version 52550 (0.0010) [2023-10-08 02:02:17,114][52060] Updated weights for policy 0, policy_version 52560 (0.0010) [2023-10-08 02:02:17,494][52060] Updated weights for policy 0, policy_version 52570 (0.0011) [2023-10-08 02:02:17,953][52059] Updated weights for policy 1, policy_version 53222 (0.0008) [2023-10-08 02:02:18,336][52059] Updated weights for policy 1, policy_version 53232 (0.0008) [2023-10-08 02:02:18,710][52059] Updated weights for policy 1, policy_version 53242 (0.0007) [2023-10-08 02:02:21,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 108363776. Throughput: 0: 1705.5, 1: 1725.0. Samples: 27097364. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:02:21,211][50642] Avg episode reward: [(0, '20.150'), (1, '20.470')] [2023-10-08 02:02:21,676][52060] Updated weights for policy 0, policy_version 52580 (0.0011) [2023-10-08 02:02:22,050][52060] Updated weights for policy 0, policy_version 52590 (0.0008) [2023-10-08 02:02:22,422][52060] Updated weights for policy 0, policy_version 52600 (0.0008) [2023-10-08 02:02:22,511][52059] Updated weights for policy 1, policy_version 53252 (0.0007) [2023-10-08 02:02:22,880][52059] Updated weights for policy 1, policy_version 53262 (0.0009) [2023-10-08 02:02:23,251][52059] Updated weights for policy 1, policy_version 53272 (0.0010) [2023-10-08 02:02:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 108429312. Throughput: 0: 1726.7, 1: 1720.0. Samples: 27118608. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) [2023-10-08 02:02:26,211][50642] Avg episode reward: [(0, '22.610'), (1, '22.320')] [2023-10-08 02:02:26,382][52060] Updated weights for policy 0, policy_version 52610 (0.0009) [2023-10-08 02:02:26,749][52060] Updated weights for policy 0, policy_version 52620 (0.0008) [2023-10-08 02:02:27,124][52060] Updated weights for policy 0, policy_version 52630 (0.0008) [2023-10-08 02:02:27,216][52059] Updated weights for policy 1, policy_version 53282 (0.0008) [2023-10-08 02:02:27,487][52060] Updated weights for policy 0, policy_version 52640 (0.0008) [2023-10-08 02:02:27,581][52059] Updated weights for policy 1, policy_version 53292 (0.0009) [2023-10-08 02:02:27,944][52059] Updated weights for policy 1, policy_version 53302 (0.0008) [2023-10-08 02:02:28,303][52059] Updated weights for policy 1, policy_version 53312 (0.0008) [2023-10-08 02:02:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 108494848. Throughput: 0: 1727.6, 1: 1749.8. Samples: 27139998. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) [2023-10-08 02:02:31,211][50642] Avg episode reward: [(0, '20.100'), (1, '25.580')] [2023-10-08 02:02:31,353][52060] Updated weights for policy 0, policy_version 52650 (0.0008) [2023-10-08 02:02:31,717][52060] Updated weights for policy 0, policy_version 52660 (0.0009) [2023-10-08 02:02:32,097][52060] Updated weights for policy 0, policy_version 52670 (0.0008) [2023-10-08 02:02:32,193][52059] Updated weights for policy 1, policy_version 53322 (0.0007) [2023-10-08 02:02:32,556][52059] Updated weights for policy 1, policy_version 53332 (0.0008) [2023-10-08 02:02:32,925][52059] Updated weights for policy 1, policy_version 53342 (0.0007) [2023-10-08 02:02:35,994][52060] Updated weights for policy 0, policy_version 52680 (0.0009) [2023-10-08 02:02:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 108560384. Throughput: 0: 1718.2, 1: 1727.0. Samples: 27149326. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) [2023-10-08 02:02:36,211][50642] Avg episode reward: [(0, '19.220'), (1, '23.300')] [2023-10-08 02:02:36,352][52060] Updated weights for policy 0, policy_version 52690 (0.0008) [2023-10-08 02:02:36,722][52060] Updated weights for policy 0, policy_version 52700 (0.0007) [2023-10-08 02:02:36,734][52059] Updated weights for policy 1, policy_version 53352 (0.0007) [2023-10-08 02:02:37,096][52059] Updated weights for policy 1, policy_version 53362 (0.0007) [2023-10-08 02:02:37,464][52059] Updated weights for policy 1, policy_version 53372 (0.0007) [2023-10-08 02:02:40,648][52060] Updated weights for policy 0, policy_version 52710 (0.0008) [2023-10-08 02:02:41,023][52060] Updated weights for policy 0, policy_version 52720 (0.0009) [2023-10-08 02:02:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 108625920. Throughput: 0: 1725.0, 1: 1740.6. Samples: 27170824. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) [2023-10-08 02:02:41,211][50642] Avg episode reward: [(0, '22.110'), (1, '20.010')] [2023-10-08 02:02:41,390][52059] Updated weights for policy 1, policy_version 53382 (0.0009) [2023-10-08 02:02:41,390][52060] Updated weights for policy 0, policy_version 52730 (0.0008) [2023-10-08 02:02:41,757][52059] Updated weights for policy 1, policy_version 53392 (0.0010) [2023-10-08 02:02:42,121][52059] Updated weights for policy 1, policy_version 53402 (0.0009) [2023-10-08 02:02:45,298][52060] Updated weights for policy 0, policy_version 52740 (0.0007) [2023-10-08 02:02:45,659][52060] Updated weights for policy 0, policy_version 52750 (0.0008) [2023-10-08 02:02:46,034][52060] Updated weights for policy 0, policy_version 52760 (0.0008) [2023-10-08 02:02:46,154][52059] Updated weights for policy 1, policy_version 53412 (0.0008) [2023-10-08 02:02:46,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 108691456. Throughput: 0: 1708.0, 1: 1755.0. Samples: 27191576. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) [2023-10-08 02:02:46,212][50642] Avg episode reward: [(0, '21.440'), (1, '20.040')] [2023-10-08 02:02:46,326][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000052768_54034432.pth... [2023-10-08 02:02:46,367][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000051136_52363264.pth [2023-10-08 02:02:46,514][52059] Updated weights for policy 1, policy_version 53422 (0.0009) [2023-10-08 02:02:46,879][52059] Updated weights for policy 1, policy_version 53432 (0.0008) [2023-10-08 02:02:47,172][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000053440_54722560.pth... [2023-10-08 02:02:47,201][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000051808_53051392.pth [2023-10-08 02:02:50,115][52060] Updated weights for policy 0, policy_version 52770 (0.0009) [2023-10-08 02:02:50,484][52060] Updated weights for policy 0, policy_version 52780 (0.0009) [2023-10-08 02:02:50,843][52060] Updated weights for policy 0, policy_version 52790 (0.0010) [2023-10-08 02:02:50,870][52059] Updated weights for policy 1, policy_version 53442 (0.0010) [2023-10-08 02:02:51,207][52060] Updated weights for policy 0, policy_version 52800 (0.0007) [2023-10-08 02:02:51,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 108789760. Throughput: 0: 1727.6, 1: 1730.7. Samples: 27201644. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) [2023-10-08 02:02:51,211][50642] Avg episode reward: [(0, '18.010'), (1, '26.040')] [2023-10-08 02:02:51,244][52059] Updated weights for policy 1, policy_version 53452 (0.0008) [2023-10-08 02:02:51,604][52059] Updated weights for policy 1, policy_version 53462 (0.0009) [2023-10-08 02:02:51,965][52059] Updated weights for policy 1, policy_version 53472 (0.0009) [2023-10-08 02:02:55,039][52060] Updated weights for policy 0, policy_version 52810 (0.0007) [2023-10-08 02:02:55,413][52060] Updated weights for policy 0, policy_version 52820 (0.0010) [2023-10-08 02:02:55,773][52060] Updated weights for policy 0, policy_version 52830 (0.0008) [2023-10-08 02:02:55,811][52059] Updated weights for policy 1, policy_version 53482 (0.0008) [2023-10-08 02:02:56,182][52059] Updated weights for policy 1, policy_version 53492 (0.0007) [2023-10-08 02:02:56,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.6). Total num frames: 108855296. Throughput: 0: 1721.4, 1: 1748.8. Samples: 27222838. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:02:56,211][50642] Avg episode reward: [(0, '19.590'), (1, '23.340')] [2023-10-08 02:02:56,548][52059] Updated weights for policy 1, policy_version 53502 (0.0008) [2023-10-08 02:02:59,882][52060] Updated weights for policy 0, policy_version 52840 (0.0010) [2023-10-08 02:03:00,173][52059] Updated weights for policy 1, policy_version 53512 (0.0007) [2023-10-08 02:03:00,244][52060] Updated weights for policy 0, policy_version 52850 (0.0009) [2023-10-08 02:03:00,540][52059] Updated weights for policy 1, policy_version 53522 (0.0008) [2023-10-08 02:03:00,612][52060] Updated weights for policy 0, policy_version 52860 (0.0008) [2023-10-08 02:03:00,895][52059] Updated weights for policy 1, policy_version 53532 (0.0011) [2023-10-08 02:03:01,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 108953600. Throughput: 0: 1696.6, 1: 1732.1. Samples: 27242302. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:03:01,211][50642] Avg episode reward: [(0, '21.200'), (1, '20.300')] [2023-10-08 02:03:04,662][52060] Updated weights for policy 0, policy_version 52870 (0.0009) [2023-10-08 02:03:04,833][52059] Updated weights for policy 1, policy_version 53542 (0.0007) [2023-10-08 02:03:05,033][52060] Updated weights for policy 0, policy_version 52880 (0.0007) [2023-10-08 02:03:05,200][52059] Updated weights for policy 1, policy_version 53552 (0.0008) [2023-10-08 02:03:05,399][52060] Updated weights for policy 0, policy_version 52890 (0.0008) [2023-10-08 02:03:05,556][52059] Updated weights for policy 1, policy_version 53562 (0.0008) [2023-10-08 02:03:06,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 109019136. Throughput: 0: 1723.5, 1: 1757.2. Samples: 27253994. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:03:06,211][50642] Avg episode reward: [(0, '18.130'), (1, '21.780')] [2023-10-08 02:03:09,283][52059] Updated weights for policy 1, policy_version 53572 (0.0008) [2023-10-08 02:03:09,542][52060] Updated weights for policy 0, policy_version 52900 (0.0008) [2023-10-08 02:03:09,638][52059] Updated weights for policy 1, policy_version 53582 (0.0008) [2023-10-08 02:03:09,944][52060] Updated weights for policy 0, policy_version 52910 (0.0008) [2023-10-08 02:03:10,003][52059] Updated weights for policy 1, policy_version 53592 (0.0008) [2023-10-08 02:03:10,315][52060] Updated weights for policy 0, policy_version 52920 (0.0008) [2023-10-08 02:03:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 109084672. Throughput: 0: 1708.5, 1: 1744.7. Samples: 27274002. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:03:11,211][50642] Avg episode reward: [(0, '19.960'), (1, '25.300')] [2023-10-08 02:03:13,896][52059] Updated weights for policy 1, policy_version 53602 (0.0008) [2023-10-08 02:03:14,256][52059] Updated weights for policy 1, policy_version 53612 (0.0010) [2023-10-08 02:03:14,299][52060] Updated weights for policy 0, policy_version 52930 (0.0008) [2023-10-08 02:03:14,618][52059] Updated weights for policy 1, policy_version 53622 (0.0007) [2023-10-08 02:03:14,665][52060] Updated weights for policy 0, policy_version 52940 (0.0008) [2023-10-08 02:03:14,985][52059] Updated weights for policy 1, policy_version 53632 (0.0007) [2023-10-08 02:03:15,026][52060] Updated weights for policy 0, policy_version 52950 (0.0009) [2023-10-08 02:03:15,398][52060] Updated weights for policy 0, policy_version 52960 (0.0010) [2023-10-08 02:03:16,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 109150208. Throughput: 0: 1687.4, 1: 1735.4. Samples: 27294026. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:03:16,211][50642] Avg episode reward: [(0, '21.380'), (1, '25.630')] [2023-10-08 02:03:18,969][52059] Updated weights for policy 1, policy_version 53642 (0.0008) [2023-10-08 02:03:19,328][52059] Updated weights for policy 1, policy_version 53652 (0.0007) [2023-10-08 02:03:19,475][52060] Updated weights for policy 0, policy_version 52970 (0.0007) [2023-10-08 02:03:19,697][52059] Updated weights for policy 1, policy_version 53662 (0.0008) [2023-10-08 02:03:19,838][52060] Updated weights for policy 0, policy_version 52980 (0.0010) [2023-10-08 02:03:20,206][52060] Updated weights for policy 0, policy_version 52990 (0.0009) [2023-10-08 02:03:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 109215744. Throughput: 0: 1714.7, 1: 1754.5. Samples: 27305440. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:03:21,211][50642] Avg episode reward: [(0, '20.390'), (1, '20.730')] [2023-10-08 02:03:23,675][52059] Updated weights for policy 1, policy_version 53672 (0.0010) [2023-10-08 02:03:24,046][52059] Updated weights for policy 1, policy_version 53682 (0.0009) [2023-10-08 02:03:24,117][52060] Updated weights for policy 0, policy_version 53000 (0.0009) [2023-10-08 02:03:24,404][52059] Updated weights for policy 1, policy_version 53692 (0.0008) [2023-10-08 02:03:24,481][52060] Updated weights for policy 0, policy_version 53010 (0.0008) [2023-10-08 02:03:24,850][52060] Updated weights for policy 0, policy_version 53020 (0.0009) [2023-10-08 02:03:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 109281280. Throughput: 0: 1689.2, 1: 1729.9. Samples: 27324682. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:03:26,211][50642] Avg episode reward: [(0, '19.260'), (1, '21.950')] [2023-10-08 02:03:28,422][52059] Updated weights for policy 1, policy_version 53702 (0.0008) [2023-10-08 02:03:28,796][52059] Updated weights for policy 1, policy_version 53712 (0.0008) [2023-10-08 02:03:28,908][52060] Updated weights for policy 0, policy_version 53030 (0.0008) [2023-10-08 02:03:29,156][52059] Updated weights for policy 1, policy_version 53722 (0.0008) [2023-10-08 02:03:29,265][52060] Updated weights for policy 0, policy_version 53040 (0.0008) [2023-10-08 02:03:29,637][52060] Updated weights for policy 0, policy_version 53050 (0.0007) [2023-10-08 02:03:31,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 109346816. Throughput: 0: 1700.5, 1: 1722.1. Samples: 27345592. Policy #0 lag: (min: 4.0, avg: 11.7, max: 36.0) [2023-10-08 02:03:31,211][50642] Avg episode reward: [(0, '19.980'), (1, '24.570')] [2023-10-08 02:03:33,168][52059] Updated weights for policy 1, policy_version 53732 (0.0008) [2023-10-08 02:03:33,532][52059] Updated weights for policy 1, policy_version 53742 (0.0008) [2023-10-08 02:03:33,554][52060] Updated weights for policy 0, policy_version 53060 (0.0007) [2023-10-08 02:03:33,892][52059] Updated weights for policy 1, policy_version 53752 (0.0008) [2023-10-08 02:03:33,917][52060] Updated weights for policy 0, policy_version 53070 (0.0007) [2023-10-08 02:03:34,290][52060] Updated weights for policy 0, policy_version 53080 (0.0009) [2023-10-08 02:03:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 109412352. Throughput: 0: 1703.3, 1: 1731.6. Samples: 27356218. Policy #0 lag: (min: 4.0, avg: 11.7, max: 36.0) [2023-10-08 02:03:36,211][50642] Avg episode reward: [(0, '20.710'), (1, '24.390')] [2023-10-08 02:03:37,876][52059] Updated weights for policy 1, policy_version 53762 (0.0007) [2023-10-08 02:03:38,066][52060] Updated weights for policy 0, policy_version 53090 (0.0008) [2023-10-08 02:03:38,234][52059] Updated weights for policy 1, policy_version 53772 (0.0007) [2023-10-08 02:03:38,442][52060] Updated weights for policy 0, policy_version 53100 (0.0008) [2023-10-08 02:03:38,600][52059] Updated weights for policy 1, policy_version 53782 (0.0009) [2023-10-08 02:03:38,798][52060] Updated weights for policy 0, policy_version 53110 (0.0008) [2023-10-08 02:03:38,959][52059] Updated weights for policy 1, policy_version 53792 (0.0008) [2023-10-08 02:03:39,167][52060] Updated weights for policy 0, policy_version 53120 (0.0008) [2023-10-08 02:03:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 109477888. Throughput: 0: 1691.6, 1: 1720.4. Samples: 27376376. Policy #0 lag: (min: 4.0, avg: 11.7, max: 36.0) [2023-10-08 02:03:41,211][50642] Avg episode reward: [(0, '20.580'), (1, '21.130')] [2023-10-08 02:03:42,868][52059] Updated weights for policy 1, policy_version 53802 (0.0009) [2023-10-08 02:03:43,224][52060] Updated weights for policy 0, policy_version 53130 (0.0008) [2023-10-08 02:03:43,242][52059] Updated weights for policy 1, policy_version 53812 (0.0007) [2023-10-08 02:03:43,589][52060] Updated weights for policy 0, policy_version 53140 (0.0007) [2023-10-08 02:03:43,610][52059] Updated weights for policy 1, policy_version 53822 (0.0008) [2023-10-08 02:03:43,968][52060] Updated weights for policy 0, policy_version 53150 (0.0010) [2023-10-08 02:03:46,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 109543424. Throughput: 0: 1711.5, 1: 1738.1. Samples: 27397536. Policy #0 lag: (min: 4.0, avg: 11.7, max: 36.0) [2023-10-08 02:03:46,211][50642] Avg episode reward: [(0, '21.640'), (1, '22.520')] [2023-10-08 02:03:47,637][52059] Updated weights for policy 1, policy_version 53832 (0.0008) [2023-10-08 02:03:48,000][52059] Updated weights for policy 1, policy_version 53842 (0.0009) [2023-10-08 02:03:48,090][52060] Updated weights for policy 0, policy_version 53160 (0.0008) [2023-10-08 02:03:48,370][52059] Updated weights for policy 1, policy_version 53852 (0.0008) [2023-10-08 02:03:48,457][52060] Updated weights for policy 0, policy_version 53170 (0.0010) [2023-10-08 02:03:48,829][52060] Updated weights for policy 0, policy_version 53180 (0.0009) [2023-10-08 02:03:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 109608960. Throughput: 0: 1687.9, 1: 1713.0. Samples: 27407034. Policy #0 lag: (min: 4.0, avg: 11.7, max: 36.0) [2023-10-08 02:03:51,211][50642] Avg episode reward: [(0, '21.010'), (1, '22.300')] [2023-10-08 02:03:52,328][52059] Updated weights for policy 1, policy_version 53862 (0.0008) [2023-10-08 02:03:52,688][52059] Updated weights for policy 1, policy_version 53872 (0.0008) [2023-10-08 02:03:52,698][52060] Updated weights for policy 0, policy_version 53190 (0.0008) [2023-10-08 02:03:53,058][52059] Updated weights for policy 1, policy_version 53882 (0.0008) [2023-10-08 02:03:53,064][52060] Updated weights for policy 0, policy_version 53200 (0.0008) [2023-10-08 02:03:53,436][52060] Updated weights for policy 0, policy_version 53210 (0.0007) [2023-10-08 02:03:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 109674496. Throughput: 0: 1705.1, 1: 1729.1. Samples: 27428540. Policy #0 lag: (min: 4.0, avg: 11.7, max: 36.0) [2023-10-08 02:03:56,211][50642] Avg episode reward: [(0, '20.480'), (1, '25.740')] [2023-10-08 02:03:56,991][52059] Updated weights for policy 1, policy_version 53892 (0.0007) [2023-10-08 02:03:57,389][52059] Updated weights for policy 1, policy_version 53902 (0.0008) [2023-10-08 02:03:57,510][52060] Updated weights for policy 0, policy_version 53220 (0.0008) [2023-10-08 02:03:57,758][52059] Updated weights for policy 1, policy_version 53912 (0.0008) [2023-10-08 02:03:57,897][52060] Updated weights for policy 0, policy_version 53230 (0.0009) [2023-10-08 02:03:58,274][52060] Updated weights for policy 0, policy_version 53240 (0.0008) [2023-10-08 02:04:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 109740032. Throughput: 0: 1718.4, 1: 1732.7. Samples: 27449330. Policy #0 lag: (min: 4.0, avg: 11.7, max: 36.0) [2023-10-08 02:04:01,211][50642] Avg episode reward: [(0, '21.740'), (1, '24.030')] [2023-10-08 02:04:01,720][52059] Updated weights for policy 1, policy_version 53922 (0.0008) [2023-10-08 02:04:02,089][52059] Updated weights for policy 1, policy_version 53932 (0.0007) [2023-10-08 02:04:02,330][52060] Updated weights for policy 0, policy_version 53250 (0.0008) [2023-10-08 02:04:02,449][52059] Updated weights for policy 1, policy_version 53942 (0.0007) [2023-10-08 02:04:02,689][52060] Updated weights for policy 0, policy_version 53260 (0.0009) [2023-10-08 02:04:02,810][52059] Updated weights for policy 1, policy_version 53952 (0.0008) [2023-10-08 02:04:03,062][52060] Updated weights for policy 0, policy_version 53270 (0.0008) [2023-10-08 02:04:03,431][52060] Updated weights for policy 0, policy_version 53280 (0.0007) [2023-10-08 02:04:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 109805568. Throughput: 0: 1693.8, 1: 1715.0. Samples: 27458836. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:04:06,211][50642] Avg episode reward: [(0, '20.930'), (1, '23.080')] [2023-10-08 02:04:06,688][52059] Updated weights for policy 1, policy_version 53962 (0.0009) [2023-10-08 02:04:07,070][52059] Updated weights for policy 1, policy_version 53972 (0.0010) [2023-10-08 02:04:07,381][52060] Updated weights for policy 0, policy_version 53290 (0.0008) [2023-10-08 02:04:07,440][52059] Updated weights for policy 1, policy_version 53982 (0.0009) [2023-10-08 02:04:07,755][52060] Updated weights for policy 0, policy_version 53300 (0.0008) [2023-10-08 02:04:08,122][52060] Updated weights for policy 0, policy_version 53310 (0.0011) [2023-10-08 02:04:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 109871104. Throughput: 0: 1717.2, 1: 1736.3. Samples: 27480088. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:04:11,211][50642] Avg episode reward: [(0, '19.340'), (1, '24.660')] [2023-10-08 02:04:11,391][52059] Updated weights for policy 1, policy_version 53992 (0.0008) [2023-10-08 02:04:11,749][52059] Updated weights for policy 1, policy_version 54002 (0.0007) [2023-10-08 02:04:11,987][52060] Updated weights for policy 0, policy_version 53320 (0.0009) [2023-10-08 02:04:12,112][52059] Updated weights for policy 1, policy_version 54012 (0.0008) [2023-10-08 02:04:12,355][52060] Updated weights for policy 0, policy_version 53330 (0.0007) [2023-10-08 02:04:12,732][52060] Updated weights for policy 0, policy_version 53340 (0.0007) [2023-10-08 02:04:16,019][52059] Updated weights for policy 1, policy_version 54022 (0.0007) [2023-10-08 02:04:16,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 109936640. Throughput: 0: 1720.9, 1: 1740.7. Samples: 27501362. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:04:16,211][50642] Avg episode reward: [(0, '20.010'), (1, '24.200')] [2023-10-08 02:04:16,393][52059] Updated weights for policy 1, policy_version 54032 (0.0009) [2023-10-08 02:04:16,710][52060] Updated weights for policy 0, policy_version 53350 (0.0009) [2023-10-08 02:04:16,756][52059] Updated weights for policy 1, policy_version 54042 (0.0009) [2023-10-08 02:04:17,085][52060] Updated weights for policy 0, policy_version 53360 (0.0010) [2023-10-08 02:04:17,455][52060] Updated weights for policy 0, policy_version 53370 (0.0009) [2023-10-08 02:04:20,704][52059] Updated weights for policy 1, policy_version 54052 (0.0008) [2023-10-08 02:04:21,064][52059] Updated weights for policy 1, policy_version 54062 (0.0011) [2023-10-08 02:04:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 110002176. Throughput: 0: 1700.3, 1: 1734.3. Samples: 27510776. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:04:21,211][50642] Avg episode reward: [(0, '20.550'), (1, '24.790')] [2023-10-08 02:04:21,429][52059] Updated weights for policy 1, policy_version 54072 (0.0010) [2023-10-08 02:04:21,454][52060] Updated weights for policy 0, policy_version 53380 (0.0009) [2023-10-08 02:04:21,825][52060] Updated weights for policy 0, policy_version 53390 (0.0007) [2023-10-08 02:04:22,182][52060] Updated weights for policy 0, policy_version 53400 (0.0008) [2023-10-08 02:04:25,413][52059] Updated weights for policy 1, policy_version 54082 (0.0007) [2023-10-08 02:04:25,780][52059] Updated weights for policy 1, policy_version 54092 (0.0009) [2023-10-08 02:04:26,134][52059] Updated weights for policy 1, policy_version 54102 (0.0010) [2023-10-08 02:04:26,176][52060] Updated weights for policy 0, policy_version 53410 (0.0009) [2023-10-08 02:04:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 110067712. Throughput: 0: 1716.1, 1: 1743.7. Samples: 27532070. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:04:26,211][50642] Avg episode reward: [(0, '20.090'), (1, '22.450')] [2023-10-08 02:04:26,497][52059] Updated weights for policy 1, policy_version 54112 (0.0007) [2023-10-08 02:04:26,542][52060] Updated weights for policy 0, policy_version 53420 (0.0009) [2023-10-08 02:04:26,906][52060] Updated weights for policy 0, policy_version 53430 (0.0008) [2023-10-08 02:04:27,273][52060] Updated weights for policy 0, policy_version 53440 (0.0010) [2023-10-08 02:04:30,506][52059] Updated weights for policy 1, policy_version 54122 (0.0007) [2023-10-08 02:04:30,872][52059] Updated weights for policy 1, policy_version 54132 (0.0007) [2023-10-08 02:04:31,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 110133248. Throughput: 0: 1722.5, 1: 1727.9. Samples: 27552802. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:04:31,211][50642] Avg episode reward: [(0, '20.290'), (1, '25.200')] [2023-10-08 02:04:31,231][52059] Updated weights for policy 1, policy_version 54142 (0.0007) [2023-10-08 02:04:31,249][52060] Updated weights for policy 0, policy_version 53450 (0.0009) [2023-10-08 02:04:31,627][52060] Updated weights for policy 0, policy_version 53460 (0.0007) [2023-10-08 02:04:31,990][52060] Updated weights for policy 0, policy_version 53470 (0.0008) [2023-10-08 02:04:35,048][52059] Updated weights for policy 1, policy_version 54152 (0.0010) [2023-10-08 02:04:35,407][52059] Updated weights for policy 1, policy_version 54162 (0.0011) [2023-10-08 02:04:35,772][52059] Updated weights for policy 1, policy_version 54172 (0.0010) [2023-10-08 02:04:36,090][52060] Updated weights for policy 0, policy_version 53480 (0.0009) [2023-10-08 02:04:36,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 110231552. Throughput: 0: 1719.4, 1: 1745.4. Samples: 27562948. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:04:36,211][50642] Avg episode reward: [(0, '20.600'), (1, '22.680')] [2023-10-08 02:04:36,451][52060] Updated weights for policy 0, policy_version 53490 (0.0010) [2023-10-08 02:04:36,826][52060] Updated weights for policy 0, policy_version 53500 (0.0007) [2023-10-08 02:04:39,689][52059] Updated weights for policy 1, policy_version 54182 (0.0009) [2023-10-08 02:04:40,055][52059] Updated weights for policy 1, policy_version 54192 (0.0007) [2023-10-08 02:04:40,415][52059] Updated weights for policy 1, policy_version 54202 (0.0009) [2023-10-08 02:04:40,632][52060] Updated weights for policy 0, policy_version 53510 (0.0009) [2023-10-08 02:04:40,998][52060] Updated weights for policy 0, policy_version 53520 (0.0010) [2023-10-08 02:04:41,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 110297088. Throughput: 0: 1718.2, 1: 1735.1. Samples: 27583938. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:04:41,211][50642] Avg episode reward: [(0, '20.070'), (1, '23.760')] [2023-10-08 02:04:41,367][52060] Updated weights for policy 0, policy_version 53530 (0.0009) [2023-10-08 02:04:44,303][52059] Updated weights for policy 1, policy_version 54212 (0.0009) [2023-10-08 02:04:44,716][52059] Updated weights for policy 1, policy_version 54222 (0.0007) [2023-10-08 02:04:45,071][52059] Updated weights for policy 1, policy_version 54232 (0.0009) [2023-10-08 02:04:45,360][52060] Updated weights for policy 0, policy_version 53540 (0.0008) [2023-10-08 02:04:45,742][52060] Updated weights for policy 0, policy_version 53550 (0.0008) [2023-10-08 02:04:46,121][52060] Updated weights for policy 0, policy_version 53560 (0.0008) [2023-10-08 02:04:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 110362624. Throughput: 0: 1712.4, 1: 1716.3. Samples: 27603622. Policy #0 lag: (min: 31.0, avg: 34.1, max: 63.0) [2023-10-08 02:04:46,211][50642] Avg episode reward: [(0, '20.450'), (1, '21.270')] [2023-10-08 02:04:46,218][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000054240_55541760.pth... [2023-10-08 02:04:46,249][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000052608_53870592.pth [2023-10-08 02:04:46,417][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000053568_54853632.pth... [2023-10-08 02:04:46,455][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000051968_53215232.pth [2023-10-08 02:04:49,004][52059] Updated weights for policy 1, policy_version 54242 (0.0009) [2023-10-08 02:04:49,363][52059] Updated weights for policy 1, policy_version 54252 (0.0007) [2023-10-08 02:04:49,729][52059] Updated weights for policy 1, policy_version 54262 (0.0007) [2023-10-08 02:04:50,046][52060] Updated weights for policy 0, policy_version 53570 (0.0008) [2023-10-08 02:04:50,089][52059] Updated weights for policy 1, policy_version 54272 (0.0007) [2023-10-08 02:04:50,418][52060] Updated weights for policy 0, policy_version 53580 (0.0007) [2023-10-08 02:04:50,781][52060] Updated weights for policy 0, policy_version 53590 (0.0008) [2023-10-08 02:04:51,156][52060] Updated weights for policy 0, policy_version 53600 (0.0008) [2023-10-08 02:04:51,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 110460928. Throughput: 0: 1723.9, 1: 1742.4. Samples: 27614818. Policy #0 lag: (min: 31.0, avg: 34.1, max: 63.0) [2023-10-08 02:04:51,211][50642] Avg episode reward: [(0, '22.650'), (1, '26.870')] [2023-10-08 02:04:51,212][51710] Saving new best policy, reward=26.870! [2023-10-08 02:04:54,086][52059] Updated weights for policy 1, policy_version 54282 (0.0008) [2023-10-08 02:04:54,445][52059] Updated weights for policy 1, policy_version 54292 (0.0007) [2023-10-08 02:04:54,806][52059] Updated weights for policy 1, policy_version 54302 (0.0011) [2023-10-08 02:04:55,128][52060] Updated weights for policy 0, policy_version 53610 (0.0010) [2023-10-08 02:04:55,496][52060] Updated weights for policy 0, policy_version 53620 (0.0009) [2023-10-08 02:04:55,869][52060] Updated weights for policy 0, policy_version 53630 (0.0007) [2023-10-08 02:04:56,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 110526464. Throughput: 0: 1725.2, 1: 1715.3. Samples: 27634910. Policy #0 lag: (min: 31.0, avg: 34.1, max: 63.0) [2023-10-08 02:04:56,211][50642] Avg episode reward: [(0, '20.950'), (1, '21.440')] [2023-10-08 02:04:58,827][52059] Updated weights for policy 1, policy_version 54312 (0.0011) [2023-10-08 02:04:59,191][52059] Updated weights for policy 1, policy_version 54322 (0.0009) [2023-10-08 02:04:59,555][52059] Updated weights for policy 1, policy_version 54332 (0.0011) [2023-10-08 02:04:59,751][52060] Updated weights for policy 0, policy_version 53640 (0.0008) [2023-10-08 02:05:00,126][52060] Updated weights for policy 0, policy_version 53650 (0.0009) [2023-10-08 02:05:00,489][52060] Updated weights for policy 0, policy_version 53660 (0.0009) [2023-10-08 02:05:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 110592000. Throughput: 0: 1698.1, 1: 1712.7. Samples: 27654846. Policy #0 lag: (min: 31.0, avg: 34.1, max: 63.0) [2023-10-08 02:05:01,211][50642] Avg episode reward: [(0, '20.740'), (1, '23.030')] [2023-10-08 02:05:03,434][52059] Updated weights for policy 1, policy_version 54342 (0.0009) [2023-10-08 02:05:03,797][52059] Updated weights for policy 1, policy_version 54352 (0.0008) [2023-10-08 02:05:04,159][52059] Updated weights for policy 1, policy_version 54362 (0.0007) [2023-10-08 02:05:04,521][52060] Updated weights for policy 0, policy_version 53670 (0.0009) [2023-10-08 02:05:04,898][52060] Updated weights for policy 0, policy_version 53680 (0.0008) [2023-10-08 02:05:05,260][52060] Updated weights for policy 0, policy_version 53690 (0.0008) [2023-10-08 02:05:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 110657536. Throughput: 0: 1728.8, 1: 1723.0. Samples: 27666110. Policy #0 lag: (min: 31.0, avg: 34.1, max: 63.0) [2023-10-08 02:05:06,211][50642] Avg episode reward: [(0, '20.750'), (1, '21.460')] [2023-10-08 02:05:08,103][52059] Updated weights for policy 1, policy_version 54372 (0.0008) [2023-10-08 02:05:08,469][52059] Updated weights for policy 1, policy_version 54382 (0.0007) [2023-10-08 02:05:08,838][52059] Updated weights for policy 1, policy_version 54392 (0.0008) [2023-10-08 02:05:09,104][52060] Updated weights for policy 0, policy_version 53700 (0.0010) [2023-10-08 02:05:09,475][52060] Updated weights for policy 0, policy_version 53710 (0.0007) [2023-10-08 02:05:09,838][52060] Updated weights for policy 0, policy_version 53720 (0.0007) [2023-10-08 02:05:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 110723072. Throughput: 0: 1709.0, 1: 1716.9. Samples: 27686234. Policy #0 lag: (min: 31.0, avg: 34.1, max: 63.0) [2023-10-08 02:05:11,211][50642] Avg episode reward: [(0, '20.200'), (1, '25.490')] [2023-10-08 02:05:12,716][52059] Updated weights for policy 1, policy_version 54402 (0.0008) [2023-10-08 02:05:13,085][52059] Updated weights for policy 1, policy_version 54412 (0.0007) [2023-10-08 02:05:13,442][52059] Updated weights for policy 1, policy_version 54422 (0.0007) [2023-10-08 02:05:13,616][52060] Updated weights for policy 0, policy_version 53730 (0.0008) [2023-10-08 02:05:13,803][52059] Updated weights for policy 1, policy_version 54432 (0.0008) [2023-10-08 02:05:13,983][52060] Updated weights for policy 0, policy_version 53740 (0.0008) [2023-10-08 02:05:14,355][52060] Updated weights for policy 0, policy_version 53750 (0.0007) [2023-10-08 02:05:14,729][52060] Updated weights for policy 0, policy_version 53760 (0.0007) [2023-10-08 02:05:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 110788608. Throughput: 0: 1702.7, 1: 1733.6. Samples: 27707436. Policy #0 lag: (min: 31.0, avg: 34.1, max: 63.0) [2023-10-08 02:05:16,211][50642] Avg episode reward: [(0, '21.490'), (1, '23.970')] [2023-10-08 02:05:17,816][52059] Updated weights for policy 1, policy_version 54442 (0.0008) [2023-10-08 02:05:18,165][52059] Updated weights for policy 1, policy_version 54452 (0.0010) [2023-10-08 02:05:18,532][52059] Updated weights for policy 1, policy_version 54462 (0.0009) [2023-10-08 02:05:18,884][52060] Updated weights for policy 0, policy_version 53770 (0.0008) [2023-10-08 02:05:19,248][52060] Updated weights for policy 0, policy_version 53780 (0.0009) [2023-10-08 02:05:19,623][52060] Updated weights for policy 0, policy_version 53790 (0.0007) [2023-10-08 02:05:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 110854144. Throughput: 0: 1726.9, 1: 1713.3. Samples: 27717756. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 02:05:21,211][50642] Avg episode reward: [(0, '20.660'), (1, '22.170')] [2023-10-08 02:05:22,364][52059] Updated weights for policy 1, policy_version 54472 (0.0009) [2023-10-08 02:05:22,718][52059] Updated weights for policy 1, policy_version 54482 (0.0008) [2023-10-08 02:05:23,088][52059] Updated weights for policy 1, policy_version 54492 (0.0011) [2023-10-08 02:05:23,641][52060] Updated weights for policy 0, policy_version 53800 (0.0009) [2023-10-08 02:05:24,004][52060] Updated weights for policy 0, policy_version 53810 (0.0009) [2023-10-08 02:05:24,388][52060] Updated weights for policy 0, policy_version 53820 (0.0010) [2023-10-08 02:05:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 110919680. Throughput: 0: 1700.6, 1: 1722.8. Samples: 27737988. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 02:05:26,211][50642] Avg episode reward: [(0, '21.130'), (1, '25.370')] [2023-10-08 02:05:27,049][52059] Updated weights for policy 1, policy_version 54502 (0.0008) [2023-10-08 02:05:27,424][52059] Updated weights for policy 1, policy_version 54512 (0.0010) [2023-10-08 02:05:27,802][52059] Updated weights for policy 1, policy_version 54522 (0.0008) [2023-10-08 02:05:28,264][52060] Updated weights for policy 0, policy_version 53830 (0.0008) [2023-10-08 02:05:28,630][52060] Updated weights for policy 0, policy_version 53840 (0.0009) [2023-10-08 02:05:29,003][52060] Updated weights for policy 0, policy_version 53850 (0.0008) [2023-10-08 02:05:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 110985216. Throughput: 0: 1721.2, 1: 1742.0. Samples: 27759466. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 02:05:31,211][50642] Avg episode reward: [(0, '22.100'), (1, '23.830')] [2023-10-08 02:05:31,751][52059] Updated weights for policy 1, policy_version 54532 (0.0008) [2023-10-08 02:05:32,149][52059] Updated weights for policy 1, policy_version 54542 (0.0011) [2023-10-08 02:05:32,514][52059] Updated weights for policy 1, policy_version 54552 (0.0009) [2023-10-08 02:05:32,900][52060] Updated weights for policy 0, policy_version 53860 (0.0009) [2023-10-08 02:05:33,288][52060] Updated weights for policy 0, policy_version 53870 (0.0009) [2023-10-08 02:05:33,654][52060] Updated weights for policy 0, policy_version 53880 (0.0010) [2023-10-08 02:05:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 111050752. Throughput: 0: 1711.5, 1: 1712.0. Samples: 27768874. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 02:05:36,211][52059] Updated weights for policy 1, policy_version 54562 (0.0007) [2023-10-08 02:05:36,211][50642] Avg episode reward: [(0, '22.890'), (1, '23.150')] [2023-10-08 02:05:36,577][52059] Updated weights for policy 1, policy_version 54572 (0.0008) [2023-10-08 02:05:36,938][52059] Updated weights for policy 1, policy_version 54582 (0.0007) [2023-10-08 02:05:37,308][52059] Updated weights for policy 1, policy_version 54592 (0.0007) [2023-10-08 02:05:37,398][52060] Updated weights for policy 0, policy_version 53890 (0.0008) [2023-10-08 02:05:37,766][52060] Updated weights for policy 0, policy_version 53900 (0.0008) [2023-10-08 02:05:38,128][52060] Updated weights for policy 0, policy_version 53910 (0.0009) [2023-10-08 02:05:38,493][52060] Updated weights for policy 0, policy_version 53920 (0.0011) [2023-10-08 02:05:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 111116288. Throughput: 0: 1708.9, 1: 1740.7. Samples: 27790142. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 02:05:41,211][50642] Avg episode reward: [(0, '19.950'), (1, '23.870')] [2023-10-08 02:05:41,244][52059] Updated weights for policy 1, policy_version 54602 (0.0008) [2023-10-08 02:05:41,609][52059] Updated weights for policy 1, policy_version 54612 (0.0008) [2023-10-08 02:05:41,965][52059] Updated weights for policy 1, policy_version 54622 (0.0007) [2023-10-08 02:05:42,675][52060] Updated weights for policy 0, policy_version 53930 (0.0007) [2023-10-08 02:05:43,045][52060] Updated weights for policy 0, policy_version 53940 (0.0009) [2023-10-08 02:05:43,412][52060] Updated weights for policy 0, policy_version 53950 (0.0008) [2023-10-08 02:05:45,830][52059] Updated weights for policy 1, policy_version 54632 (0.0010) [2023-10-08 02:05:46,189][52059] Updated weights for policy 1, policy_version 54642 (0.0008) [2023-10-08 02:05:46,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 111181824. Throughput: 0: 1735.1, 1: 1742.1. Samples: 27811320. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 02:05:46,211][50642] Avg episode reward: [(0, '19.640'), (1, '23.540')] [2023-10-08 02:05:46,549][52059] Updated weights for policy 1, policy_version 54652 (0.0008) [2023-10-08 02:05:47,318][52060] Updated weights for policy 0, policy_version 53960 (0.0007) [2023-10-08 02:05:47,686][52060] Updated weights for policy 0, policy_version 53970 (0.0008) [2023-10-08 02:05:48,049][52060] Updated weights for policy 0, policy_version 53980 (0.0009) [2023-10-08 02:05:50,569][52059] Updated weights for policy 1, policy_version 54662 (0.0008) [2023-10-08 02:05:50,936][52059] Updated weights for policy 1, policy_version 54672 (0.0008) [2023-10-08 02:05:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 111247360. Throughput: 0: 1703.2, 1: 1737.7. Samples: 27820952. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 02:05:51,211][50642] Avg episode reward: [(0, '21.120'), (1, '24.540')] [2023-10-08 02:05:51,296][52059] Updated weights for policy 1, policy_version 54682 (0.0008) [2023-10-08 02:05:52,061][52060] Updated weights for policy 0, policy_version 53990 (0.0009) [2023-10-08 02:05:52,422][52060] Updated weights for policy 0, policy_version 54000 (0.0008) [2023-10-08 02:05:52,798][52060] Updated weights for policy 0, policy_version 54010 (0.0009) [2023-10-08 02:05:55,206][52059] Updated weights for policy 1, policy_version 54692 (0.0007) [2023-10-08 02:05:55,563][52059] Updated weights for policy 1, policy_version 54702 (0.0007) [2023-10-08 02:05:55,935][52059] Updated weights for policy 1, policy_version 54712 (0.0009) [2023-10-08 02:05:56,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 111312896. Throughput: 0: 1726.9, 1: 1748.1. Samples: 27842610. Policy #0 lag: (min: 31.0, avg: 38.1, max: 63.0) [2023-10-08 02:05:56,211][50642] Avg episode reward: [(0, '20.560'), (1, '21.270')] [2023-10-08 02:05:56,760][52060] Updated weights for policy 0, policy_version 54020 (0.0009) [2023-10-08 02:05:57,132][52060] Updated weights for policy 0, policy_version 54030 (0.0010) [2023-10-08 02:05:57,505][52060] Updated weights for policy 0, policy_version 54040 (0.0009) [2023-10-08 02:05:59,786][52059] Updated weights for policy 1, policy_version 54722 (0.0009) [2023-10-08 02:06:00,154][52059] Updated weights for policy 1, policy_version 54732 (0.0010) [2023-10-08 02:06:00,526][52059] Updated weights for policy 1, policy_version 54742 (0.0008) [2023-10-08 02:06:00,889][52059] Updated weights for policy 1, policy_version 54752 (0.0009) [2023-10-08 02:06:01,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 111411200. Throughput: 0: 1727.1, 1: 1724.3. Samples: 27862748. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:06:01,211][50642] Avg episode reward: [(0, '19.610'), (1, '25.350')] [2023-10-08 02:06:01,579][52060] Updated weights for policy 0, policy_version 54050 (0.0008) [2023-10-08 02:06:01,945][52060] Updated weights for policy 0, policy_version 54060 (0.0009) [2023-10-08 02:06:02,311][52060] Updated weights for policy 0, policy_version 54070 (0.0007) [2023-10-08 02:06:02,680][52060] Updated weights for policy 0, policy_version 54080 (0.0010) [2023-10-08 02:06:04,722][52059] Updated weights for policy 1, policy_version 54762 (0.0007) [2023-10-08 02:06:05,086][52059] Updated weights for policy 1, policy_version 54772 (0.0009) [2023-10-08 02:06:05,439][52059] Updated weights for policy 1, policy_version 54782 (0.0008) [2023-10-08 02:06:06,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 111476736. Throughput: 0: 1706.0, 1: 1754.4. Samples: 27873474. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:06:06,211][50642] Avg episode reward: [(0, '20.530'), (1, '24.850')] [2023-10-08 02:06:06,654][52060] Updated weights for policy 0, policy_version 54090 (0.0007) [2023-10-08 02:06:07,022][52060] Updated weights for policy 0, policy_version 54100 (0.0009) [2023-10-08 02:06:07,394][52060] Updated weights for policy 0, policy_version 54110 (0.0009) [2023-10-08 02:06:09,289][52059] Updated weights for policy 1, policy_version 54792 (0.0009) [2023-10-08 02:06:09,657][52059] Updated weights for policy 1, policy_version 54802 (0.0007) [2023-10-08 02:06:10,022][52059] Updated weights for policy 1, policy_version 54812 (0.0007) [2023-10-08 02:06:11,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 111542272. Throughput: 0: 1733.1, 1: 1737.6. Samples: 27894166. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:06:11,211][50642] Avg episode reward: [(0, '20.850'), (1, '24.370')] [2023-10-08 02:06:11,274][52060] Updated weights for policy 0, policy_version 54120 (0.0010) [2023-10-08 02:06:11,640][52060] Updated weights for policy 0, policy_version 54130 (0.0009) [2023-10-08 02:06:12,010][52060] Updated weights for policy 0, policy_version 54140 (0.0011) [2023-10-08 02:06:13,996][52059] Updated weights for policy 1, policy_version 54822 (0.0008) [2023-10-08 02:06:14,367][52059] Updated weights for policy 1, policy_version 54832 (0.0009) [2023-10-08 02:06:14,725][52059] Updated weights for policy 1, policy_version 54842 (0.0009) [2023-10-08 02:06:16,047][52060] Updated weights for policy 0, policy_version 54150 (0.0010) [2023-10-08 02:06:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 111607808. Throughput: 0: 1726.5, 1: 1729.4. Samples: 27914980. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:06:16,211][50642] Avg episode reward: [(0, '21.850'), (1, '23.380')] [2023-10-08 02:06:16,418][52060] Updated weights for policy 0, policy_version 54160 (0.0010) [2023-10-08 02:06:16,781][52060] Updated weights for policy 0, policy_version 54170 (0.0008) [2023-10-08 02:06:18,657][52059] Updated weights for policy 1, policy_version 54852 (0.0009) [2023-10-08 02:06:19,041][52059] Updated weights for policy 1, policy_version 54862 (0.0008) [2023-10-08 02:06:19,404][52059] Updated weights for policy 1, policy_version 54872 (0.0007) [2023-10-08 02:06:20,801][52060] Updated weights for policy 0, policy_version 54180 (0.0008) [2023-10-08 02:06:21,182][52060] Updated weights for policy 0, policy_version 54190 (0.0009) [2023-10-08 02:06:21,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 111673344. Throughput: 0: 1724.9, 1: 1753.6. Samples: 27925408. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:06:21,211][50642] Avg episode reward: [(0, '21.130'), (1, '24.590')] [2023-10-08 02:06:21,558][52060] Updated weights for policy 0, policy_version 54200 (0.0008) [2023-10-08 02:06:23,418][52059] Updated weights for policy 1, policy_version 54882 (0.0008) [2023-10-08 02:06:23,776][52059] Updated weights for policy 1, policy_version 54892 (0.0009) [2023-10-08 02:06:24,139][52059] Updated weights for policy 1, policy_version 54902 (0.0009) [2023-10-08 02:06:24,498][52059] Updated weights for policy 1, policy_version 54912 (0.0008) [2023-10-08 02:06:25,462][52060] Updated weights for policy 0, policy_version 54210 (0.0008) [2023-10-08 02:06:25,829][52060] Updated weights for policy 0, policy_version 54220 (0.0009) [2023-10-08 02:06:26,194][52060] Updated weights for policy 0, policy_version 54230 (0.0007) [2023-10-08 02:06:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 111738880. Throughput: 0: 1727.7, 1: 1729.2. Samples: 27945702. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:06:26,211][50642] Avg episode reward: [(0, '20.510'), (1, '24.800')] [2023-10-08 02:06:26,565][52060] Updated weights for policy 0, policy_version 54240 (0.0010) [2023-10-08 02:06:28,341][52059] Updated weights for policy 1, policy_version 54922 (0.0007) [2023-10-08 02:06:28,696][52059] Updated weights for policy 1, policy_version 54932 (0.0007) [2023-10-08 02:06:29,070][52059] Updated weights for policy 1, policy_version 54942 (0.0009) [2023-10-08 02:06:30,421][52060] Updated weights for policy 0, policy_version 54250 (0.0007) [2023-10-08 02:06:30,786][52060] Updated weights for policy 0, policy_version 54260 (0.0007) [2023-10-08 02:06:31,152][52060] Updated weights for policy 0, policy_version 54270 (0.0011) [2023-10-08 02:06:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 111804416. Throughput: 0: 1711.2, 1: 1737.9. Samples: 27966526. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:06:31,211][50642] Avg episode reward: [(0, '21.450'), (1, '23.630')] [2023-10-08 02:06:33,083][52059] Updated weights for policy 1, policy_version 54952 (0.0007) [2023-10-08 02:06:33,457][52059] Updated weights for policy 1, policy_version 54962 (0.0009) [2023-10-08 02:06:33,830][52059] Updated weights for policy 1, policy_version 54972 (0.0011) [2023-10-08 02:06:35,175][52060] Updated weights for policy 0, policy_version 54280 (0.0011) [2023-10-08 02:06:35,544][52060] Updated weights for policy 0, policy_version 54290 (0.0010) [2023-10-08 02:06:35,912][52060] Updated weights for policy 0, policy_version 54300 (0.0010) [2023-10-08 02:06:36,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 111902720. Throughput: 0: 1732.6, 1: 1733.0. Samples: 27976904. Policy #0 lag: (min: 19.0, avg: 22.6, max: 51.0) [2023-10-08 02:06:36,211][50642] Avg episode reward: [(0, '21.110'), (1, '22.450')] [2023-10-08 02:06:37,739][52059] Updated weights for policy 1, policy_version 54982 (0.0010) [2023-10-08 02:06:38,107][52059] Updated weights for policy 1, policy_version 54992 (0.0007) [2023-10-08 02:06:38,461][52059] Updated weights for policy 1, policy_version 55002 (0.0007) [2023-10-08 02:06:39,714][52060] Updated weights for policy 0, policy_version 54310 (0.0009) [2023-10-08 02:06:40,076][52060] Updated weights for policy 0, policy_version 54320 (0.0009) [2023-10-08 02:06:40,441][52060] Updated weights for policy 0, policy_version 54330 (0.0009) [2023-10-08 02:06:41,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 111968256. Throughput: 0: 1725.7, 1: 1723.9. Samples: 27997844. Policy #0 lag: (min: 19.0, avg: 22.6, max: 51.0) [2023-10-08 02:06:41,211][50642] Avg episode reward: [(0, '20.480'), (1, '22.060')] [2023-10-08 02:06:42,331][52059] Updated weights for policy 1, policy_version 55012 (0.0007) [2023-10-08 02:06:42,700][52059] Updated weights for policy 1, policy_version 55022 (0.0008) [2023-10-08 02:06:43,067][52059] Updated weights for policy 1, policy_version 55032 (0.0008) [2023-10-08 02:06:44,543][52060] Updated weights for policy 0, policy_version 54340 (0.0010) [2023-10-08 02:06:44,909][52060] Updated weights for policy 0, policy_version 54350 (0.0008) [2023-10-08 02:06:45,277][52060] Updated weights for policy 0, policy_version 54360 (0.0008) [2023-10-08 02:06:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 112033792. Throughput: 0: 1699.9, 1: 1752.9. Samples: 28018124. Policy #0 lag: (min: 19.0, avg: 22.6, max: 51.0) [2023-10-08 02:06:46,211][50642] Avg episode reward: [(0, '21.630'), (1, '21.880')] [2023-10-08 02:06:46,218][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000055040_56360960.pth... [2023-10-08 02:06:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000054368_55672832.pth... [2023-10-08 02:06:46,249][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000053440_54722560.pth [2023-10-08 02:06:46,250][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000052768_54034432.pth [2023-10-08 02:06:47,012][52059] Updated weights for policy 1, policy_version 55042 (0.0008) [2023-10-08 02:06:47,384][52059] Updated weights for policy 1, policy_version 55052 (0.0008) [2023-10-08 02:06:47,750][52059] Updated weights for policy 1, policy_version 55062 (0.0007) [2023-10-08 02:06:48,107][52059] Updated weights for policy 1, policy_version 55072 (0.0010) [2023-10-08 02:06:49,443][52060] Updated weights for policy 0, policy_version 54370 (0.0008) [2023-10-08 02:06:49,806][52060] Updated weights for policy 0, policy_version 54380 (0.0009) [2023-10-08 02:06:50,177][52060] Updated weights for policy 0, policy_version 54390 (0.0007) [2023-10-08 02:06:50,537][52060] Updated weights for policy 0, policy_version 54400 (0.0007) [2023-10-08 02:06:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 112099328. Throughput: 0: 1727.0, 1: 1722.9. Samples: 28028720. Policy #0 lag: (min: 19.0, avg: 22.6, max: 51.0) [2023-10-08 02:06:51,211][50642] Avg episode reward: [(0, '21.610'), (1, '23.170')] [2023-10-08 02:06:51,997][52059] Updated weights for policy 1, policy_version 55082 (0.0008) [2023-10-08 02:06:52,367][52059] Updated weights for policy 1, policy_version 55092 (0.0007) [2023-10-08 02:06:52,723][52059] Updated weights for policy 1, policy_version 55102 (0.0008) [2023-10-08 02:06:54,384][52060] Updated weights for policy 0, policy_version 54410 (0.0008) [2023-10-08 02:06:54,753][52060] Updated weights for policy 0, policy_version 54420 (0.0007) [2023-10-08 02:06:55,125][52060] Updated weights for policy 0, policy_version 54430 (0.0008) [2023-10-08 02:06:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 112164864. Throughput: 0: 1707.1, 1: 1739.9. Samples: 28049282. Policy #0 lag: (min: 19.0, avg: 22.6, max: 51.0) [2023-10-08 02:06:56,211][50642] Avg episode reward: [(0, '20.430'), (1, '19.990')] [2023-10-08 02:06:56,672][52059] Updated weights for policy 1, policy_version 55112 (0.0008) [2023-10-08 02:06:57,034][52059] Updated weights for policy 1, policy_version 55122 (0.0009) [2023-10-08 02:06:57,405][52059] Updated weights for policy 1, policy_version 55132 (0.0008) [2023-10-08 02:06:59,028][52060] Updated weights for policy 0, policy_version 54440 (0.0008) [2023-10-08 02:06:59,408][52060] Updated weights for policy 0, policy_version 54450 (0.0010) [2023-10-08 02:06:59,781][52060] Updated weights for policy 0, policy_version 54460 (0.0011) [2023-10-08 02:07:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 112230400. Throughput: 0: 1698.4, 1: 1749.0. Samples: 28070114. Policy #0 lag: (min: 19.0, avg: 22.6, max: 51.0) [2023-10-08 02:07:01,211][50642] Avg episode reward: [(0, '20.650'), (1, '21.030')] [2023-10-08 02:07:01,304][52059] Updated weights for policy 1, policy_version 55142 (0.0009) [2023-10-08 02:07:01,676][52059] Updated weights for policy 1, policy_version 55152 (0.0010) [2023-10-08 02:07:02,047][52059] Updated weights for policy 1, policy_version 55162 (0.0008) [2023-10-08 02:07:03,644][52060] Updated weights for policy 0, policy_version 54470 (0.0008) [2023-10-08 02:07:04,010][52060] Updated weights for policy 0, policy_version 54480 (0.0008) [2023-10-08 02:07:04,379][52060] Updated weights for policy 0, policy_version 54490 (0.0010) [2023-10-08 02:07:06,035][52059] Updated weights for policy 1, policy_version 55172 (0.0008) [2023-10-08 02:07:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 112295936. Throughput: 0: 1719.1, 1: 1727.6. Samples: 28080510. Policy #0 lag: (min: 19.0, avg: 22.6, max: 51.0) [2023-10-08 02:07:06,211][50642] Avg episode reward: [(0, '20.530'), (1, '21.530')] [2023-10-08 02:07:06,439][52059] Updated weights for policy 1, policy_version 55182 (0.0007) [2023-10-08 02:07:06,806][52059] Updated weights for policy 1, policy_version 55192 (0.0007) [2023-10-08 02:07:08,340][52060] Updated weights for policy 0, policy_version 54500 (0.0010) [2023-10-08 02:07:08,730][52060] Updated weights for policy 0, policy_version 54510 (0.0009) [2023-10-08 02:07:09,095][52060] Updated weights for policy 0, policy_version 54520 (0.0010) [2023-10-08 02:07:10,668][52059] Updated weights for policy 1, policy_version 55202 (0.0008) [2023-10-08 02:07:11,046][52059] Updated weights for policy 1, policy_version 55212 (0.0008) [2023-10-08 02:07:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 112361472. Throughput: 0: 1699.7, 1: 1750.2. Samples: 28100948. Policy #0 lag: (min: 19.0, avg: 22.6, max: 51.0) [2023-10-08 02:07:11,211][50642] Avg episode reward: [(0, '19.440'), (1, '21.280')] [2023-10-08 02:07:11,404][52059] Updated weights for policy 1, policy_version 55222 (0.0008) [2023-10-08 02:07:11,771][52059] Updated weights for policy 1, policy_version 55232 (0.0008) [2023-10-08 02:07:13,073][52060] Updated weights for policy 0, policy_version 54530 (0.0009) [2023-10-08 02:07:13,449][52060] Updated weights for policy 0, policy_version 54540 (0.0008) [2023-10-08 02:07:13,827][52060] Updated weights for policy 0, policy_version 54550 (0.0009) [2023-10-08 02:07:14,193][52060] Updated weights for policy 0, policy_version 54560 (0.0009) [2023-10-08 02:07:15,632][52059] Updated weights for policy 1, policy_version 55242 (0.0011) [2023-10-08 02:07:15,998][52059] Updated weights for policy 1, policy_version 55252 (0.0010) [2023-10-08 02:07:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 112427008. Throughput: 0: 1718.0, 1: 1730.4. Samples: 28121702. Policy #0 lag: (min: 31.0, avg: 31.3, max: 43.0) [2023-10-08 02:07:16,211][50642] Avg episode reward: [(0, '19.970'), (1, '23.480')] [2023-10-08 02:07:16,357][52059] Updated weights for policy 1, policy_version 55262 (0.0010) [2023-10-08 02:07:18,069][52060] Updated weights for policy 0, policy_version 54570 (0.0008) [2023-10-08 02:07:18,442][52060] Updated weights for policy 0, policy_version 54580 (0.0007) [2023-10-08 02:07:18,807][52060] Updated weights for policy 0, policy_version 54590 (0.0007) [2023-10-08 02:07:20,403][52059] Updated weights for policy 1, policy_version 55272 (0.0008) [2023-10-08 02:07:20,778][52059] Updated weights for policy 1, policy_version 55282 (0.0010) [2023-10-08 02:07:21,139][52059] Updated weights for policy 1, policy_version 55292 (0.0009) [2023-10-08 02:07:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 112492544. Throughput: 0: 1703.3, 1: 1736.8. Samples: 28131710. Policy #0 lag: (min: 31.0, avg: 31.3, max: 43.0) [2023-10-08 02:07:21,211][50642] Avg episode reward: [(0, '19.310'), (1, '21.880')] [2023-10-08 02:07:22,751][52060] Updated weights for policy 0, policy_version 54600 (0.0008) [2023-10-08 02:07:23,122][52060] Updated weights for policy 0, policy_version 54610 (0.0009) [2023-10-08 02:07:23,491][52060] Updated weights for policy 0, policy_version 54620 (0.0007) [2023-10-08 02:07:24,999][52059] Updated weights for policy 1, policy_version 55302 (0.0009) [2023-10-08 02:07:25,365][52059] Updated weights for policy 1, policy_version 55312 (0.0009) [2023-10-08 02:07:25,734][52059] Updated weights for policy 1, policy_version 55322 (0.0008) [2023-10-08 02:07:26,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 112590848. Throughput: 0: 1704.4, 1: 1742.5. Samples: 28152956. Policy #0 lag: (min: 31.0, avg: 31.3, max: 43.0) [2023-10-08 02:07:26,211][50642] Avg episode reward: [(0, '16.450'), (1, '23.780')] [2023-10-08 02:07:27,508][52060] Updated weights for policy 0, policy_version 54630 (0.0009) [2023-10-08 02:07:27,886][52060] Updated weights for policy 0, policy_version 54640 (0.0009) [2023-10-08 02:07:28,246][52060] Updated weights for policy 0, policy_version 54650 (0.0008) [2023-10-08 02:07:29,627][52059] Updated weights for policy 1, policy_version 55332 (0.0007) [2023-10-08 02:07:29,988][52059] Updated weights for policy 1, policy_version 55342 (0.0007) [2023-10-08 02:07:30,353][52059] Updated weights for policy 1, policy_version 55352 (0.0007) [2023-10-08 02:07:31,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 112656384. Throughput: 0: 1730.8, 1: 1712.8. Samples: 28173082. Policy #0 lag: (min: 31.0, avg: 31.3, max: 43.0) [2023-10-08 02:07:31,211][50642] Avg episode reward: [(0, '15.800'), (1, '22.600')] [2023-10-08 02:07:32,334][52060] Updated weights for policy 0, policy_version 54660 (0.0009) [2023-10-08 02:07:32,698][52060] Updated weights for policy 0, policy_version 54670 (0.0007) [2023-10-08 02:07:33,067][52060] Updated weights for policy 0, policy_version 54680 (0.0007) [2023-10-08 02:07:34,299][52059] Updated weights for policy 1, policy_version 55362 (0.0008) [2023-10-08 02:07:34,667][52059] Updated weights for policy 1, policy_version 55372 (0.0009) [2023-10-08 02:07:35,027][52059] Updated weights for policy 1, policy_version 55382 (0.0007) [2023-10-08 02:07:35,391][52059] Updated weights for policy 1, policy_version 55392 (0.0008) [2023-10-08 02:07:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 112721920. Throughput: 0: 1700.8, 1: 1745.0. Samples: 28183784. Policy #0 lag: (min: 31.0, avg: 31.3, max: 43.0) [2023-10-08 02:07:36,211][50642] Avg episode reward: [(0, '18.790'), (1, '24.080')] [2023-10-08 02:07:37,085][52060] Updated weights for policy 0, policy_version 54690 (0.0008) [2023-10-08 02:07:37,456][52060] Updated weights for policy 0, policy_version 54700 (0.0007) [2023-10-08 02:07:37,827][52060] Updated weights for policy 0, policy_version 54710 (0.0009) [2023-10-08 02:07:38,190][52060] Updated weights for policy 0, policy_version 54720 (0.0011) [2023-10-08 02:07:39,184][52059] Updated weights for policy 1, policy_version 55402 (0.0009) [2023-10-08 02:07:39,549][52059] Updated weights for policy 1, policy_version 55412 (0.0009) [2023-10-08 02:07:39,902][52059] Updated weights for policy 1, policy_version 55422 (0.0010) [2023-10-08 02:07:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13884.8). Total num frames: 112787456. Throughput: 0: 1720.0, 1: 1724.0. Samples: 28204264. Policy #0 lag: (min: 31.0, avg: 31.3, max: 43.0) [2023-10-08 02:07:41,211][50642] Avg episode reward: [(0, '18.820'), (1, '23.340')] [2023-10-08 02:07:42,245][52060] Updated weights for policy 0, policy_version 54730 (0.0008) [2023-10-08 02:07:42,624][52060] Updated weights for policy 0, policy_version 54740 (0.0008) [2023-10-08 02:07:42,990][52060] Updated weights for policy 0, policy_version 54750 (0.0007) [2023-10-08 02:07:43,910][52059] Updated weights for policy 1, policy_version 55432 (0.0010) [2023-10-08 02:07:44,274][52059] Updated weights for policy 1, policy_version 55442 (0.0010) [2023-10-08 02:07:44,635][52059] Updated weights for policy 1, policy_version 55452 (0.0008) [2023-10-08 02:07:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 112852992. Throughput: 0: 1726.5, 1: 1721.1. Samples: 28225260. Policy #0 lag: (min: 31.0, avg: 31.3, max: 43.0) [2023-10-08 02:07:46,211][50642] Avg episode reward: [(0, '17.740'), (1, '24.900')] [2023-10-08 02:07:46,859][52060] Updated weights for policy 0, policy_version 54760 (0.0007) [2023-10-08 02:07:47,226][52060] Updated weights for policy 0, policy_version 54770 (0.0008) [2023-10-08 02:07:47,597][52060] Updated weights for policy 0, policy_version 54780 (0.0010) [2023-10-08 02:07:48,628][52059] Updated weights for policy 1, policy_version 55462 (0.0008) [2023-10-08 02:07:49,001][52059] Updated weights for policy 1, policy_version 55472 (0.0008) [2023-10-08 02:07:49,358][52059] Updated weights for policy 1, policy_version 55482 (0.0007) [2023-10-08 02:07:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 112918528. Throughput: 0: 1705.0, 1: 1737.5. Samples: 28235420. Policy #0 lag: (min: 31.0, avg: 31.3, max: 43.0) [2023-10-08 02:07:51,211][50642] Avg episode reward: [(0, '19.030'), (1, '21.420')] [2023-10-08 02:07:51,495][52060] Updated weights for policy 0, policy_version 54790 (0.0009) [2023-10-08 02:07:51,862][52060] Updated weights for policy 0, policy_version 54800 (0.0008) [2023-10-08 02:07:52,227][52060] Updated weights for policy 0, policy_version 54810 (0.0010) [2023-10-08 02:07:53,217][52059] Updated weights for policy 1, policy_version 55492 (0.0007) [2023-10-08 02:07:53,574][52059] Updated weights for policy 1, policy_version 55502 (0.0008) [2023-10-08 02:07:53,939][52059] Updated weights for policy 1, policy_version 55512 (0.0009) [2023-10-08 02:07:56,203][52060] Updated weights for policy 0, policy_version 54820 (0.0009) [2023-10-08 02:07:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 112984064. Throughput: 0: 1731.6, 1: 1721.5. Samples: 28256338. Policy #0 lag: (min: 29.0, avg: 37.9, max: 61.0) [2023-10-08 02:07:56,211][50642] Avg episode reward: [(0, '23.130'), (1, '21.320')] [2023-10-08 02:07:56,585][52060] Updated weights for policy 0, policy_version 54830 (0.0010) [2023-10-08 02:07:56,953][52060] Updated weights for policy 0, policy_version 54840 (0.0008) [2023-10-08 02:07:57,842][52059] Updated weights for policy 1, policy_version 55522 (0.0008) [2023-10-08 02:07:58,235][52059] Updated weights for policy 1, policy_version 55532 (0.0007) [2023-10-08 02:07:58,602][52059] Updated weights for policy 1, policy_version 55542 (0.0007) [2023-10-08 02:07:58,959][52059] Updated weights for policy 1, policy_version 55552 (0.0009) [2023-10-08 02:08:00,865][52060] Updated weights for policy 0, policy_version 54850 (0.0007) [2023-10-08 02:08:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 113049600. Throughput: 0: 1724.1, 1: 1738.9. Samples: 28277538. Policy #0 lag: (min: 29.0, avg: 37.9, max: 61.0) [2023-10-08 02:08:01,211][50642] Avg episode reward: [(0, '20.770'), (1, '20.270')] [2023-10-08 02:08:01,245][52060] Updated weights for policy 0, policy_version 54860 (0.0007) [2023-10-08 02:08:01,617][52060] Updated weights for policy 0, policy_version 54870 (0.0009) [2023-10-08 02:08:01,980][52060] Updated weights for policy 0, policy_version 54880 (0.0008) [2023-10-08 02:08:02,661][52059] Updated weights for policy 1, policy_version 55562 (0.0008) [2023-10-08 02:08:03,019][52059] Updated weights for policy 1, policy_version 55572 (0.0011) [2023-10-08 02:08:03,386][52059] Updated weights for policy 1, policy_version 55582 (0.0011) [2023-10-08 02:08:05,961][52060] Updated weights for policy 0, policy_version 54890 (0.0008) [2023-10-08 02:08:06,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 113115136. Throughput: 0: 1722.2, 1: 1728.1. Samples: 28286972. Policy #0 lag: (min: 29.0, avg: 37.9, max: 61.0) [2023-10-08 02:08:06,211][50642] Avg episode reward: [(0, '18.740'), (1, '20.740')] [2023-10-08 02:08:06,332][52060] Updated weights for policy 0, policy_version 54900 (0.0007) [2023-10-08 02:08:06,695][52060] Updated weights for policy 0, policy_version 54910 (0.0007) [2023-10-08 02:08:07,407][52059] Updated weights for policy 1, policy_version 55592 (0.0011) [2023-10-08 02:08:07,768][52059] Updated weights for policy 1, policy_version 55602 (0.0009) [2023-10-08 02:08:08,133][52059] Updated weights for policy 1, policy_version 55612 (0.0009) [2023-10-08 02:08:10,428][52060] Updated weights for policy 0, policy_version 54920 (0.0009) [2023-10-08 02:08:10,795][52060] Updated weights for policy 0, policy_version 54930 (0.0010) [2023-10-08 02:08:11,160][52060] Updated weights for policy 0, policy_version 54940 (0.0011) [2023-10-08 02:08:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 113180672. Throughput: 0: 1732.1, 1: 1728.8. Samples: 28308694. Policy #0 lag: (min: 29.0, avg: 37.9, max: 61.0) [2023-10-08 02:08:11,211][50642] Avg episode reward: [(0, '20.890'), (1, '19.910')] [2023-10-08 02:08:12,141][52059] Updated weights for policy 1, policy_version 55622 (0.0008) [2023-10-08 02:08:12,504][52059] Updated weights for policy 1, policy_version 55632 (0.0007) [2023-10-08 02:08:12,865][52059] Updated weights for policy 1, policy_version 55642 (0.0007) [2023-10-08 02:08:15,162][52060] Updated weights for policy 0, policy_version 54950 (0.0008) [2023-10-08 02:08:15,525][52060] Updated weights for policy 0, policy_version 54960 (0.0008) [2023-10-08 02:08:15,893][52060] Updated weights for policy 0, policy_version 54970 (0.0008) [2023-10-08 02:08:16,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 113278976. Throughput: 0: 1711.8, 1: 1754.1. Samples: 28329046. Policy #0 lag: (min: 29.0, avg: 37.9, max: 61.0) [2023-10-08 02:08:16,211][50642] Avg episode reward: [(0, '23.780'), (1, '19.740')] [2023-10-08 02:08:16,718][52059] Updated weights for policy 1, policy_version 55652 (0.0009) [2023-10-08 02:08:17,085][52059] Updated weights for policy 1, policy_version 55662 (0.0007) [2023-10-08 02:08:17,459][52059] Updated weights for policy 1, policy_version 55672 (0.0008) [2023-10-08 02:08:20,025][52060] Updated weights for policy 0, policy_version 54980 (0.0009) [2023-10-08 02:08:20,390][52060] Updated weights for policy 0, policy_version 54990 (0.0010) [2023-10-08 02:08:20,756][52060] Updated weights for policy 0, policy_version 55000 (0.0009) [2023-10-08 02:08:21,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 113344512. Throughput: 0: 1735.2, 1: 1718.9. Samples: 28339216. Policy #0 lag: (min: 29.0, avg: 37.9, max: 61.0) [2023-10-08 02:08:21,211][50642] Avg episode reward: [(0, '17.800'), (1, '24.000')] [2023-10-08 02:08:21,539][52059] Updated weights for policy 1, policy_version 55682 (0.0007) [2023-10-08 02:08:21,905][52059] Updated weights for policy 1, policy_version 55692 (0.0007) [2023-10-08 02:08:22,263][52059] Updated weights for policy 1, policy_version 55702 (0.0008) [2023-10-08 02:08:22,625][52059] Updated weights for policy 1, policy_version 55712 (0.0008) [2023-10-08 02:08:24,787][52060] Updated weights for policy 0, policy_version 55010 (0.0007) [2023-10-08 02:08:25,158][52060] Updated weights for policy 0, policy_version 55020 (0.0008) [2023-10-08 02:08:25,532][52060] Updated weights for policy 0, policy_version 55030 (0.0008) [2023-10-08 02:08:25,908][52060] Updated weights for policy 0, policy_version 55040 (0.0010) [2023-10-08 02:08:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 113410048. Throughput: 0: 1726.7, 1: 1745.5. Samples: 28360512. Policy #0 lag: (min: 29.0, avg: 37.9, max: 61.0) [2023-10-08 02:08:26,211][50642] Avg episode reward: [(0, '20.290'), (1, '25.570')] [2023-10-08 02:08:26,541][52059] Updated weights for policy 1, policy_version 55722 (0.0008) [2023-10-08 02:08:26,908][52059] Updated weights for policy 1, policy_version 55732 (0.0009) [2023-10-08 02:08:27,264][52059] Updated weights for policy 1, policy_version 55742 (0.0009) [2023-10-08 02:08:29,734][52060] Updated weights for policy 0, policy_version 55050 (0.0008) [2023-10-08 02:08:30,100][52060] Updated weights for policy 0, policy_version 55060 (0.0009) [2023-10-08 02:08:30,472][52060] Updated weights for policy 0, policy_version 55070 (0.0008) [2023-10-08 02:08:31,109][52059] Updated weights for policy 1, policy_version 55752 (0.0008) [2023-10-08 02:08:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 113475584. Throughput: 0: 1705.0, 1: 1750.2. Samples: 28380746. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) [2023-10-08 02:08:31,211][50642] Avg episode reward: [(0, '23.330'), (1, '22.230')] [2023-10-08 02:08:31,479][52059] Updated weights for policy 1, policy_version 55762 (0.0009) [2023-10-08 02:08:31,838][52059] Updated weights for policy 1, policy_version 55772 (0.0008) [2023-10-08 02:08:34,562][52060] Updated weights for policy 0, policy_version 55080 (0.0008) [2023-10-08 02:08:34,938][52060] Updated weights for policy 0, policy_version 55090 (0.0008) [2023-10-08 02:08:35,307][52060] Updated weights for policy 0, policy_version 55100 (0.0008) [2023-10-08 02:08:35,654][52059] Updated weights for policy 1, policy_version 55782 (0.0008) [2023-10-08 02:08:36,021][52059] Updated weights for policy 1, policy_version 55792 (0.0007) [2023-10-08 02:08:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 113541120. Throughput: 0: 1738.0, 1: 1735.9. Samples: 28391748. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) [2023-10-08 02:08:36,211][50642] Avg episode reward: [(0, '21.170'), (1, '20.490')] [2023-10-08 02:08:36,374][52059] Updated weights for policy 1, policy_version 55802 (0.0009) [2023-10-08 02:08:39,083][52060] Updated weights for policy 0, policy_version 55110 (0.0009) [2023-10-08 02:08:39,459][52060] Updated weights for policy 0, policy_version 55120 (0.0009) [2023-10-08 02:08:39,820][52060] Updated weights for policy 0, policy_version 55130 (0.0007) [2023-10-08 02:08:40,319][52059] Updated weights for policy 1, policy_version 55812 (0.0009) [2023-10-08 02:08:40,677][52059] Updated weights for policy 1, policy_version 55822 (0.0010) [2023-10-08 02:08:41,048][52059] Updated weights for policy 1, policy_version 55832 (0.0008) [2023-10-08 02:08:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 113606656. Throughput: 0: 1706.6, 1: 1757.6. Samples: 28412226. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) [2023-10-08 02:08:41,211][50642] Avg episode reward: [(0, '18.800'), (1, '23.840')] [2023-10-08 02:08:43,896][52060] Updated weights for policy 0, policy_version 55140 (0.0008) [2023-10-08 02:08:44,287][52060] Updated weights for policy 0, policy_version 55150 (0.0009) [2023-10-08 02:08:44,659][52060] Updated weights for policy 0, policy_version 55160 (0.0009) [2023-10-08 02:08:45,058][52059] Updated weights for policy 1, policy_version 55842 (0.0009) [2023-10-08 02:08:45,459][52059] Updated weights for policy 1, policy_version 55852 (0.0009) [2023-10-08 02:08:45,831][52059] Updated weights for policy 1, policy_version 55862 (0.0010) [2023-10-08 02:08:46,187][52059] Updated weights for policy 1, policy_version 55872 (0.0008) [2023-10-08 02:08:46,210][50642] Fps is (10 sec: 16383.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 113704960. Throughput: 0: 1701.8, 1: 1732.0. Samples: 28432060. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) [2023-10-08 02:08:46,211][50642] Avg episode reward: [(0, '21.630'), (1, '22.210')] [2023-10-08 02:08:46,222][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000055168_56492032.pth... [2023-10-08 02:08:46,222][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000055872_57212928.pth... [2023-10-08 02:08:46,256][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000053568_54853632.pth [2023-10-08 02:08:46,257][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000054240_55541760.pth [2023-10-08 02:08:48,463][52060] Updated weights for policy 0, policy_version 55170 (0.0010) [2023-10-08 02:08:48,831][52060] Updated weights for policy 0, policy_version 55180 (0.0009) [2023-10-08 02:08:49,201][52060] Updated weights for policy 0, policy_version 55190 (0.0008) [2023-10-08 02:08:49,578][52060] Updated weights for policy 0, policy_version 55200 (0.0007) [2023-10-08 02:08:50,010][52059] Updated weights for policy 1, policy_version 55882 (0.0010) [2023-10-08 02:08:50,376][52059] Updated weights for policy 1, policy_version 55892 (0.0010) [2023-10-08 02:08:50,727][52059] Updated weights for policy 1, policy_version 55902 (0.0008) [2023-10-08 02:08:51,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 113770496. Throughput: 0: 1718.9, 1: 1753.0. Samples: 28443208. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) [2023-10-08 02:08:51,211][50642] Avg episode reward: [(0, '21.340'), (1, '19.990')] [2023-10-08 02:08:53,609][52060] Updated weights for policy 0, policy_version 55210 (0.0007) [2023-10-08 02:08:53,970][52060] Updated weights for policy 0, policy_version 55220 (0.0010) [2023-10-08 02:08:54,348][52060] Updated weights for policy 0, policy_version 55230 (0.0009) [2023-10-08 02:08:54,684][52059] Updated weights for policy 1, policy_version 55912 (0.0009) [2023-10-08 02:08:55,054][52059] Updated weights for policy 1, policy_version 55922 (0.0008) [2023-10-08 02:08:55,415][52059] Updated weights for policy 1, policy_version 55932 (0.0010) [2023-10-08 02:08:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 113836032. Throughput: 0: 1690.4, 1: 1742.5. Samples: 28463172. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) [2023-10-08 02:08:56,211][50642] Avg episode reward: [(0, '17.790'), (1, '21.680')] [2023-10-08 02:08:58,197][52060] Updated weights for policy 0, policy_version 55240 (0.0010) [2023-10-08 02:08:58,566][52060] Updated weights for policy 0, policy_version 55250 (0.0007) [2023-10-08 02:08:58,929][52060] Updated weights for policy 0, policy_version 55260 (0.0008) [2023-10-08 02:08:59,364][52059] Updated weights for policy 1, policy_version 55942 (0.0010) [2023-10-08 02:08:59,735][52059] Updated weights for policy 1, policy_version 55952 (0.0008) [2023-10-08 02:09:00,107][52059] Updated weights for policy 1, policy_version 55962 (0.0010) [2023-10-08 02:09:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 113901568. Throughput: 0: 1714.6, 1: 1720.4. Samples: 28483624. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) [2023-10-08 02:09:01,212][50642] Avg episode reward: [(0, '20.580'), (1, '24.250')] [2023-10-08 02:09:03,026][52060] Updated weights for policy 0, policy_version 55270 (0.0007) [2023-10-08 02:09:03,405][52060] Updated weights for policy 0, policy_version 55280 (0.0008) [2023-10-08 02:09:03,776][52060] Updated weights for policy 0, policy_version 55290 (0.0007) [2023-10-08 02:09:04,187][52059] Updated weights for policy 1, policy_version 55972 (0.0009) [2023-10-08 02:09:04,553][52059] Updated weights for policy 1, policy_version 55982 (0.0008) [2023-10-08 02:09:04,908][52059] Updated weights for policy 1, policy_version 55992 (0.0009) [2023-10-08 02:09:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 113967104. Throughput: 0: 1696.8, 1: 1754.0. Samples: 28494506. Policy #0 lag: (min: 23.0, avg: 23.0, max: 23.0) [2023-10-08 02:09:06,211][50642] Avg episode reward: [(0, '22.020'), (1, '21.480')] [2023-10-08 02:09:07,869][52060] Updated weights for policy 0, policy_version 55300 (0.0008) [2023-10-08 02:09:08,230][52060] Updated weights for policy 0, policy_version 55310 (0.0010) [2023-10-08 02:09:08,597][52060] Updated weights for policy 0, policy_version 55320 (0.0010) [2023-10-08 02:09:08,811][52059] Updated weights for policy 1, policy_version 56002 (0.0009) [2023-10-08 02:09:09,179][52059] Updated weights for policy 1, policy_version 56012 (0.0010) [2023-10-08 02:09:09,541][52059] Updated weights for policy 1, policy_version 56022 (0.0007) [2023-10-08 02:09:09,901][52059] Updated weights for policy 1, policy_version 56032 (0.0008) [2023-10-08 02:09:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 114032640. Throughput: 0: 1696.2, 1: 1722.4. Samples: 28514350. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:09:11,211][50642] Avg episode reward: [(0, '20.500'), (1, '20.670')] [2023-10-08 02:09:12,465][52060] Updated weights for policy 0, policy_version 55330 (0.0008) [2023-10-08 02:09:12,826][52060] Updated weights for policy 0, policy_version 55340 (0.0007) [2023-10-08 02:09:13,194][52060] Updated weights for policy 0, policy_version 55350 (0.0009) [2023-10-08 02:09:13,566][52060] Updated weights for policy 0, policy_version 55360 (0.0009) [2023-10-08 02:09:13,899][52059] Updated weights for policy 1, policy_version 56042 (0.0008) [2023-10-08 02:09:14,256][52059] Updated weights for policy 1, policy_version 56052 (0.0009) [2023-10-08 02:09:14,627][52059] Updated weights for policy 1, policy_version 56062 (0.0009) [2023-10-08 02:09:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 114098176. Throughput: 0: 1723.9, 1: 1719.1. Samples: 28535682. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:09:16,212][50642] Avg episode reward: [(0, '18.540'), (1, '23.130')] [2023-10-08 02:09:17,615][52060] Updated weights for policy 0, policy_version 55370 (0.0010) [2023-10-08 02:09:17,983][52060] Updated weights for policy 0, policy_version 55380 (0.0009) [2023-10-08 02:09:18,347][52060] Updated weights for policy 0, policy_version 55390 (0.0009) [2023-10-08 02:09:18,525][52059] Updated weights for policy 1, policy_version 56072 (0.0007) [2023-10-08 02:09:18,895][52059] Updated weights for policy 1, policy_version 56082 (0.0008) [2023-10-08 02:09:19,261][52059] Updated weights for policy 1, policy_version 56092 (0.0007) [2023-10-08 02:09:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 114163712. Throughput: 0: 1686.2, 1: 1729.1. Samples: 28545440. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:09:21,211][50642] Avg episode reward: [(0, '19.720'), (1, '21.400')] [2023-10-08 02:09:22,412][52060] Updated weights for policy 0, policy_version 55400 (0.0008) [2023-10-08 02:09:22,786][52060] Updated weights for policy 0, policy_version 55410 (0.0007) [2023-10-08 02:09:23,157][52060] Updated weights for policy 0, policy_version 55420 (0.0007) [2023-10-08 02:09:23,262][52059] Updated weights for policy 1, policy_version 56102 (0.0007) [2023-10-08 02:09:23,631][52059] Updated weights for policy 1, policy_version 56112 (0.0007) [2023-10-08 02:09:23,995][52059] Updated weights for policy 1, policy_version 56122 (0.0007) [2023-10-08 02:09:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 114229248. Throughput: 0: 1710.8, 1: 1704.8. Samples: 28565930. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:09:26,211][50642] Avg episode reward: [(0, '18.640'), (1, '20.520')] [2023-10-08 02:09:27,097][52060] Updated weights for policy 0, policy_version 55430 (0.0008) [2023-10-08 02:09:27,474][52060] Updated weights for policy 0, policy_version 55440 (0.0011) [2023-10-08 02:09:27,850][52060] Updated weights for policy 0, policy_version 55450 (0.0008) [2023-10-08 02:09:27,984][52059] Updated weights for policy 1, policy_version 56132 (0.0009) [2023-10-08 02:09:28,346][52059] Updated weights for policy 1, policy_version 56142 (0.0007) [2023-10-08 02:09:28,708][52059] Updated weights for policy 1, policy_version 56152 (0.0008) [2023-10-08 02:09:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 114294784. Throughput: 0: 1726.0, 1: 1727.5. Samples: 28587468. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:09:31,211][50642] Avg episode reward: [(0, '20.140'), (1, '20.610')] [2023-10-08 02:09:31,784][52060] Updated weights for policy 0, policy_version 55460 (0.0008) [2023-10-08 02:09:32,168][52060] Updated weights for policy 0, policy_version 55470 (0.0010) [2023-10-08 02:09:32,532][52060] Updated weights for policy 0, policy_version 55480 (0.0009) [2023-10-08 02:09:32,558][52059] Updated weights for policy 1, policy_version 56162 (0.0008) [2023-10-08 02:09:32,932][52059] Updated weights for policy 1, policy_version 56172 (0.0009) [2023-10-08 02:09:33,299][52059] Updated weights for policy 1, policy_version 56182 (0.0009) [2023-10-08 02:09:33,670][52059] Updated weights for policy 1, policy_version 56192 (0.0009) [2023-10-08 02:09:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 114360320. Throughput: 0: 1702.0, 1: 1706.2. Samples: 28596576. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:09:36,211][50642] Avg episode reward: [(0, '21.050'), (1, '24.180')] [2023-10-08 02:09:36,603][52060] Updated weights for policy 0, policy_version 55490 (0.0009) [2023-10-08 02:09:36,980][52060] Updated weights for policy 0, policy_version 55500 (0.0009) [2023-10-08 02:09:37,337][52060] Updated weights for policy 0, policy_version 55510 (0.0008) [2023-10-08 02:09:37,596][52059] Updated weights for policy 1, policy_version 56202 (0.0007) [2023-10-08 02:09:37,702][52060] Updated weights for policy 0, policy_version 55520 (0.0010) [2023-10-08 02:09:37,961][52059] Updated weights for policy 1, policy_version 56212 (0.0007) [2023-10-08 02:09:38,329][52059] Updated weights for policy 1, policy_version 56222 (0.0007) [2023-10-08 02:09:41,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 114425856. Throughput: 0: 1720.9, 1: 1718.4. Samples: 28617940. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:09:41,211][50642] Avg episode reward: [(0, '19.560'), (1, '21.240')] [2023-10-08 02:09:41,646][52060] Updated weights for policy 0, policy_version 55530 (0.0008) [2023-10-08 02:09:42,022][52060] Updated weights for policy 0, policy_version 55540 (0.0010) [2023-10-08 02:09:42,070][52059] Updated weights for policy 1, policy_version 56232 (0.0008) [2023-10-08 02:09:42,384][52060] Updated weights for policy 0, policy_version 55550 (0.0007) [2023-10-08 02:09:42,434][52059] Updated weights for policy 1, policy_version 56242 (0.0009) [2023-10-08 02:09:42,795][52059] Updated weights for policy 1, policy_version 56252 (0.0009) [2023-10-08 02:09:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 114491392. Throughput: 0: 1717.0, 1: 1743.7. Samples: 28639356. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:09:46,211][50642] Avg episode reward: [(0, '19.870'), (1, '21.310')] [2023-10-08 02:09:46,323][52060] Updated weights for policy 0, policy_version 55560 (0.0008) [2023-10-08 02:09:46,688][52060] Updated weights for policy 0, policy_version 55570 (0.0009) [2023-10-08 02:09:46,697][52059] Updated weights for policy 1, policy_version 56262 (0.0008) [2023-10-08 02:09:47,054][52060] Updated weights for policy 0, policy_version 55580 (0.0007) [2023-10-08 02:09:47,063][52059] Updated weights for policy 1, policy_version 56272 (0.0009) [2023-10-08 02:09:47,431][52059] Updated weights for policy 1, policy_version 56282 (0.0009) [2023-10-08 02:09:50,972][52060] Updated weights for policy 0, policy_version 55590 (0.0007) [2023-10-08 02:09:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 114556928. Throughput: 0: 1712.6, 1: 1710.5. Samples: 28648544. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:09:51,211][50642] Avg episode reward: [(0, '22.140'), (1, '23.840')] [2023-10-08 02:09:51,335][52060] Updated weights for policy 0, policy_version 55600 (0.0008) [2023-10-08 02:09:51,364][52059] Updated weights for policy 1, policy_version 56292 (0.0008) [2023-10-08 02:09:51,716][52060] Updated weights for policy 0, policy_version 55610 (0.0007) [2023-10-08 02:09:51,729][52059] Updated weights for policy 1, policy_version 56302 (0.0008) [2023-10-08 02:09:52,088][52059] Updated weights for policy 1, policy_version 56312 (0.0008) [2023-10-08 02:09:55,828][52060] Updated weights for policy 0, policy_version 55620 (0.0009) [2023-10-08 02:09:56,190][52059] Updated weights for policy 1, policy_version 56322 (0.0009) [2023-10-08 02:09:56,205][52060] Updated weights for policy 0, policy_version 55630 (0.0008) [2023-10-08 02:09:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 114622464. Throughput: 0: 1722.5, 1: 1735.0. Samples: 28669936. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:09:56,211][50642] Avg episode reward: [(0, '18.380'), (1, '22.750')] [2023-10-08 02:09:56,554][52059] Updated weights for policy 1, policy_version 56332 (0.0008) [2023-10-08 02:09:56,565][52060] Updated weights for policy 0, policy_version 55640 (0.0007) [2023-10-08 02:09:56,913][52059] Updated weights for policy 1, policy_version 56342 (0.0008) [2023-10-08 02:09:57,281][52059] Updated weights for policy 1, policy_version 56352 (0.0009) [2023-10-08 02:10:00,496][52060] Updated weights for policy 0, policy_version 55650 (0.0008) [2023-10-08 02:10:00,867][52060] Updated weights for policy 0, policy_version 55660 (0.0010) [2023-10-08 02:10:01,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 114688000. Throughput: 0: 1706.2, 1: 1735.1. Samples: 28690542. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:10:01,211][50642] Avg episode reward: [(0, '18.380'), (1, '19.240')] [2023-10-08 02:10:01,229][52060] Updated weights for policy 0, policy_version 55670 (0.0009) [2023-10-08 02:10:01,399][52059] Updated weights for policy 1, policy_version 56362 (0.0008) [2023-10-08 02:10:01,605][52060] Updated weights for policy 0, policy_version 55680 (0.0009) [2023-10-08 02:10:01,761][52059] Updated weights for policy 1, policy_version 56372 (0.0009) [2023-10-08 02:10:02,124][52059] Updated weights for policy 1, policy_version 56382 (0.0009) [2023-10-08 02:10:05,509][52060] Updated weights for policy 0, policy_version 55690 (0.0010) [2023-10-08 02:10:05,882][52060] Updated weights for policy 0, policy_version 55700 (0.0010) [2023-10-08 02:10:06,174][52059] Updated weights for policy 1, policy_version 56392 (0.0009) [2023-10-08 02:10:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 114753536. Throughput: 0: 1723.7, 1: 1721.2. Samples: 28700458. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:10:06,211][50642] Avg episode reward: [(0, '22.830'), (1, '20.270')] [2023-10-08 02:10:06,243][52060] Updated weights for policy 0, policy_version 55710 (0.0008) [2023-10-08 02:10:06,539][52059] Updated weights for policy 1, policy_version 56402 (0.0009) [2023-10-08 02:10:06,900][52059] Updated weights for policy 1, policy_version 56412 (0.0007) [2023-10-08 02:10:10,204][52060] Updated weights for policy 0, policy_version 55720 (0.0007) [2023-10-08 02:10:10,573][52060] Updated weights for policy 0, policy_version 55730 (0.0011) [2023-10-08 02:10:10,706][52059] Updated weights for policy 1, policy_version 56422 (0.0008) [2023-10-08 02:10:10,938][52060] Updated weights for policy 0, policy_version 55740 (0.0009) [2023-10-08 02:10:11,066][52059] Updated weights for policy 1, policy_version 56432 (0.0009) [2023-10-08 02:10:11,210][50642] Fps is (10 sec: 16384.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 114851840. Throughput: 0: 1721.5, 1: 1743.1. Samples: 28721836. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:10:11,211][50642] Avg episode reward: [(0, '20.700'), (1, '24.610')] [2023-10-08 02:10:11,422][52059] Updated weights for policy 1, policy_version 56442 (0.0007) [2023-10-08 02:10:14,889][52060] Updated weights for policy 0, policy_version 55750 (0.0007) [2023-10-08 02:10:15,254][52060] Updated weights for policy 0, policy_version 55760 (0.0008) [2023-10-08 02:10:15,303][52059] Updated weights for policy 1, policy_version 56452 (0.0008) [2023-10-08 02:10:15,635][52060] Updated weights for policy 0, policy_version 55770 (0.0009) [2023-10-08 02:10:15,665][52059] Updated weights for policy 1, policy_version 56462 (0.0008) [2023-10-08 02:10:16,031][52059] Updated weights for policy 1, policy_version 56472 (0.0009) [2023-10-08 02:10:16,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 114917376. Throughput: 0: 1692.4, 1: 1723.6. Samples: 28741184. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:10:16,211][50642] Avg episode reward: [(0, '17.920'), (1, '23.070')] [2023-10-08 02:10:19,501][52060] Updated weights for policy 0, policy_version 55780 (0.0008) [2023-10-08 02:10:19,871][52060] Updated weights for policy 0, policy_version 55790 (0.0010) [2023-10-08 02:10:20,132][52059] Updated weights for policy 1, policy_version 56482 (0.0009) [2023-10-08 02:10:20,236][52060] Updated weights for policy 0, policy_version 55800 (0.0009) [2023-10-08 02:10:20,535][52059] Updated weights for policy 1, policy_version 56492 (0.0010) [2023-10-08 02:10:20,906][52059] Updated weights for policy 1, policy_version 56502 (0.0010) [2023-10-08 02:10:21,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 114982912. Throughput: 0: 1725.5, 1: 1739.2. Samples: 28752486. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:10:21,211][50642] Avg episode reward: [(0, '20.540'), (1, '20.550')] [2023-10-08 02:10:21,269][52059] Updated weights for policy 1, policy_version 56512 (0.0009) [2023-10-08 02:10:24,137][52060] Updated weights for policy 0, policy_version 55810 (0.0009) [2023-10-08 02:10:24,512][52060] Updated weights for policy 0, policy_version 55820 (0.0011) [2023-10-08 02:10:24,880][52060] Updated weights for policy 0, policy_version 55830 (0.0009) [2023-10-08 02:10:25,013][52059] Updated weights for policy 1, policy_version 56522 (0.0007) [2023-10-08 02:10:25,252][52060] Updated weights for policy 0, policy_version 55840 (0.0008) [2023-10-08 02:10:25,373][52059] Updated weights for policy 1, policy_version 56532 (0.0010) [2023-10-08 02:10:25,735][52059] Updated weights for policy 1, policy_version 56542 (0.0009) [2023-10-08 02:10:26,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 115081216. Throughput: 0: 1706.7, 1: 1732.8. Samples: 28772720. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:10:26,211][50642] Avg episode reward: [(0, '22.390'), (1, '22.190')] [2023-10-08 02:10:29,471][52060] Updated weights for policy 0, policy_version 55850 (0.0007) [2023-10-08 02:10:29,610][52059] Updated weights for policy 1, policy_version 56552 (0.0010) [2023-10-08 02:10:29,836][52060] Updated weights for policy 0, policy_version 55860 (0.0009) [2023-10-08 02:10:29,978][52059] Updated weights for policy 1, policy_version 56562 (0.0008) [2023-10-08 02:10:30,206][52060] Updated weights for policy 0, policy_version 55870 (0.0008) [2023-10-08 02:10:30,340][52059] Updated weights for policy 1, policy_version 56572 (0.0010) [2023-10-08 02:10:31,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 115146752. Throughput: 0: 1694.4, 1: 1705.7. Samples: 28792358. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:10:31,211][50642] Avg episode reward: [(0, '17.430'), (1, '27.380')] [2023-10-08 02:10:31,223][51710] Saving new best policy, reward=27.380! [2023-10-08 02:10:34,268][52060] Updated weights for policy 0, policy_version 55880 (0.0008) [2023-10-08 02:10:34,283][52059] Updated weights for policy 1, policy_version 56582 (0.0008) [2023-10-08 02:10:34,632][52060] Updated weights for policy 0, policy_version 55890 (0.0009) [2023-10-08 02:10:34,657][52059] Updated weights for policy 1, policy_version 56592 (0.0007) [2023-10-08 02:10:34,994][52060] Updated weights for policy 0, policy_version 55900 (0.0008) [2023-10-08 02:10:35,020][52059] Updated weights for policy 1, policy_version 56602 (0.0007) [2023-10-08 02:10:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 115212288. Throughput: 0: 1722.2, 1: 1740.3. Samples: 28804356. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:10:36,211][50642] Avg episode reward: [(0, '19.730'), (1, '20.720')] [2023-10-08 02:10:39,009][52059] Updated weights for policy 1, policy_version 56612 (0.0008) [2023-10-08 02:10:39,137][52060] Updated weights for policy 0, policy_version 55910 (0.0008) [2023-10-08 02:10:39,370][52059] Updated weights for policy 1, policy_version 56622 (0.0009) [2023-10-08 02:10:39,499][52060] Updated weights for policy 0, policy_version 55920 (0.0007) [2023-10-08 02:10:39,743][52059] Updated weights for policy 1, policy_version 56632 (0.0007) [2023-10-08 02:10:39,866][52060] Updated weights for policy 0, policy_version 55930 (0.0008) [2023-10-08 02:10:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 115277824. Throughput: 0: 1695.8, 1: 1720.3. Samples: 28823658. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:10:41,211][50642] Avg episode reward: [(0, '23.430'), (1, '18.960')] [2023-10-08 02:10:43,709][52059] Updated weights for policy 1, policy_version 56642 (0.0008) [2023-10-08 02:10:43,892][52060] Updated weights for policy 0, policy_version 55940 (0.0008) [2023-10-08 02:10:44,073][52059] Updated weights for policy 1, policy_version 56652 (0.0009) [2023-10-08 02:10:44,261][52060] Updated weights for policy 0, policy_version 55950 (0.0008) [2023-10-08 02:10:44,439][52059] Updated weights for policy 1, policy_version 56662 (0.0007) [2023-10-08 02:10:44,621][52060] Updated weights for policy 0, policy_version 55960 (0.0007) [2023-10-08 02:10:44,804][52059] Updated weights for policy 1, policy_version 56672 (0.0007) [2023-10-08 02:10:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 115343360. Throughput: 0: 1699.8, 1: 1714.7. Samples: 28844194. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:10:46,211][50642] Avg episode reward: [(0, '16.540'), (1, '22.450')] [2023-10-08 02:10:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000055968_57311232.pth... [2023-10-08 02:10:46,218][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000056672_58032128.pth... [2023-10-08 02:10:46,252][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000054368_55672832.pth [2023-10-08 02:10:46,252][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000055040_56360960.pth [2023-10-08 02:10:46,256][51605] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p0/milestones/checkpoint_000055968_57311232.pth [2023-10-08 02:10:46,256][51710] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p1/milestones/checkpoint_000056672_58032128.pth [2023-10-08 02:10:48,597][52060] Updated weights for policy 0, policy_version 55970 (0.0009) [2023-10-08 02:10:48,704][52059] Updated weights for policy 1, policy_version 56682 (0.0009) [2023-10-08 02:10:48,968][52060] Updated weights for policy 0, policy_version 55980 (0.0008) [2023-10-08 02:10:49,077][52059] Updated weights for policy 1, policy_version 56692 (0.0008) [2023-10-08 02:10:49,334][52060] Updated weights for policy 0, policy_version 55990 (0.0007) [2023-10-08 02:10:49,437][52059] Updated weights for policy 1, policy_version 56702 (0.0007) [2023-10-08 02:10:49,706][52060] Updated weights for policy 0, policy_version 56000 (0.0008) [2023-10-08 02:10:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 115408896. Throughput: 0: 1705.8, 1: 1732.6. Samples: 28855186. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:10:51,211][50642] Avg episode reward: [(0, '16.870'), (1, '24.710')] [2023-10-08 02:10:53,469][52059] Updated weights for policy 1, policy_version 56712 (0.0007) [2023-10-08 02:10:53,697][52060] Updated weights for policy 0, policy_version 56010 (0.0010) [2023-10-08 02:10:53,826][52059] Updated weights for policy 1, policy_version 56722 (0.0008) [2023-10-08 02:10:54,061][52060] Updated weights for policy 0, policy_version 56020 (0.0010) [2023-10-08 02:10:54,201][52059] Updated weights for policy 1, policy_version 56732 (0.0009) [2023-10-08 02:10:54,425][52060] Updated weights for policy 0, policy_version 56030 (0.0008) [2023-10-08 02:10:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 115474432. Throughput: 0: 1684.2, 1: 1711.3. Samples: 28874634. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:10:56,211][50642] Avg episode reward: [(0, '21.960'), (1, '21.800')] [2023-10-08 02:10:58,142][52059] Updated weights for policy 1, policy_version 56742 (0.0009) [2023-10-08 02:10:58,467][52060] Updated weights for policy 0, policy_version 56040 (0.0009) [2023-10-08 02:10:58,521][52059] Updated weights for policy 1, policy_version 56752 (0.0009) [2023-10-08 02:10:58,844][52060] Updated weights for policy 0, policy_version 56050 (0.0007) [2023-10-08 02:10:58,880][52059] Updated weights for policy 1, policy_version 56762 (0.0007) [2023-10-08 02:10:59,209][52060] Updated weights for policy 0, policy_version 56060 (0.0008) [2023-10-08 02:11:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 115539968. Throughput: 0: 1706.3, 1: 1726.1. Samples: 28895642. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:11:01,211][50642] Avg episode reward: [(0, '22.380'), (1, '18.500')] [2023-10-08 02:11:02,720][52059] Updated weights for policy 1, policy_version 56772 (0.0010) [2023-10-08 02:11:02,951][52060] Updated weights for policy 0, policy_version 56070 (0.0011) [2023-10-08 02:11:03,083][52059] Updated weights for policy 1, policy_version 56782 (0.0007) [2023-10-08 02:11:03,321][52060] Updated weights for policy 0, policy_version 56080 (0.0007) [2023-10-08 02:11:03,450][52059] Updated weights for policy 1, policy_version 56792 (0.0007) [2023-10-08 02:11:03,686][52060] Updated weights for policy 0, policy_version 56090 (0.0007) [2023-10-08 02:11:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 115605504. Throughput: 0: 1685.4, 1: 1712.0. Samples: 28905368. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 02:11:06,211][50642] Avg episode reward: [(0, '17.170'), (1, '21.260')] [2023-10-08 02:11:07,607][52060] Updated weights for policy 0, policy_version 56100 (0.0008) [2023-10-08 02:11:07,677][52059] Updated weights for policy 1, policy_version 56802 (0.0007) [2023-10-08 02:11:07,993][52060] Updated weights for policy 0, policy_version 56110 (0.0007) [2023-10-08 02:11:08,057][52059] Updated weights for policy 1, policy_version 56812 (0.0009) [2023-10-08 02:11:08,352][52060] Updated weights for policy 0, policy_version 56120 (0.0007) [2023-10-08 02:11:08,424][52059] Updated weights for policy 1, policy_version 56822 (0.0009) [2023-10-08 02:11:08,785][52059] Updated weights for policy 1, policy_version 56832 (0.0009) [2023-10-08 02:11:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 115671040. Throughput: 0: 1706.7, 1: 1704.8. Samples: 28926236. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 02:11:11,211][50642] Avg episode reward: [(0, '19.660'), (1, '24.270')] [2023-10-08 02:11:12,363][52060] Updated weights for policy 0, policy_version 56130 (0.0008) [2023-10-08 02:11:12,735][52060] Updated weights for policy 0, policy_version 56140 (0.0007) [2023-10-08 02:11:12,747][52059] Updated weights for policy 1, policy_version 56842 (0.0007) [2023-10-08 02:11:13,101][52060] Updated weights for policy 0, policy_version 56150 (0.0007) [2023-10-08 02:11:13,104][52059] Updated weights for policy 1, policy_version 56852 (0.0007) [2023-10-08 02:11:13,465][52060] Updated weights for policy 0, policy_version 56160 (0.0008) [2023-10-08 02:11:13,469][52059] Updated weights for policy 1, policy_version 56862 (0.0007) [2023-10-08 02:11:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 115736576. Throughput: 0: 1718.1, 1: 1730.7. Samples: 28947556. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 02:11:16,211][50642] Avg episode reward: [(0, '24.060'), (1, '24.750')] [2023-10-08 02:11:16,218][51605] Saving new best policy, reward=24.060! [2023-10-08 02:11:17,265][52059] Updated weights for policy 1, policy_version 56872 (0.0007) [2023-10-08 02:11:17,524][52060] Updated weights for policy 0, policy_version 56170 (0.0008) [2023-10-08 02:11:17,632][52059] Updated weights for policy 1, policy_version 56882 (0.0007) [2023-10-08 02:11:17,895][52060] Updated weights for policy 0, policy_version 56180 (0.0010) [2023-10-08 02:11:17,984][52059] Updated weights for policy 1, policy_version 56892 (0.0009) [2023-10-08 02:11:18,270][52060] Updated weights for policy 0, policy_version 56190 (0.0009) [2023-10-08 02:11:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 115802112. Throughput: 0: 1687.4, 1: 1700.4. Samples: 28956806. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 02:11:21,211][50642] Avg episode reward: [(0, '17.020'), (1, '22.000')] [2023-10-08 02:11:21,961][52059] Updated weights for policy 1, policy_version 56902 (0.0008) [2023-10-08 02:11:22,324][52059] Updated weights for policy 1, policy_version 56912 (0.0009) [2023-10-08 02:11:22,339][52060] Updated weights for policy 0, policy_version 56200 (0.0009) [2023-10-08 02:11:22,678][52059] Updated weights for policy 1, policy_version 56922 (0.0009) [2023-10-08 02:11:22,702][52060] Updated weights for policy 0, policy_version 56210 (0.0007) [2023-10-08 02:11:23,072][52060] Updated weights for policy 0, policy_version 56220 (0.0007) [2023-10-08 02:11:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 115867648. Throughput: 0: 1708.6, 1: 1725.3. Samples: 28978184. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 02:11:26,211][50642] Avg episode reward: [(0, '18.330'), (1, '21.900')] [2023-10-08 02:11:26,558][52059] Updated weights for policy 1, policy_version 56932 (0.0009) [2023-10-08 02:11:26,928][52059] Updated weights for policy 1, policy_version 56942 (0.0009) [2023-10-08 02:11:26,958][52060] Updated weights for policy 0, policy_version 56230 (0.0009) [2023-10-08 02:11:27,296][52059] Updated weights for policy 1, policy_version 56952 (0.0009) [2023-10-08 02:11:27,323][52060] Updated weights for policy 0, policy_version 56240 (0.0010) [2023-10-08 02:11:27,694][52060] Updated weights for policy 0, policy_version 56250 (0.0009) [2023-10-08 02:11:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 115933184. Throughput: 0: 1717.6, 1: 1731.6. Samples: 28999412. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 02:11:31,211][50642] Avg episode reward: [(0, '22.160'), (1, '25.750')] [2023-10-08 02:11:31,277][52059] Updated weights for policy 1, policy_version 56962 (0.0010) [2023-10-08 02:11:31,645][52059] Updated weights for policy 1, policy_version 56972 (0.0008) [2023-10-08 02:11:31,719][52060] Updated weights for policy 0, policy_version 56260 (0.0008) [2023-10-08 02:11:32,015][52059] Updated weights for policy 1, policy_version 56982 (0.0009) [2023-10-08 02:11:32,078][52060] Updated weights for policy 0, policy_version 56270 (0.0009) [2023-10-08 02:11:32,390][52059] Updated weights for policy 1, policy_version 56992 (0.0008) [2023-10-08 02:11:32,446][52060] Updated weights for policy 0, policy_version 56280 (0.0008) [2023-10-08 02:11:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 115998720. Throughput: 0: 1701.5, 1: 1709.5. Samples: 29008682. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 02:11:36,211][50642] Avg episode reward: [(0, '20.720'), (1, '25.330')] [2023-10-08 02:11:36,407][52060] Updated weights for policy 0, policy_version 56290 (0.0009) [2023-10-08 02:11:36,522][52059] Updated weights for policy 1, policy_version 57002 (0.0009) [2023-10-08 02:11:36,778][52060] Updated weights for policy 0, policy_version 56300 (0.0008) [2023-10-08 02:11:36,893][52059] Updated weights for policy 1, policy_version 57012 (0.0009) [2023-10-08 02:11:37,133][52060] Updated weights for policy 0, policy_version 56310 (0.0008) [2023-10-08 02:11:37,265][52059] Updated weights for policy 1, policy_version 57022 (0.0009) [2023-10-08 02:11:37,502][52060] Updated weights for policy 0, policy_version 56320 (0.0009) [2023-10-08 02:11:41,195][52059] Updated weights for policy 1, policy_version 57032 (0.0009) [2023-10-08 02:11:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 116064256. Throughput: 0: 1726.8, 1: 1726.0. Samples: 29030006. Policy #0 lag: (min: 31.0, avg: 33.9, max: 63.0) [2023-10-08 02:11:41,211][50642] Avg episode reward: [(0, '18.770'), (1, '22.140')] [2023-10-08 02:11:41,396][52060] Updated weights for policy 0, policy_version 56330 (0.0008) [2023-10-08 02:11:41,561][52059] Updated weights for policy 1, policy_version 57042 (0.0010) [2023-10-08 02:11:41,767][52060] Updated weights for policy 0, policy_version 56340 (0.0008) [2023-10-08 02:11:41,917][52059] Updated weights for policy 1, policy_version 57052 (0.0009) [2023-10-08 02:11:42,138][52060] Updated weights for policy 0, policy_version 56350 (0.0008) [2023-10-08 02:11:45,787][52059] Updated weights for policy 1, policy_version 57062 (0.0007) [2023-10-08 02:11:46,094][52060] Updated weights for policy 0, policy_version 56360 (0.0007) [2023-10-08 02:11:46,156][52059] Updated weights for policy 1, policy_version 57072 (0.0007) [2023-10-08 02:11:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 116129792. Throughput: 0: 1730.6, 1: 1724.4. Samples: 29051120. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:11:46,211][50642] Avg episode reward: [(0, '20.970'), (1, '20.490')] [2023-10-08 02:11:46,463][52060] Updated weights for policy 0, policy_version 56370 (0.0008) [2023-10-08 02:11:46,514][52059] Updated weights for policy 1, policy_version 57082 (0.0008) [2023-10-08 02:11:46,835][52060] Updated weights for policy 0, policy_version 56380 (0.0012) [2023-10-08 02:11:50,440][52059] Updated weights for policy 1, policy_version 57092 (0.0008) [2023-10-08 02:11:50,749][52060] Updated weights for policy 0, policy_version 56390 (0.0010) [2023-10-08 02:11:50,798][52059] Updated weights for policy 1, policy_version 57102 (0.0007) [2023-10-08 02:11:51,105][52060] Updated weights for policy 0, policy_version 56400 (0.0008) [2023-10-08 02:11:51,162][52059] Updated weights for policy 1, policy_version 57112 (0.0007) [2023-10-08 02:11:51,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 116195328. Throughput: 0: 1725.8, 1: 1731.2. Samples: 29060930. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:11:51,211][50642] Avg episode reward: [(0, '23.700'), (1, '25.930')] [2023-10-08 02:11:51,467][52060] Updated weights for policy 0, policy_version 56410 (0.0007) [2023-10-08 02:11:55,075][52059] Updated weights for policy 1, policy_version 57122 (0.0008) [2023-10-08 02:11:55,455][52060] Updated weights for policy 0, policy_version 56420 (0.0008) [2023-10-08 02:11:55,476][52059] Updated weights for policy 1, policy_version 57132 (0.0008) [2023-10-08 02:11:55,840][52060] Updated weights for policy 0, policy_version 56430 (0.0009) [2023-10-08 02:11:55,841][52059] Updated weights for policy 1, policy_version 57142 (0.0010) [2023-10-08 02:11:56,200][52059] Updated weights for policy 1, policy_version 57152 (0.0009) [2023-10-08 02:11:56,202][52060] Updated weights for policy 0, policy_version 56440 (0.0007) [2023-10-08 02:11:56,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 116293632. Throughput: 0: 1728.9, 1: 1746.4. Samples: 29082626. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:11:56,211][50642] Avg episode reward: [(0, '18.950'), (1, '24.060')] [2023-10-08 02:12:00,046][52060] Updated weights for policy 0, policy_version 56450 (0.0007) [2023-10-08 02:12:00,118][52059] Updated weights for policy 1, policy_version 57162 (0.0009) [2023-10-08 02:12:00,409][52060] Updated weights for policy 0, policy_version 56460 (0.0009) [2023-10-08 02:12:00,479][52059] Updated weights for policy 1, policy_version 57172 (0.0009) [2023-10-08 02:12:00,777][52060] Updated weights for policy 0, policy_version 56470 (0.0007) [2023-10-08 02:12:00,842][52059] Updated weights for policy 1, policy_version 57182 (0.0008) [2023-10-08 02:12:01,140][52060] Updated weights for policy 0, policy_version 56480 (0.0008) [2023-10-08 02:12:01,210][50642] Fps is (10 sec: 19660.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 116391936. Throughput: 0: 1712.9, 1: 1715.1. Samples: 29101820. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:12:01,211][50642] Avg episode reward: [(0, '18.500'), (1, '21.940')] [2023-10-08 02:12:04,902][52059] Updated weights for policy 1, policy_version 57192 (0.0008) [2023-10-08 02:12:05,152][52060] Updated weights for policy 0, policy_version 56490 (0.0008) [2023-10-08 02:12:05,269][52059] Updated weights for policy 1, policy_version 57202 (0.0008) [2023-10-08 02:12:05,521][52060] Updated weights for policy 0, policy_version 56500 (0.0009) [2023-10-08 02:12:05,624][52059] Updated weights for policy 1, policy_version 57212 (0.0010) [2023-10-08 02:12:05,885][52060] Updated weights for policy 0, policy_version 56510 (0.0010) [2023-10-08 02:12:06,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 116457472. Throughput: 0: 1736.9, 1: 1737.9. Samples: 29113172. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:12:06,211][50642] Avg episode reward: [(0, '22.070'), (1, '22.700')] [2023-10-08 02:12:09,676][52059] Updated weights for policy 1, policy_version 57222 (0.0007) [2023-10-08 02:12:09,931][52060] Updated weights for policy 0, policy_version 56520 (0.0007) [2023-10-08 02:12:10,045][52059] Updated weights for policy 1, policy_version 57232 (0.0008) [2023-10-08 02:12:10,293][52060] Updated weights for policy 0, policy_version 56530 (0.0007) [2023-10-08 02:12:10,411][52059] Updated weights for policy 1, policy_version 57242 (0.0008) [2023-10-08 02:12:10,668][52060] Updated weights for policy 0, policy_version 56540 (0.0007) [2023-10-08 02:12:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 116523008. Throughput: 0: 1735.3, 1: 1723.9. Samples: 29133850. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:12:11,211][50642] Avg episode reward: [(0, '21.240'), (1, '23.520')] [2023-10-08 02:12:14,379][52059] Updated weights for policy 1, policy_version 57252 (0.0009) [2023-10-08 02:12:14,656][52060] Updated weights for policy 0, policy_version 56550 (0.0007) [2023-10-08 02:12:14,741][52059] Updated weights for policy 1, policy_version 57262 (0.0008) [2023-10-08 02:12:15,037][52060] Updated weights for policy 0, policy_version 56560 (0.0008) [2023-10-08 02:12:15,100][52059] Updated weights for policy 1, policy_version 57272 (0.0009) [2023-10-08 02:12:15,403][52060] Updated weights for policy 0, policy_version 56570 (0.0009) [2023-10-08 02:12:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 116588544. Throughput: 0: 1710.6, 1: 1703.1. Samples: 29153026. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:12:16,211][50642] Avg episode reward: [(0, '19.210'), (1, '26.070')] [2023-10-08 02:12:19,066][52059] Updated weights for policy 1, policy_version 57282 (0.0008) [2023-10-08 02:12:19,261][52060] Updated weights for policy 0, policy_version 56580 (0.0009) [2023-10-08 02:12:19,441][52059] Updated weights for policy 1, policy_version 57292 (0.0008) [2023-10-08 02:12:19,640][52060] Updated weights for policy 0, policy_version 56590 (0.0009) [2023-10-08 02:12:19,799][52059] Updated weights for policy 1, policy_version 57302 (0.0009) [2023-10-08 02:12:20,015][52060] Updated weights for policy 0, policy_version 56600 (0.0009) [2023-10-08 02:12:20,172][52059] Updated weights for policy 1, policy_version 57312 (0.0008) [2023-10-08 02:12:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 116654080. Throughput: 0: 1738.0, 1: 1736.6. Samples: 29165042. Policy #0 lag: (min: 7.0, avg: 7.2, max: 16.0) [2023-10-08 02:12:21,211][50642] Avg episode reward: [(0, '21.200'), (1, '22.060')] [2023-10-08 02:12:24,079][52060] Updated weights for policy 0, policy_version 56610 (0.0007) [2023-10-08 02:12:24,094][52059] Updated weights for policy 1, policy_version 57322 (0.0008) [2023-10-08 02:12:24,450][52060] Updated weights for policy 0, policy_version 56620 (0.0007) [2023-10-08 02:12:24,470][52059] Updated weights for policy 1, policy_version 57332 (0.0009) [2023-10-08 02:12:24,818][52060] Updated weights for policy 0, policy_version 56630 (0.0007) [2023-10-08 02:12:24,838][52059] Updated weights for policy 1, policy_version 57342 (0.0007) [2023-10-08 02:12:25,193][52060] Updated weights for policy 0, policy_version 56640 (0.0008) [2023-10-08 02:12:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 116719616. Throughput: 0: 1712.7, 1: 1711.4. Samples: 29184090. Policy #0 lag: (min: 7.0, avg: 7.2, max: 16.0) [2023-10-08 02:12:26,211][50642] Avg episode reward: [(0, '22.510'), (1, '22.050')] [2023-10-08 02:12:28,796][52059] Updated weights for policy 1, policy_version 57352 (0.0007) [2023-10-08 02:12:29,168][52059] Updated weights for policy 1, policy_version 57362 (0.0008) [2023-10-08 02:12:29,234][52060] Updated weights for policy 0, policy_version 56650 (0.0009) [2023-10-08 02:12:29,533][52059] Updated weights for policy 1, policy_version 57372 (0.0008) [2023-10-08 02:12:29,595][52060] Updated weights for policy 0, policy_version 56660 (0.0009) [2023-10-08 02:12:29,959][52060] Updated weights for policy 0, policy_version 56670 (0.0008) [2023-10-08 02:12:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 116785152. Throughput: 0: 1697.6, 1: 1711.5. Samples: 29204530. Policy #0 lag: (min: 7.0, avg: 7.2, max: 16.0) [2023-10-08 02:12:31,211][50642] Avg episode reward: [(0, '19.430'), (1, '23.720')] [2023-10-08 02:12:33,387][52059] Updated weights for policy 1, policy_version 57382 (0.0007) [2023-10-08 02:12:33,748][52059] Updated weights for policy 1, policy_version 57392 (0.0008) [2023-10-08 02:12:33,902][52060] Updated weights for policy 0, policy_version 56680 (0.0008) [2023-10-08 02:12:34,114][52059] Updated weights for policy 1, policy_version 57402 (0.0008) [2023-10-08 02:12:34,278][52060] Updated weights for policy 0, policy_version 56690 (0.0009) [2023-10-08 02:12:34,643][52060] Updated weights for policy 0, policy_version 56700 (0.0009) [2023-10-08 02:12:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 116850688. Throughput: 0: 1714.9, 1: 1722.3. Samples: 29215604. Policy #0 lag: (min: 7.0, avg: 7.2, max: 16.0) [2023-10-08 02:12:36,211][50642] Avg episode reward: [(0, '20.430'), (1, '22.160')] [2023-10-08 02:12:38,190][52059] Updated weights for policy 1, policy_version 57412 (0.0008) [2023-10-08 02:12:38,557][52059] Updated weights for policy 1, policy_version 57422 (0.0009) [2023-10-08 02:12:38,833][52060] Updated weights for policy 0, policy_version 56710 (0.0008) [2023-10-08 02:12:38,924][52059] Updated weights for policy 1, policy_version 57432 (0.0008) [2023-10-08 02:12:39,196][52060] Updated weights for policy 0, policy_version 56720 (0.0008) [2023-10-08 02:12:39,577][52060] Updated weights for policy 0, policy_version 56730 (0.0009) [2023-10-08 02:12:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 116916224. Throughput: 0: 1682.7, 1: 1705.2. Samples: 29235082. Policy #0 lag: (min: 7.0, avg: 7.2, max: 16.0) [2023-10-08 02:12:41,211][50642] Avg episode reward: [(0, '23.350'), (1, '22.010')] [2023-10-08 02:12:42,813][52059] Updated weights for policy 1, policy_version 57442 (0.0008) [2023-10-08 02:12:43,215][52059] Updated weights for policy 1, policy_version 57452 (0.0007) [2023-10-08 02:12:43,589][52059] Updated weights for policy 1, policy_version 57462 (0.0007) [2023-10-08 02:12:43,594][52060] Updated weights for policy 0, policy_version 56740 (0.0007) [2023-10-08 02:12:43,947][52059] Updated weights for policy 1, policy_version 57472 (0.0008) [2023-10-08 02:12:43,978][52060] Updated weights for policy 0, policy_version 56750 (0.0008) [2023-10-08 02:12:44,352][52060] Updated weights for policy 0, policy_version 56760 (0.0007) [2023-10-08 02:12:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 116981760. Throughput: 0: 1700.0, 1: 1733.6. Samples: 29256334. Policy #0 lag: (min: 7.0, avg: 7.2, max: 16.0) [2023-10-08 02:12:46,211][50642] Avg episode reward: [(0, '21.060'), (1, '22.000')] [2023-10-08 02:12:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000056768_58130432.pth... [2023-10-08 02:12:46,218][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000057472_58851328.pth... [2023-10-08 02:12:46,264][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000055872_57212928.pth [2023-10-08 02:12:46,269][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000055168_56492032.pth [2023-10-08 02:12:47,799][52059] Updated weights for policy 1, policy_version 57482 (0.0008) [2023-10-08 02:12:48,163][52059] Updated weights for policy 1, policy_version 57492 (0.0009) [2023-10-08 02:12:48,411][52060] Updated weights for policy 0, policy_version 56770 (0.0008) [2023-10-08 02:12:48,520][52059] Updated weights for policy 1, policy_version 57502 (0.0007) [2023-10-08 02:12:48,772][52060] Updated weights for policy 0, policy_version 56780 (0.0008) [2023-10-08 02:12:49,137][52060] Updated weights for policy 0, policy_version 56790 (0.0007) [2023-10-08 02:12:49,504][52060] Updated weights for policy 0, policy_version 56800 (0.0007) [2023-10-08 02:12:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 117047296. Throughput: 0: 1693.2, 1: 1707.1. Samples: 29266184. Policy #0 lag: (min: 7.0, avg: 7.2, max: 16.0) [2023-10-08 02:12:51,211][50642] Avg episode reward: [(0, '19.030'), (1, '23.780')] [2023-10-08 02:12:52,419][52059] Updated weights for policy 1, policy_version 57512 (0.0008) [2023-10-08 02:12:52,791][52059] Updated weights for policy 1, policy_version 57522 (0.0008) [2023-10-08 02:12:53,149][52059] Updated weights for policy 1, policy_version 57532 (0.0008) [2023-10-08 02:12:53,288][52060] Updated weights for policy 0, policy_version 56810 (0.0009) [2023-10-08 02:12:53,656][52060] Updated weights for policy 0, policy_version 56820 (0.0007) [2023-10-08 02:12:54,024][52060] Updated weights for policy 0, policy_version 56830 (0.0007) [2023-10-08 02:12:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 117112832. Throughput: 0: 1686.5, 1: 1718.9. Samples: 29287092. Policy #0 lag: (min: 7.0, avg: 7.2, max: 16.0) [2023-10-08 02:12:56,211][50642] Avg episode reward: [(0, '21.910'), (1, '23.740')] [2023-10-08 02:12:57,123][52059] Updated weights for policy 1, policy_version 57542 (0.0008) [2023-10-08 02:12:57,489][52059] Updated weights for policy 1, policy_version 57552 (0.0008) [2023-10-08 02:12:57,767][52060] Updated weights for policy 0, policy_version 56840 (0.0008) [2023-10-08 02:12:57,856][52059] Updated weights for policy 1, policy_version 57562 (0.0007) [2023-10-08 02:12:58,148][52060] Updated weights for policy 0, policy_version 56850 (0.0008) [2023-10-08 02:12:58,507][52060] Updated weights for policy 0, policy_version 56860 (0.0008) [2023-10-08 02:13:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 117178368. Throughput: 0: 1712.8, 1: 1741.0. Samples: 29308448. Policy #0 lag: (min: 15.0, avg: 16.4, max: 41.0) [2023-10-08 02:13:01,211][50642] Avg episode reward: [(0, '22.530'), (1, '22.590')] [2023-10-08 02:13:01,805][52059] Updated weights for policy 1, policy_version 57572 (0.0008) [2023-10-08 02:13:02,170][52059] Updated weights for policy 1, policy_version 57582 (0.0008) [2023-10-08 02:13:02,540][52059] Updated weights for policy 1, policy_version 57592 (0.0008) [2023-10-08 02:13:02,613][52060] Updated weights for policy 0, policy_version 56870 (0.0008) [2023-10-08 02:13:02,976][52060] Updated weights for policy 0, policy_version 56880 (0.0009) [2023-10-08 02:13:03,340][52060] Updated weights for policy 0, policy_version 56890 (0.0008) [2023-10-08 02:13:06,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 117243904. Throughput: 0: 1681.4, 1: 1714.9. Samples: 29317876. Policy #0 lag: (min: 15.0, avg: 16.4, max: 41.0) [2023-10-08 02:13:06,211][50642] Avg episode reward: [(0, '18.980'), (1, '23.360')] [2023-10-08 02:13:06,318][52059] Updated weights for policy 1, policy_version 57602 (0.0007) [2023-10-08 02:13:06,674][52059] Updated weights for policy 1, policy_version 57612 (0.0010) [2023-10-08 02:13:07,042][52059] Updated weights for policy 1, policy_version 57622 (0.0008) [2023-10-08 02:13:07,406][52059] Updated weights for policy 1, policy_version 57632 (0.0008) [2023-10-08 02:13:07,414][52060] Updated weights for policy 0, policy_version 56900 (0.0008) [2023-10-08 02:13:07,787][52060] Updated weights for policy 0, policy_version 56910 (0.0008) [2023-10-08 02:13:08,153][52060] Updated weights for policy 0, policy_version 56920 (0.0009) [2023-10-08 02:13:11,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 117309440. Throughput: 0: 1706.1, 1: 1742.9. Samples: 29339296. Policy #0 lag: (min: 15.0, avg: 16.4, max: 41.0) [2023-10-08 02:13:11,211][50642] Avg episode reward: [(0, '19.890'), (1, '24.780')] [2023-10-08 02:13:11,414][52059] Updated weights for policy 1, policy_version 57642 (0.0008) [2023-10-08 02:13:11,783][52059] Updated weights for policy 1, policy_version 57652 (0.0008) [2023-10-08 02:13:12,042][52060] Updated weights for policy 0, policy_version 56930 (0.0007) [2023-10-08 02:13:12,144][52059] Updated weights for policy 1, policy_version 57662 (0.0008) [2023-10-08 02:13:12,407][52060] Updated weights for policy 0, policy_version 56940 (0.0007) [2023-10-08 02:13:12,769][52060] Updated weights for policy 0, policy_version 56950 (0.0007) [2023-10-08 02:13:13,142][52060] Updated weights for policy 0, policy_version 56960 (0.0009) [2023-10-08 02:13:16,099][52059] Updated weights for policy 1, policy_version 57672 (0.0007) [2023-10-08 02:13:16,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 117374976. Throughput: 0: 1723.4, 1: 1747.5. Samples: 29360718. Policy #0 lag: (min: 15.0, avg: 16.4, max: 41.0) [2023-10-08 02:13:16,211][50642] Avg episode reward: [(0, '21.860'), (1, '23.050')] [2023-10-08 02:13:16,463][52059] Updated weights for policy 1, policy_version 57682 (0.0008) [2023-10-08 02:13:16,840][52059] Updated weights for policy 1, policy_version 57692 (0.0007) [2023-10-08 02:13:17,028][52060] Updated weights for policy 0, policy_version 56970 (0.0008) [2023-10-08 02:13:17,397][52060] Updated weights for policy 0, policy_version 56980 (0.0007) [2023-10-08 02:13:17,763][52060] Updated weights for policy 0, policy_version 56990 (0.0008) [2023-10-08 02:13:20,768][52059] Updated weights for policy 1, policy_version 57702 (0.0008) [2023-10-08 02:13:21,138][52059] Updated weights for policy 1, policy_version 57712 (0.0008) [2023-10-08 02:13:21,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 117440512. Throughput: 0: 1700.9, 1: 1731.9. Samples: 29370078. Policy #0 lag: (min: 15.0, avg: 16.4, max: 41.0) [2023-10-08 02:13:21,211][50642] Avg episode reward: [(0, '20.970'), (1, '21.800')] [2023-10-08 02:13:21,502][52059] Updated weights for policy 1, policy_version 57722 (0.0008) [2023-10-08 02:13:21,911][52060] Updated weights for policy 0, policy_version 57000 (0.0008) [2023-10-08 02:13:22,292][52060] Updated weights for policy 0, policy_version 57010 (0.0007) [2023-10-08 02:13:22,664][52060] Updated weights for policy 0, policy_version 57020 (0.0008) [2023-10-08 02:13:25,452][52059] Updated weights for policy 1, policy_version 57732 (0.0009) [2023-10-08 02:13:25,821][52059] Updated weights for policy 1, policy_version 57742 (0.0007) [2023-10-08 02:13:26,180][52059] Updated weights for policy 1, policy_version 57752 (0.0007) [2023-10-08 02:13:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 117506048. Throughput: 0: 1724.0, 1: 1744.2. Samples: 29391150. Policy #0 lag: (min: 15.0, avg: 16.4, max: 41.0) [2023-10-08 02:13:26,212][50642] Avg episode reward: [(0, '20.110'), (1, '22.660')] [2023-10-08 02:13:26,701][52060] Updated weights for policy 0, policy_version 57030 (0.0008) [2023-10-08 02:13:27,059][52060] Updated weights for policy 0, policy_version 57040 (0.0010) [2023-10-08 02:13:27,428][52060] Updated weights for policy 0, policy_version 57050 (0.0009) [2023-10-08 02:13:30,133][52059] Updated weights for policy 1, policy_version 57762 (0.0008) [2023-10-08 02:13:30,560][52059] Updated weights for policy 1, policy_version 57772 (0.0007) [2023-10-08 02:13:30,918][52059] Updated weights for policy 1, policy_version 57782 (0.0009) [2023-10-08 02:13:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 117571584. Throughput: 0: 1723.5, 1: 1723.0. Samples: 29411426. Policy #0 lag: (min: 15.0, avg: 16.4, max: 41.0) [2023-10-08 02:13:31,211][50642] Avg episode reward: [(0, '21.260'), (1, '26.850')] [2023-10-08 02:13:31,293][52059] Updated weights for policy 1, policy_version 57792 (0.0007) [2023-10-08 02:13:31,439][52060] Updated weights for policy 0, policy_version 57060 (0.0008) [2023-10-08 02:13:31,828][52060] Updated weights for policy 0, policy_version 57070 (0.0007) [2023-10-08 02:13:32,205][52060] Updated weights for policy 0, policy_version 57080 (0.0007) [2023-10-08 02:13:35,417][52059] Updated weights for policy 1, policy_version 57802 (0.0008) [2023-10-08 02:13:35,778][52059] Updated weights for policy 1, policy_version 57812 (0.0010) [2023-10-08 02:13:36,106][52060] Updated weights for policy 0, policy_version 57090 (0.0008) [2023-10-08 02:13:36,145][52059] Updated weights for policy 1, policy_version 57822 (0.0009) [2023-10-08 02:13:36,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 117637120. Throughput: 0: 1708.6, 1: 1739.3. Samples: 29421342. Policy #0 lag: (min: 15.0, avg: 16.4, max: 41.0) [2023-10-08 02:13:36,211][50642] Avg episode reward: [(0, '21.200'), (1, '21.910')] [2023-10-08 02:13:36,471][52060] Updated weights for policy 0, policy_version 57100 (0.0007) [2023-10-08 02:13:36,836][52060] Updated weights for policy 0, policy_version 57110 (0.0007) [2023-10-08 02:13:37,215][52060] Updated weights for policy 0, policy_version 57120 (0.0007) [2023-10-08 02:13:40,050][52059] Updated weights for policy 1, policy_version 57832 (0.0009) [2023-10-08 02:13:40,429][52059] Updated weights for policy 1, policy_version 57842 (0.0008) [2023-10-08 02:13:40,789][52059] Updated weights for policy 1, policy_version 57852 (0.0008) [2023-10-08 02:13:41,079][52060] Updated weights for policy 0, policy_version 57130 (0.0008) [2023-10-08 02:13:41,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 117735424. Throughput: 0: 1722.9, 1: 1732.9. Samples: 29442600. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:13:41,211][50642] Avg episode reward: [(0, '20.520'), (1, '22.300')] [2023-10-08 02:13:41,449][52060] Updated weights for policy 0, policy_version 57140 (0.0009) [2023-10-08 02:13:41,820][52060] Updated weights for policy 0, policy_version 57150 (0.0009) [2023-10-08 02:13:44,525][52059] Updated weights for policy 1, policy_version 57862 (0.0008) [2023-10-08 02:13:44,898][52059] Updated weights for policy 1, policy_version 57872 (0.0007) [2023-10-08 02:13:45,264][52059] Updated weights for policy 1, policy_version 57882 (0.0008) [2023-10-08 02:13:45,714][52060] Updated weights for policy 0, policy_version 57160 (0.0009) [2023-10-08 02:13:46,084][52060] Updated weights for policy 0, policy_version 57170 (0.0008) [2023-10-08 02:13:46,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 117800960. Throughput: 0: 1712.8, 1: 1712.8. Samples: 29462598. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:13:46,211][50642] Avg episode reward: [(0, '21.200'), (1, '23.230')] [2023-10-08 02:13:46,456][52060] Updated weights for policy 0, policy_version 57180 (0.0008) [2023-10-08 02:13:49,154][52059] Updated weights for policy 1, policy_version 57892 (0.0008) [2023-10-08 02:13:49,523][52059] Updated weights for policy 1, policy_version 57902 (0.0008) [2023-10-08 02:13:49,885][52059] Updated weights for policy 1, policy_version 57912 (0.0009) [2023-10-08 02:13:50,382][52060] Updated weights for policy 0, policy_version 57190 (0.0010) [2023-10-08 02:13:50,748][52060] Updated weights for policy 0, policy_version 57200 (0.0010) [2023-10-08 02:13:51,109][52060] Updated weights for policy 0, policy_version 57210 (0.0008) [2023-10-08 02:13:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 117866496. Throughput: 0: 1722.3, 1: 1742.7. Samples: 29473800. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:13:51,211][50642] Avg episode reward: [(0, '21.620'), (1, '23.520')] [2023-10-08 02:13:53,858][52059] Updated weights for policy 1, policy_version 57922 (0.0010) [2023-10-08 02:13:54,230][52059] Updated weights for policy 1, policy_version 57932 (0.0010) [2023-10-08 02:13:54,598][52059] Updated weights for policy 1, policy_version 57942 (0.0008) [2023-10-08 02:13:54,963][52059] Updated weights for policy 1, policy_version 57952 (0.0007) [2023-10-08 02:13:55,033][52060] Updated weights for policy 0, policy_version 57220 (0.0008) [2023-10-08 02:13:55,406][52060] Updated weights for policy 0, policy_version 57230 (0.0011) [2023-10-08 02:13:55,768][52060] Updated weights for policy 0, policy_version 57240 (0.0008) [2023-10-08 02:13:56,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 117964800. Throughput: 0: 1723.6, 1: 1715.3. Samples: 29494044. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:13:56,211][50642] Avg episode reward: [(0, '20.230'), (1, '18.190')] [2023-10-08 02:13:58,773][52059] Updated weights for policy 1, policy_version 57962 (0.0010) [2023-10-08 02:13:59,139][52059] Updated weights for policy 1, policy_version 57972 (0.0008) [2023-10-08 02:13:59,499][52059] Updated weights for policy 1, policy_version 57982 (0.0007) [2023-10-08 02:13:59,861][52060] Updated weights for policy 0, policy_version 57250 (0.0008) [2023-10-08 02:14:00,229][52060] Updated weights for policy 0, policy_version 57260 (0.0007) [2023-10-08 02:14:00,597][52060] Updated weights for policy 0, policy_version 57270 (0.0008) [2023-10-08 02:14:00,963][52060] Updated weights for policy 0, policy_version 57280 (0.0008) [2023-10-08 02:14:01,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 118030336. Throughput: 0: 1690.3, 1: 1719.5. Samples: 29514158. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:14:01,211][50642] Avg episode reward: [(0, '20.230'), (1, '20.600')] [2023-10-08 02:14:03,313][52059] Updated weights for policy 1, policy_version 57992 (0.0009) [2023-10-08 02:14:03,673][52059] Updated weights for policy 1, policy_version 58002 (0.0008) [2023-10-08 02:14:04,040][52059] Updated weights for policy 1, policy_version 58012 (0.0008) [2023-10-08 02:14:04,995][52060] Updated weights for policy 0, policy_version 57290 (0.0009) [2023-10-08 02:14:05,370][52060] Updated weights for policy 0, policy_version 57300 (0.0008) [2023-10-08 02:14:05,732][52060] Updated weights for policy 0, policy_version 57310 (0.0009) [2023-10-08 02:14:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 118095872. Throughput: 0: 1716.9, 1: 1725.2. Samples: 29524974. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:14:06,211][50642] Avg episode reward: [(0, '20.050'), (1, '22.930')] [2023-10-08 02:14:07,961][52059] Updated weights for policy 1, policy_version 58022 (0.0008) [2023-10-08 02:14:08,333][52059] Updated weights for policy 1, policy_version 58032 (0.0007) [2023-10-08 02:14:08,698][52059] Updated weights for policy 1, policy_version 58042 (0.0008) [2023-10-08 02:14:09,629][52060] Updated weights for policy 0, policy_version 57320 (0.0010) [2023-10-08 02:14:10,003][52060] Updated weights for policy 0, policy_version 57330 (0.0009) [2023-10-08 02:14:10,367][52060] Updated weights for policy 0, policy_version 57340 (0.0008) [2023-10-08 02:14:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 118161408. Throughput: 0: 1714.7, 1: 1715.6. Samples: 29545514. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:14:11,211][50642] Avg episode reward: [(0, '21.070'), (1, '23.950')] [2023-10-08 02:14:12,759][52059] Updated weights for policy 1, policy_version 58052 (0.0009) [2023-10-08 02:14:13,117][52059] Updated weights for policy 1, policy_version 58062 (0.0008) [2023-10-08 02:14:13,487][52059] Updated weights for policy 1, policy_version 58072 (0.0008) [2023-10-08 02:14:14,355][52060] Updated weights for policy 0, policy_version 57350 (0.0008) [2023-10-08 02:14:14,720][52060] Updated weights for policy 0, policy_version 57360 (0.0007) [2023-10-08 02:14:15,081][52060] Updated weights for policy 0, policy_version 57370 (0.0007) [2023-10-08 02:14:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 118226944. Throughput: 0: 1698.0, 1: 1741.2. Samples: 29566190. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:14:16,211][50642] Avg episode reward: [(0, '22.170'), (1, '23.250')] [2023-10-08 02:14:17,356][52059] Updated weights for policy 1, policy_version 58082 (0.0008) [2023-10-08 02:14:17,748][52059] Updated weights for policy 1, policy_version 58092 (0.0009) [2023-10-08 02:14:18,117][52059] Updated weights for policy 1, policy_version 58102 (0.0008) [2023-10-08 02:14:18,481][52059] Updated weights for policy 1, policy_version 58112 (0.0008) [2023-10-08 02:14:19,142][52060] Updated weights for policy 0, policy_version 57380 (0.0007) [2023-10-08 02:14:19,520][52060] Updated weights for policy 0, policy_version 57390 (0.0008) [2023-10-08 02:14:19,899][52060] Updated weights for policy 0, policy_version 57400 (0.0008) [2023-10-08 02:14:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 118292480. Throughput: 0: 1727.8, 1: 1730.5. Samples: 29576966. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:14:21,211][50642] Avg episode reward: [(0, '21.520'), (1, '20.830')] [2023-10-08 02:14:22,394][52059] Updated weights for policy 1, policy_version 58122 (0.0011) [2023-10-08 02:14:22,746][52059] Updated weights for policy 1, policy_version 58132 (0.0010) [2023-10-08 02:14:23,118][52059] Updated weights for policy 1, policy_version 58142 (0.0010) [2023-10-08 02:14:23,848][52060] Updated weights for policy 0, policy_version 57410 (0.0008) [2023-10-08 02:14:24,216][52060] Updated weights for policy 0, policy_version 57420 (0.0011) [2023-10-08 02:14:24,583][52060] Updated weights for policy 0, policy_version 57430 (0.0010) [2023-10-08 02:14:24,953][52060] Updated weights for policy 0, policy_version 57440 (0.0007) [2023-10-08 02:14:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 118358016. Throughput: 0: 1697.6, 1: 1734.9. Samples: 29597064. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:14:26,211][50642] Avg episode reward: [(0, '20.740'), (1, '25.730')] [2023-10-08 02:14:27,146][52059] Updated weights for policy 1, policy_version 58152 (0.0008) [2023-10-08 02:14:27,501][52059] Updated weights for policy 1, policy_version 58162 (0.0007) [2023-10-08 02:14:27,872][52059] Updated weights for policy 1, policy_version 58172 (0.0007) [2023-10-08 02:14:29,064][52060] Updated weights for policy 0, policy_version 57450 (0.0010) [2023-10-08 02:14:29,430][52060] Updated weights for policy 0, policy_version 57460 (0.0009) [2023-10-08 02:14:29,792][52060] Updated weights for policy 0, policy_version 57470 (0.0009) [2023-10-08 02:14:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 118423552. Throughput: 0: 1700.3, 1: 1754.8. Samples: 29618080. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:14:31,211][50642] Avg episode reward: [(0, '20.650'), (1, '22.500')] [2023-10-08 02:14:31,723][52059] Updated weights for policy 1, policy_version 58182 (0.0008) [2023-10-08 02:14:32,086][52059] Updated weights for policy 1, policy_version 58192 (0.0007) [2023-10-08 02:14:32,445][52059] Updated weights for policy 1, policy_version 58202 (0.0007) [2023-10-08 02:14:33,697][52060] Updated weights for policy 0, policy_version 57480 (0.0010) [2023-10-08 02:14:34,070][52060] Updated weights for policy 0, policy_version 57490 (0.0009) [2023-10-08 02:14:34,440][52060] Updated weights for policy 0, policy_version 57500 (0.0010) [2023-10-08 02:14:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 118489088. Throughput: 0: 1713.3, 1: 1724.3. Samples: 29628490. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:14:36,211][50642] Avg episode reward: [(0, '21.260'), (1, '20.160')] [2023-10-08 02:14:36,278][52059] Updated weights for policy 1, policy_version 58212 (0.0009) [2023-10-08 02:14:36,649][52059] Updated weights for policy 1, policy_version 58222 (0.0007) [2023-10-08 02:14:37,008][52059] Updated weights for policy 1, policy_version 58232 (0.0009) [2023-10-08 02:14:38,352][52060] Updated weights for policy 0, policy_version 57510 (0.0011) [2023-10-08 02:14:38,714][52060] Updated weights for policy 0, policy_version 57520 (0.0009) [2023-10-08 02:14:39,081][52060] Updated weights for policy 0, policy_version 57530 (0.0008) [2023-10-08 02:14:40,849][52059] Updated weights for policy 1, policy_version 58242 (0.0009) [2023-10-08 02:14:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 118554624. Throughput: 0: 1689.7, 1: 1757.5. Samples: 29649166. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:14:41,211][52059] Updated weights for policy 1, policy_version 58252 (0.0008) [2023-10-08 02:14:41,211][50642] Avg episode reward: [(0, '19.560'), (1, '20.340')] [2023-10-08 02:14:41,579][52059] Updated weights for policy 1, policy_version 58262 (0.0008) [2023-10-08 02:14:41,942][52059] Updated weights for policy 1, policy_version 58272 (0.0012) [2023-10-08 02:14:43,064][52060] Updated weights for policy 0, policy_version 57540 (0.0010) [2023-10-08 02:14:43,438][52060] Updated weights for policy 0, policy_version 57550 (0.0010) [2023-10-08 02:14:43,807][52060] Updated weights for policy 0, policy_version 57560 (0.0010) [2023-10-08 02:14:45,647][52059] Updated weights for policy 1, policy_version 58282 (0.0008) [2023-10-08 02:14:46,011][52059] Updated weights for policy 1, policy_version 58292 (0.0011) [2023-10-08 02:14:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 118620160. Throughput: 0: 1720.6, 1: 1748.1. Samples: 29670252. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:14:46,211][50642] Avg episode reward: [(0, '20.020'), (1, '28.170')] [2023-10-08 02:14:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000057568_58949632.pth... [2023-10-08 02:14:46,257][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000055968_57311232.pth [2023-10-08 02:14:46,377][52059] Updated weights for policy 1, policy_version 58302 (0.0008) [2023-10-08 02:14:46,441][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000058304_59703296.pth... [2023-10-08 02:14:46,479][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000056672_58032128.pth [2023-10-08 02:14:46,485][51710] Saving new best policy, reward=28.170! [2023-10-08 02:14:47,861][52060] Updated weights for policy 0, policy_version 57570 (0.0010) [2023-10-08 02:14:48,228][52060] Updated weights for policy 0, policy_version 57580 (0.0008) [2023-10-08 02:14:48,597][52060] Updated weights for policy 0, policy_version 57590 (0.0007) [2023-10-08 02:14:48,961][52060] Updated weights for policy 0, policy_version 57600 (0.0008) [2023-10-08 02:14:50,295][52059] Updated weights for policy 1, policy_version 58312 (0.0007) [2023-10-08 02:14:50,648][52059] Updated weights for policy 1, policy_version 58322 (0.0009) [2023-10-08 02:14:51,012][52059] Updated weights for policy 1, policy_version 58332 (0.0007) [2023-10-08 02:14:51,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 118718464. Throughput: 0: 1700.3, 1: 1751.2. Samples: 29680292. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:14:51,211][50642] Avg episode reward: [(0, '22.250'), (1, '19.960')] [2023-10-08 02:14:52,802][52060] Updated weights for policy 0, policy_version 57610 (0.0008) [2023-10-08 02:14:53,167][52060] Updated weights for policy 0, policy_version 57620 (0.0009) [2023-10-08 02:14:53,538][52060] Updated weights for policy 0, policy_version 57630 (0.0009) [2023-10-08 02:14:55,007][52059] Updated weights for policy 1, policy_version 58342 (0.0008) [2023-10-08 02:14:55,367][52059] Updated weights for policy 1, policy_version 58352 (0.0007) [2023-10-08 02:14:55,729][52059] Updated weights for policy 1, policy_version 58362 (0.0009) [2023-10-08 02:14:56,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13884.8). Total num frames: 118784000. Throughput: 0: 1709.4, 1: 1763.6. Samples: 29701798. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 02:14:56,211][50642] Avg episode reward: [(0, '20.660'), (1, '20.350')] [2023-10-08 02:14:57,561][52060] Updated weights for policy 0, policy_version 57640 (0.0010) [2023-10-08 02:14:57,935][52060] Updated weights for policy 0, policy_version 57650 (0.0010) [2023-10-08 02:14:58,297][52060] Updated weights for policy 0, policy_version 57660 (0.0007) [2023-10-08 02:14:59,553][52059] Updated weights for policy 1, policy_version 58372 (0.0008) [2023-10-08 02:14:59,929][52059] Updated weights for policy 1, policy_version 58382 (0.0008) [2023-10-08 02:15:00,294][52059] Updated weights for policy 1, policy_version 58392 (0.0009) [2023-10-08 02:15:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 118849536. Throughput: 0: 1729.0, 1: 1730.0. Samples: 29721844. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 02:15:01,211][50642] Avg episode reward: [(0, '20.090'), (1, '21.970')] [2023-10-08 02:15:02,279][52060] Updated weights for policy 0, policy_version 57670 (0.0007) [2023-10-08 02:15:02,647][52060] Updated weights for policy 0, policy_version 57680 (0.0008) [2023-10-08 02:15:03,028][52060] Updated weights for policy 0, policy_version 57690 (0.0009) [2023-10-08 02:15:04,264][52059] Updated weights for policy 1, policy_version 58402 (0.0009) [2023-10-08 02:15:04,677][52059] Updated weights for policy 1, policy_version 58412 (0.0007) [2023-10-08 02:15:05,034][52059] Updated weights for policy 1, policy_version 58422 (0.0010) [2023-10-08 02:15:05,393][52059] Updated weights for policy 1, policy_version 58432 (0.0010) [2023-10-08 02:15:06,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 118915072. Throughput: 0: 1699.5, 1: 1757.9. Samples: 29732546. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 02:15:06,211][50642] Avg episode reward: [(0, '21.550'), (1, '25.820')] [2023-10-08 02:15:07,016][52060] Updated weights for policy 0, policy_version 57700 (0.0008) [2023-10-08 02:15:07,407][52060] Updated weights for policy 0, policy_version 57710 (0.0007) [2023-10-08 02:15:07,772][52060] Updated weights for policy 0, policy_version 57720 (0.0008) [2023-10-08 02:15:09,341][52059] Updated weights for policy 1, policy_version 58442 (0.0008) [2023-10-08 02:15:09,698][52059] Updated weights for policy 1, policy_version 58452 (0.0007) [2023-10-08 02:15:10,063][52059] Updated weights for policy 1, policy_version 58462 (0.0008) [2023-10-08 02:15:11,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 118980608. Throughput: 0: 1722.6, 1: 1735.7. Samples: 29752688. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 02:15:11,211][50642] Avg episode reward: [(0, '22.060'), (1, '18.240')] [2023-10-08 02:15:11,778][52060] Updated weights for policy 0, policy_version 57730 (0.0009) [2023-10-08 02:15:12,148][52060] Updated weights for policy 0, policy_version 57740 (0.0008) [2023-10-08 02:15:12,517][52060] Updated weights for policy 0, policy_version 57750 (0.0007) [2023-10-08 02:15:12,879][52060] Updated weights for policy 0, policy_version 57760 (0.0009) [2023-10-08 02:15:13,938][52059] Updated weights for policy 1, policy_version 58472 (0.0009) [2023-10-08 02:15:14,317][52059] Updated weights for policy 1, policy_version 58482 (0.0008) [2023-10-08 02:15:14,681][52059] Updated weights for policy 1, policy_version 58492 (0.0008) [2023-10-08 02:15:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 119046144. Throughput: 0: 1730.2, 1: 1730.1. Samples: 29773796. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 02:15:16,211][50642] Avg episode reward: [(0, '19.670'), (1, '20.560')] [2023-10-08 02:15:16,836][52060] Updated weights for policy 0, policy_version 57770 (0.0010) [2023-10-08 02:15:17,207][52060] Updated weights for policy 0, policy_version 57780 (0.0012) [2023-10-08 02:15:17,577][52060] Updated weights for policy 0, policy_version 57790 (0.0008) [2023-10-08 02:15:18,650][52059] Updated weights for policy 1, policy_version 58502 (0.0007) [2023-10-08 02:15:19,017][52059] Updated weights for policy 1, policy_version 58512 (0.0010) [2023-10-08 02:15:19,385][52059] Updated weights for policy 1, policy_version 58522 (0.0009) [2023-10-08 02:15:21,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 119111680. Throughput: 0: 1708.3, 1: 1748.8. Samples: 29784060. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 02:15:21,211][50642] Avg episode reward: [(0, '20.130'), (1, '23.580')] [2023-10-08 02:15:21,565][52060] Updated weights for policy 0, policy_version 57800 (0.0009) [2023-10-08 02:15:21,930][52060] Updated weights for policy 0, policy_version 57810 (0.0010) [2023-10-08 02:15:22,301][52060] Updated weights for policy 0, policy_version 57820 (0.0007) [2023-10-08 02:15:23,268][52059] Updated weights for policy 1, policy_version 58532 (0.0008) [2023-10-08 02:15:23,640][52059] Updated weights for policy 1, policy_version 58542 (0.0008) [2023-10-08 02:15:24,005][52059] Updated weights for policy 1, policy_version 58552 (0.0008) [2023-10-08 02:15:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 119177216. Throughput: 0: 1729.6, 1: 1722.3. Samples: 29804500. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 02:15:26,211][50642] Avg episode reward: [(0, '22.200'), (1, '24.950')] [2023-10-08 02:15:26,219][52060] Updated weights for policy 0, policy_version 57830 (0.0010) [2023-10-08 02:15:26,597][52060] Updated weights for policy 0, policy_version 57840 (0.0010) [2023-10-08 02:15:26,964][52060] Updated weights for policy 0, policy_version 57850 (0.0010) [2023-10-08 02:15:27,881][52059] Updated weights for policy 1, policy_version 58562 (0.0010) [2023-10-08 02:15:28,247][52059] Updated weights for policy 1, policy_version 58572 (0.0008) [2023-10-08 02:15:28,611][52059] Updated weights for policy 1, policy_version 58582 (0.0009) [2023-10-08 02:15:28,971][52059] Updated weights for policy 1, policy_version 58592 (0.0009) [2023-10-08 02:15:30,996][52060] Updated weights for policy 0, policy_version 57860 (0.0009) [2023-10-08 02:15:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 119242752. Throughput: 0: 1730.3, 1: 1728.3. Samples: 29825888. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 02:15:31,211][50642] Avg episode reward: [(0, '20.260'), (1, '19.190')] [2023-10-08 02:15:31,356][52060] Updated weights for policy 0, policy_version 57870 (0.0007) [2023-10-08 02:15:31,716][52060] Updated weights for policy 0, policy_version 57880 (0.0010) [2023-10-08 02:15:32,896][52059] Updated weights for policy 1, policy_version 58602 (0.0007) [2023-10-08 02:15:33,264][52059] Updated weights for policy 1, policy_version 58612 (0.0008) [2023-10-08 02:15:33,630][52059] Updated weights for policy 1, policy_version 58622 (0.0009) [2023-10-08 02:15:35,645][52060] Updated weights for policy 0, policy_version 57890 (0.0010) [2023-10-08 02:15:36,015][52060] Updated weights for policy 0, policy_version 57900 (0.0008) [2023-10-08 02:15:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 119308288. Throughput: 0: 1727.7, 1: 1719.1. Samples: 29835400. Policy #0 lag: (min: 14.0, avg: 14.7, max: 32.0) [2023-10-08 02:15:36,211][50642] Avg episode reward: [(0, '20.660'), (1, '20.970')] [2023-10-08 02:15:36,374][52060] Updated weights for policy 0, policy_version 57910 (0.0007) [2023-10-08 02:15:36,748][52060] Updated weights for policy 0, policy_version 57920 (0.0007) [2023-10-08 02:15:37,533][52059] Updated weights for policy 1, policy_version 58632 (0.0009) [2023-10-08 02:15:37,897][52059] Updated weights for policy 1, policy_version 58642 (0.0007) [2023-10-08 02:15:38,260][52059] Updated weights for policy 1, policy_version 58652 (0.0008) [2023-10-08 02:15:40,601][52060] Updated weights for policy 0, policy_version 57930 (0.0011) [2023-10-08 02:15:40,970][52060] Updated weights for policy 0, policy_version 57940 (0.0010) [2023-10-08 02:15:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 119373824. Throughput: 0: 1728.6, 1: 1720.7. Samples: 29857016. Policy #0 lag: (min: 14.0, avg: 14.7, max: 32.0) [2023-10-08 02:15:41,211][50642] Avg episode reward: [(0, '20.570'), (1, '25.540')] [2023-10-08 02:15:41,326][52060] Updated weights for policy 0, policy_version 57950 (0.0009) [2023-10-08 02:15:42,161][52059] Updated weights for policy 1, policy_version 58662 (0.0008) [2023-10-08 02:15:42,532][52059] Updated weights for policy 1, policy_version 58672 (0.0009) [2023-10-08 02:15:42,902][52059] Updated weights for policy 1, policy_version 58682 (0.0008) [2023-10-08 02:15:45,223][52060] Updated weights for policy 0, policy_version 57960 (0.0008) [2023-10-08 02:15:45,584][52060] Updated weights for policy 0, policy_version 57970 (0.0008) [2023-10-08 02:15:45,955][52060] Updated weights for policy 0, policy_version 57980 (0.0008) [2023-10-08 02:15:46,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 119472128. Throughput: 0: 1707.0, 1: 1745.2. Samples: 29877192. Policy #0 lag: (min: 14.0, avg: 14.7, max: 32.0) [2023-10-08 02:15:46,211][50642] Avg episode reward: [(0, '21.290'), (1, '21.080')] [2023-10-08 02:15:46,954][52059] Updated weights for policy 1, policy_version 58692 (0.0008) [2023-10-08 02:15:47,322][52059] Updated weights for policy 1, policy_version 58702 (0.0009) [2023-10-08 02:15:47,682][52059] Updated weights for policy 1, policy_version 58712 (0.0009) [2023-10-08 02:15:49,954][52060] Updated weights for policy 0, policy_version 57990 (0.0009) [2023-10-08 02:15:50,328][52060] Updated weights for policy 0, policy_version 58000 (0.0009) [2023-10-08 02:15:50,688][52060] Updated weights for policy 0, policy_version 58010 (0.0011) [2023-10-08 02:15:51,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 119537664. Throughput: 0: 1727.9, 1: 1712.9. Samples: 29887384. Policy #0 lag: (min: 14.0, avg: 14.7, max: 32.0) [2023-10-08 02:15:51,211][50642] Avg episode reward: [(0, '20.020'), (1, '20.480')] [2023-10-08 02:15:51,632][52059] Updated weights for policy 1, policy_version 58722 (0.0008) [2023-10-08 02:15:52,019][52059] Updated weights for policy 1, policy_version 58732 (0.0010) [2023-10-08 02:15:52,400][52059] Updated weights for policy 1, policy_version 58742 (0.0010) [2023-10-08 02:15:52,763][52059] Updated weights for policy 1, policy_version 58752 (0.0009) [2023-10-08 02:15:54,693][52060] Updated weights for policy 0, policy_version 58020 (0.0008) [2023-10-08 02:15:55,086][52060] Updated weights for policy 0, policy_version 58030 (0.0007) [2023-10-08 02:15:55,447][52060] Updated weights for policy 0, policy_version 58040 (0.0009) [2023-10-08 02:15:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 119603200. Throughput: 0: 1724.0, 1: 1734.0. Samples: 29908298. Policy #0 lag: (min: 14.0, avg: 14.7, max: 32.0) [2023-10-08 02:15:56,211][50642] Avg episode reward: [(0, '19.950'), (1, '22.670')] [2023-10-08 02:15:56,701][52059] Updated weights for policy 1, policy_version 58762 (0.0008) [2023-10-08 02:15:57,062][52059] Updated weights for policy 1, policy_version 58772 (0.0007) [2023-10-08 02:15:57,438][52059] Updated weights for policy 1, policy_version 58782 (0.0009) [2023-10-08 02:15:59,271][52060] Updated weights for policy 0, policy_version 58050 (0.0010) [2023-10-08 02:15:59,630][52060] Updated weights for policy 0, policy_version 58060 (0.0008) [2023-10-08 02:15:59,992][52060] Updated weights for policy 0, policy_version 58070 (0.0007) [2023-10-08 02:16:00,359][52060] Updated weights for policy 0, policy_version 58080 (0.0009) [2023-10-08 02:16:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 119668736. Throughput: 0: 1699.8, 1: 1740.2. Samples: 29928596. Policy #0 lag: (min: 14.0, avg: 14.7, max: 32.0) [2023-10-08 02:16:01,211][50642] Avg episode reward: [(0, '21.570'), (1, '24.280')] [2023-10-08 02:16:01,392][52059] Updated weights for policy 1, policy_version 58792 (0.0007) [2023-10-08 02:16:01,759][52059] Updated weights for policy 1, policy_version 58802 (0.0008) [2023-10-08 02:16:02,125][52059] Updated weights for policy 1, policy_version 58812 (0.0009) [2023-10-08 02:16:04,321][52060] Updated weights for policy 0, policy_version 58090 (0.0009) [2023-10-08 02:16:04,686][52060] Updated weights for policy 0, policy_version 58100 (0.0009) [2023-10-08 02:16:05,064][52060] Updated weights for policy 0, policy_version 58110 (0.0008) [2023-10-08 02:16:06,093][52059] Updated weights for policy 1, policy_version 58822 (0.0008) [2023-10-08 02:16:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 119734272. Throughput: 0: 1732.4, 1: 1715.3. Samples: 29939206. Policy #0 lag: (min: 14.0, avg: 14.7, max: 32.0) [2023-10-08 02:16:06,211][50642] Avg episode reward: [(0, '20.510'), (1, '20.060')] [2023-10-08 02:16:06,454][52059] Updated weights for policy 1, policy_version 58832 (0.0009) [2023-10-08 02:16:06,815][52059] Updated weights for policy 1, policy_version 58842 (0.0009) [2023-10-08 02:16:09,022][52060] Updated weights for policy 0, policy_version 58120 (0.0009) [2023-10-08 02:16:09,392][52060] Updated weights for policy 0, policy_version 58130 (0.0009) [2023-10-08 02:16:09,756][52060] Updated weights for policy 0, policy_version 58140 (0.0008) [2023-10-08 02:16:10,613][52059] Updated weights for policy 1, policy_version 58852 (0.0009) [2023-10-08 02:16:10,968][52059] Updated weights for policy 1, policy_version 58862 (0.0011) [2023-10-08 02:16:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 119799808. Throughput: 0: 1706.3, 1: 1739.0. Samples: 29959538. Policy #0 lag: (min: 16.0, avg: 35.8, max: 48.0) [2023-10-08 02:16:11,211][50642] Avg episode reward: [(0, '20.230'), (1, '19.970')] [2023-10-08 02:16:11,327][52059] Updated weights for policy 1, policy_version 58872 (0.0011) [2023-10-08 02:16:13,732][52060] Updated weights for policy 0, policy_version 58150 (0.0008) [2023-10-08 02:16:14,111][52060] Updated weights for policy 0, policy_version 58160 (0.0008) [2023-10-08 02:16:14,481][52060] Updated weights for policy 0, policy_version 58170 (0.0008) [2023-10-08 02:16:15,469][52059] Updated weights for policy 1, policy_version 58882 (0.0008) [2023-10-08 02:16:15,833][52059] Updated weights for policy 1, policy_version 58892 (0.0009) [2023-10-08 02:16:16,193][52059] Updated weights for policy 1, policy_version 58902 (0.0007) [2023-10-08 02:16:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 119865344. Throughput: 0: 1703.4, 1: 1727.7. Samples: 29980290. Policy #0 lag: (min: 16.0, avg: 35.8, max: 48.0) [2023-10-08 02:16:16,211][50642] Avg episode reward: [(0, '21.610'), (1, '21.530')] [2023-10-08 02:16:16,557][52059] Updated weights for policy 1, policy_version 58912 (0.0007) [2023-10-08 02:16:18,432][52060] Updated weights for policy 0, policy_version 58180 (0.0009) [2023-10-08 02:16:18,808][52060] Updated weights for policy 0, policy_version 58190 (0.0007) [2023-10-08 02:16:19,177][52060] Updated weights for policy 0, policy_version 58200 (0.0010) [2023-10-08 02:16:20,274][52059] Updated weights for policy 1, policy_version 58922 (0.0010) [2023-10-08 02:16:20,634][52059] Updated weights for policy 1, policy_version 58932 (0.0010) [2023-10-08 02:16:21,003][52059] Updated weights for policy 1, policy_version 58942 (0.0008) [2023-10-08 02:16:21,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 119963648. Throughput: 0: 1720.0, 1: 1742.0. Samples: 29991190. Policy #0 lag: (min: 16.0, avg: 35.8, max: 48.0) [2023-10-08 02:16:21,211][50642] Avg episode reward: [(0, '20.650'), (1, '22.360')] [2023-10-08 02:16:23,164][52060] Updated weights for policy 0, policy_version 58210 (0.0009) [2023-10-08 02:16:23,533][52060] Updated weights for policy 0, policy_version 58220 (0.0007) [2023-10-08 02:16:23,901][52060] Updated weights for policy 0, policy_version 58230 (0.0009) [2023-10-08 02:16:24,267][52060] Updated weights for policy 0, policy_version 58240 (0.0010) [2023-10-08 02:16:24,920][52059] Updated weights for policy 1, policy_version 58952 (0.0009) [2023-10-08 02:16:25,277][52059] Updated weights for policy 1, policy_version 58962 (0.0009) [2023-10-08 02:16:25,643][52059] Updated weights for policy 1, policy_version 58972 (0.0007) [2023-10-08 02:16:26,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 120029184. Throughput: 0: 1697.2, 1: 1732.5. Samples: 30011350. Policy #0 lag: (min: 16.0, avg: 35.8, max: 48.0) [2023-10-08 02:16:26,211][50642] Avg episode reward: [(0, '18.890'), (1, '22.720')] [2023-10-08 02:16:28,346][52060] Updated weights for policy 0, policy_version 58250 (0.0010) [2023-10-08 02:16:28,714][52060] Updated weights for policy 0, policy_version 58260 (0.0010) [2023-10-08 02:16:29,072][52060] Updated weights for policy 0, policy_version 58270 (0.0009) [2023-10-08 02:16:29,581][52059] Updated weights for policy 1, policy_version 58982 (0.0009) [2023-10-08 02:16:29,948][52059] Updated weights for policy 1, policy_version 58992 (0.0009) [2023-10-08 02:16:30,318][52059] Updated weights for policy 1, policy_version 59002 (0.0008) [2023-10-08 02:16:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 120094720. Throughput: 0: 1722.0, 1: 1714.0. Samples: 30031816. Policy #0 lag: (min: 16.0, avg: 35.8, max: 48.0) [2023-10-08 02:16:31,211][50642] Avg episode reward: [(0, '18.560'), (1, '22.640')] [2023-10-08 02:16:32,966][52060] Updated weights for policy 0, policy_version 58280 (0.0007) [2023-10-08 02:16:33,340][52060] Updated weights for policy 0, policy_version 58290 (0.0009) [2023-10-08 02:16:33,715][52060] Updated weights for policy 0, policy_version 58300 (0.0012) [2023-10-08 02:16:34,334][52059] Updated weights for policy 1, policy_version 59012 (0.0008) [2023-10-08 02:16:34,703][52059] Updated weights for policy 1, policy_version 59022 (0.0007) [2023-10-08 02:16:35,066][52059] Updated weights for policy 1, policy_version 59032 (0.0007) [2023-10-08 02:16:36,210][50642] Fps is (10 sec: 13106.7, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 120160256. Throughput: 0: 1705.4, 1: 1743.0. Samples: 30042564. Policy #0 lag: (min: 16.0, avg: 35.8, max: 48.0) [2023-10-08 02:16:36,211][50642] Avg episode reward: [(0, '20.440'), (1, '23.330')] [2023-10-08 02:16:37,746][52060] Updated weights for policy 0, policy_version 58310 (0.0007) [2023-10-08 02:16:38,108][52060] Updated weights for policy 0, policy_version 58320 (0.0009) [2023-10-08 02:16:38,484][52060] Updated weights for policy 0, policy_version 58330 (0.0009) [2023-10-08 02:16:39,054][52059] Updated weights for policy 1, policy_version 59042 (0.0007) [2023-10-08 02:16:39,464][52059] Updated weights for policy 1, policy_version 59052 (0.0008) [2023-10-08 02:16:39,829][52059] Updated weights for policy 1, policy_version 59062 (0.0010) [2023-10-08 02:16:40,189][52059] Updated weights for policy 1, policy_version 59072 (0.0008) [2023-10-08 02:16:41,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 120225792. Throughput: 0: 1709.3, 1: 1720.9. Samples: 30062656. Policy #0 lag: (min: 16.0, avg: 35.8, max: 48.0) [2023-10-08 02:16:41,211][50642] Avg episode reward: [(0, '19.860'), (1, '22.780')] [2023-10-08 02:16:42,447][52060] Updated weights for policy 0, policy_version 58340 (0.0008) [2023-10-08 02:16:42,834][52060] Updated weights for policy 0, policy_version 58350 (0.0008) [2023-10-08 02:16:43,195][52060] Updated weights for policy 0, policy_version 58360 (0.0008) [2023-10-08 02:16:44,020][52059] Updated weights for policy 1, policy_version 59082 (0.0008) [2023-10-08 02:16:44,378][52059] Updated weights for policy 1, policy_version 59092 (0.0007) [2023-10-08 02:16:44,755][52059] Updated weights for policy 1, policy_version 59102 (0.0008) [2023-10-08 02:16:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 120291328. Throughput: 0: 1728.9, 1: 1714.2. Samples: 30083538. Policy #0 lag: (min: 16.0, avg: 35.8, max: 48.0) [2023-10-08 02:16:46,211][50642] Avg episode reward: [(0, '18.790'), (1, '22.040')] [2023-10-08 02:16:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000059104_60522496.pth... [2023-10-08 02:16:46,221][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000058368_59768832.pth... [2023-10-08 02:16:46,267][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000057472_58851328.pth [2023-10-08 02:16:46,268][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000056768_58130432.pth [2023-10-08 02:16:47,285][52060] Updated weights for policy 0, policy_version 58370 (0.0008) [2023-10-08 02:16:47,648][52060] Updated weights for policy 0, policy_version 58380 (0.0009) [2023-10-08 02:16:48,018][52060] Updated weights for policy 0, policy_version 58390 (0.0010) [2023-10-08 02:16:48,389][52060] Updated weights for policy 0, policy_version 58400 (0.0010) [2023-10-08 02:16:48,711][52059] Updated weights for policy 1, policy_version 59112 (0.0008) [2023-10-08 02:16:49,080][52059] Updated weights for policy 1, policy_version 59122 (0.0008) [2023-10-08 02:16:49,446][52059] Updated weights for policy 1, policy_version 59132 (0.0007) [2023-10-08 02:16:51,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 120356864. Throughput: 0: 1694.0, 1: 1741.4. Samples: 30093798. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:16:51,211][50642] Avg episode reward: [(0, '18.390'), (1, '23.280')] [2023-10-08 02:16:52,368][52060] Updated weights for policy 0, policy_version 58410 (0.0009) [2023-10-08 02:16:52,744][52060] Updated weights for policy 0, policy_version 58420 (0.0008) [2023-10-08 02:16:53,114][52060] Updated weights for policy 0, policy_version 58430 (0.0007) [2023-10-08 02:16:53,454][52059] Updated weights for policy 1, policy_version 59142 (0.0008) [2023-10-08 02:16:53,817][52059] Updated weights for policy 1, policy_version 59152 (0.0011) [2023-10-08 02:16:54,174][52059] Updated weights for policy 1, policy_version 59162 (0.0011) [2023-10-08 02:16:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 120422400. Throughput: 0: 1723.1, 1: 1716.3. Samples: 30114312. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:16:56,211][50642] Avg episode reward: [(0, '19.950'), (1, '24.750')] [2023-10-08 02:16:57,039][52060] Updated weights for policy 0, policy_version 58440 (0.0010) [2023-10-08 02:16:57,406][52060] Updated weights for policy 0, policy_version 58450 (0.0012) [2023-10-08 02:16:57,775][52060] Updated weights for policy 0, policy_version 58460 (0.0009) [2023-10-08 02:16:58,121][52059] Updated weights for policy 1, policy_version 59172 (0.0009) [2023-10-08 02:16:58,486][52059] Updated weights for policy 1, policy_version 59182 (0.0008) [2023-10-08 02:16:58,858][52059] Updated weights for policy 1, policy_version 59192 (0.0009) [2023-10-08 02:17:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 120487936. Throughput: 0: 1729.6, 1: 1724.8. Samples: 30135742. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:17:01,211][50642] Avg episode reward: [(0, '19.310'), (1, '22.420')] [2023-10-08 02:17:01,521][52060] Updated weights for policy 0, policy_version 58470 (0.0009) [2023-10-08 02:17:01,879][52060] Updated weights for policy 0, policy_version 58480 (0.0008) [2023-10-08 02:17:02,254][52060] Updated weights for policy 0, policy_version 58490 (0.0009) [2023-10-08 02:17:02,851][52059] Updated weights for policy 1, policy_version 59202 (0.0008) [2023-10-08 02:17:03,213][52059] Updated weights for policy 1, policy_version 59212 (0.0008) [2023-10-08 02:17:03,586][52059] Updated weights for policy 1, policy_version 59222 (0.0010) [2023-10-08 02:17:03,947][52059] Updated weights for policy 1, policy_version 59232 (0.0008) [2023-10-08 02:17:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 120553472. Throughput: 0: 1710.4, 1: 1717.0. Samples: 30145426. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:17:06,211][50642] Avg episode reward: [(0, '18.130'), (1, '22.670')] [2023-10-08 02:17:06,219][52060] Updated weights for policy 0, policy_version 58500 (0.0007) [2023-10-08 02:17:06,580][52060] Updated weights for policy 0, policy_version 58510 (0.0008) [2023-10-08 02:17:06,947][52060] Updated weights for policy 0, policy_version 58520 (0.0007) [2023-10-08 02:17:07,771][52059] Updated weights for policy 1, policy_version 59242 (0.0008) [2023-10-08 02:17:08,133][52059] Updated weights for policy 1, policy_version 59252 (0.0009) [2023-10-08 02:17:08,505][52059] Updated weights for policy 1, policy_version 59262 (0.0010) [2023-10-08 02:17:10,838][52060] Updated weights for policy 0, policy_version 58530 (0.0008) [2023-10-08 02:17:11,208][52060] Updated weights for policy 0, policy_version 58540 (0.0009) [2023-10-08 02:17:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 120619008. Throughput: 0: 1731.9, 1: 1720.6. Samples: 30166714. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:17:11,211][50642] Avg episode reward: [(0, '19.130'), (1, '24.400')] [2023-10-08 02:17:11,584][52060] Updated weights for policy 0, policy_version 58550 (0.0009) [2023-10-08 02:17:11,947][52060] Updated weights for policy 0, policy_version 58560 (0.0008) [2023-10-08 02:17:12,499][52059] Updated weights for policy 1, policy_version 59272 (0.0008) [2023-10-08 02:17:12,874][52059] Updated weights for policy 1, policy_version 59282 (0.0008) [2023-10-08 02:17:13,226][52059] Updated weights for policy 1, policy_version 59292 (0.0009) [2023-10-08 02:17:15,955][52060] Updated weights for policy 0, policy_version 58570 (0.0008) [2023-10-08 02:17:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 120684544. Throughput: 0: 1721.1, 1: 1746.7. Samples: 30187868. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:17:16,211][50642] Avg episode reward: [(0, '20.970'), (1, '25.250')] [2023-10-08 02:17:16,319][52060] Updated weights for policy 0, policy_version 58580 (0.0009) [2023-10-08 02:17:16,685][52060] Updated weights for policy 0, policy_version 58590 (0.0009) [2023-10-08 02:17:16,961][52059] Updated weights for policy 1, policy_version 59302 (0.0007) [2023-10-08 02:17:17,335][52059] Updated weights for policy 1, policy_version 59312 (0.0007) [2023-10-08 02:17:17,697][52059] Updated weights for policy 1, policy_version 59322 (0.0008) [2023-10-08 02:17:20,568][52060] Updated weights for policy 0, policy_version 58600 (0.0009) [2023-10-08 02:17:20,948][52060] Updated weights for policy 0, policy_version 58610 (0.0010) [2023-10-08 02:17:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 120750080. Throughput: 0: 1726.5, 1: 1719.7. Samples: 30197642. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:17:21,211][50642] Avg episode reward: [(0, '20.040'), (1, '23.260')] [2023-10-08 02:17:21,304][52060] Updated weights for policy 0, policy_version 58620 (0.0010) [2023-10-08 02:17:21,539][52059] Updated weights for policy 1, policy_version 59332 (0.0009) [2023-10-08 02:17:21,902][52059] Updated weights for policy 1, policy_version 59342 (0.0009) [2023-10-08 02:17:22,275][52059] Updated weights for policy 1, policy_version 59352 (0.0008) [2023-10-08 02:17:25,287][52060] Updated weights for policy 0, policy_version 58630 (0.0008) [2023-10-08 02:17:25,652][52060] Updated weights for policy 0, policy_version 58640 (0.0009) [2023-10-08 02:17:26,022][52060] Updated weights for policy 0, policy_version 58650 (0.0007) [2023-10-08 02:17:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 120815616. Throughput: 0: 1730.3, 1: 1744.5. Samples: 30219022. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:17:26,211][50642] Avg episode reward: [(0, '19.760'), (1, '20.890')] [2023-10-08 02:17:26,242][52059] Updated weights for policy 1, policy_version 59362 (0.0008) [2023-10-08 02:17:26,662][52059] Updated weights for policy 1, policy_version 59372 (0.0008) [2023-10-08 02:17:27,027][52059] Updated weights for policy 1, policy_version 59382 (0.0008) [2023-10-08 02:17:27,385][52059] Updated weights for policy 1, policy_version 59392 (0.0007) [2023-10-08 02:17:30,214][52060] Updated weights for policy 0, policy_version 58660 (0.0009) [2023-10-08 02:17:30,599][52060] Updated weights for policy 0, policy_version 58670 (0.0009) [2023-10-08 02:17:30,953][52060] Updated weights for policy 0, policy_version 58680 (0.0009) [2023-10-08 02:17:31,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 120881152. Throughput: 0: 1712.9, 1: 1742.3. Samples: 30239024. Policy #0 lag: (min: 26.0, avg: 34.7, max: 58.0) [2023-10-08 02:17:31,211][50642] Avg episode reward: [(0, '21.540'), (1, '22.830')] [2023-10-08 02:17:31,433][52059] Updated weights for policy 1, policy_version 59402 (0.0008) [2023-10-08 02:17:31,797][52059] Updated weights for policy 1, policy_version 59412 (0.0008) [2023-10-08 02:17:32,161][52059] Updated weights for policy 1, policy_version 59422 (0.0010) [2023-10-08 02:17:34,836][52060] Updated weights for policy 0, policy_version 58690 (0.0008) [2023-10-08 02:17:35,206][52060] Updated weights for policy 0, policy_version 58700 (0.0009) [2023-10-08 02:17:35,569][52060] Updated weights for policy 0, policy_version 58710 (0.0008) [2023-10-08 02:17:35,936][52060] Updated weights for policy 0, policy_version 58720 (0.0008) [2023-10-08 02:17:36,204][52059] Updated weights for policy 1, policy_version 59432 (0.0008) [2023-10-08 02:17:36,211][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 120979456. Throughput: 0: 1738.2, 1: 1717.0. Samples: 30249282. Policy #0 lag: (min: 26.0, avg: 34.7, max: 58.0) [2023-10-08 02:17:36,212][50642] Avg episode reward: [(0, '22.630'), (1, '25.410')] [2023-10-08 02:17:36,563][52059] Updated weights for policy 1, policy_version 59442 (0.0007) [2023-10-08 02:17:36,926][52059] Updated weights for policy 1, policy_version 59452 (0.0008) [2023-10-08 02:17:39,970][52060] Updated weights for policy 0, policy_version 58730 (0.0008) [2023-10-08 02:17:40,339][52060] Updated weights for policy 0, policy_version 58740 (0.0007) [2023-10-08 02:17:40,708][52060] Updated weights for policy 0, policy_version 58750 (0.0008) [2023-10-08 02:17:40,795][52059] Updated weights for policy 1, policy_version 59462 (0.0008) [2023-10-08 02:17:41,163][52059] Updated weights for policy 1, policy_version 59472 (0.0007) [2023-10-08 02:17:41,210][50642] Fps is (10 sec: 16384.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 121044992. Throughput: 0: 1726.9, 1: 1745.9. Samples: 30270588. Policy #0 lag: (min: 26.0, avg: 34.7, max: 58.0) [2023-10-08 02:17:41,211][50642] Avg episode reward: [(0, '19.660'), (1, '21.950')] [2023-10-08 02:17:41,537][52059] Updated weights for policy 1, policy_version 59482 (0.0008) [2023-10-08 02:17:44,719][52060] Updated weights for policy 0, policy_version 58760 (0.0010) [2023-10-08 02:17:45,085][52060] Updated weights for policy 0, policy_version 58770 (0.0009) [2023-10-08 02:17:45,412][52059] Updated weights for policy 1, policy_version 59492 (0.0009) [2023-10-08 02:17:45,451][52060] Updated weights for policy 0, policy_version 58780 (0.0008) [2023-10-08 02:17:45,777][52059] Updated weights for policy 1, policy_version 59502 (0.0007) [2023-10-08 02:17:46,142][52059] Updated weights for policy 1, policy_version 59512 (0.0009) [2023-10-08 02:17:46,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 121110528. Throughput: 0: 1698.3, 1: 1737.0. Samples: 30290332. Policy #0 lag: (min: 26.0, avg: 34.7, max: 58.0) [2023-10-08 02:17:46,211][50642] Avg episode reward: [(0, '18.850'), (1, '22.590')] [2023-10-08 02:17:49,352][52060] Updated weights for policy 0, policy_version 58790 (0.0007) [2023-10-08 02:17:49,716][52060] Updated weights for policy 0, policy_version 58800 (0.0007) [2023-10-08 02:17:50,077][52060] Updated weights for policy 0, policy_version 58810 (0.0007) [2023-10-08 02:17:50,138][52059] Updated weights for policy 1, policy_version 59522 (0.0007) [2023-10-08 02:17:50,506][52059] Updated weights for policy 1, policy_version 59532 (0.0008) [2023-10-08 02:17:50,864][52059] Updated weights for policy 1, policy_version 59542 (0.0009) [2023-10-08 02:17:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 121176064. Throughput: 0: 1729.2, 1: 1740.0. Samples: 30301540. Policy #0 lag: (min: 26.0, avg: 34.7, max: 58.0) [2023-10-08 02:17:51,211][50642] Avg episode reward: [(0, '19.690'), (1, '21.250')] [2023-10-08 02:17:51,224][52059] Updated weights for policy 1, policy_version 59552 (0.0009) [2023-10-08 02:17:54,069][52060] Updated weights for policy 0, policy_version 58820 (0.0010) [2023-10-08 02:17:54,438][52060] Updated weights for policy 0, policy_version 58830 (0.0009) [2023-10-08 02:17:54,802][52060] Updated weights for policy 0, policy_version 58840 (0.0007) [2023-10-08 02:17:55,219][52059] Updated weights for policy 1, policy_version 59562 (0.0009) [2023-10-08 02:17:55,585][52059] Updated weights for policy 1, policy_version 59572 (0.0010) [2023-10-08 02:17:55,948][52059] Updated weights for policy 1, policy_version 59582 (0.0010) [2023-10-08 02:17:56,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 121274368. Throughput: 0: 1704.6, 1: 1743.4. Samples: 30321872. Policy #0 lag: (min: 26.0, avg: 34.7, max: 58.0) [2023-10-08 02:17:56,211][50642] Avg episode reward: [(0, '18.800'), (1, '22.000')] [2023-10-08 02:17:58,556][52060] Updated weights for policy 0, policy_version 58850 (0.0010) [2023-10-08 02:17:58,920][52060] Updated weights for policy 0, policy_version 58860 (0.0007) [2023-10-08 02:17:59,293][52060] Updated weights for policy 0, policy_version 58870 (0.0010) [2023-10-08 02:17:59,665][52060] Updated weights for policy 0, policy_version 58880 (0.0007) [2023-10-08 02:17:59,692][52059] Updated weights for policy 1, policy_version 59592 (0.0008) [2023-10-08 02:18:00,062][52059] Updated weights for policy 1, policy_version 59602 (0.0007) [2023-10-08 02:18:00,426][52059] Updated weights for policy 1, policy_version 59612 (0.0007) [2023-10-08 02:18:01,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 121339904. Throughput: 0: 1708.5, 1: 1713.3. Samples: 30341846. Policy #0 lag: (min: 26.0, avg: 34.7, max: 58.0) [2023-10-08 02:18:01,211][50642] Avg episode reward: [(0, '17.650'), (1, '20.700')] [2023-10-08 02:18:03,733][52060] Updated weights for policy 0, policy_version 58890 (0.0010) [2023-10-08 02:18:04,109][52060] Updated weights for policy 0, policy_version 58900 (0.0010) [2023-10-08 02:18:04,328][52059] Updated weights for policy 1, policy_version 59622 (0.0008) [2023-10-08 02:18:04,472][52060] Updated weights for policy 0, policy_version 58910 (0.0008) [2023-10-08 02:18:04,686][52059] Updated weights for policy 1, policy_version 59632 (0.0007) [2023-10-08 02:18:05,052][52059] Updated weights for policy 1, policy_version 59642 (0.0007) [2023-10-08 02:18:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 121405440. Throughput: 0: 1716.6, 1: 1741.4. Samples: 30353254. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:18:06,211][50642] Avg episode reward: [(0, '18.270'), (1, '20.290')] [2023-10-08 02:18:08,368][52060] Updated weights for policy 0, policy_version 58920 (0.0009) [2023-10-08 02:18:08,741][52060] Updated weights for policy 0, policy_version 58930 (0.0008) [2023-10-08 02:18:08,863][52059] Updated weights for policy 1, policy_version 59652 (0.0007) [2023-10-08 02:18:09,094][52060] Updated weights for policy 0, policy_version 58940 (0.0009) [2023-10-08 02:18:09,225][52059] Updated weights for policy 1, policy_version 59662 (0.0008) [2023-10-08 02:18:09,586][52059] Updated weights for policy 1, policy_version 59672 (0.0009) [2023-10-08 02:18:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 121470976. Throughput: 0: 1697.5, 1: 1715.9. Samples: 30372624. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:18:11,211][50642] Avg episode reward: [(0, '19.660'), (1, '20.460')] [2023-10-08 02:18:13,066][52060] Updated weights for policy 0, policy_version 58950 (0.0008) [2023-10-08 02:18:13,421][52060] Updated weights for policy 0, policy_version 58960 (0.0008) [2023-10-08 02:18:13,749][52059] Updated weights for policy 1, policy_version 59682 (0.0009) [2023-10-08 02:18:13,790][52060] Updated weights for policy 0, policy_version 58970 (0.0008) [2023-10-08 02:18:14,168][52059] Updated weights for policy 1, policy_version 59692 (0.0009) [2023-10-08 02:18:14,541][52059] Updated weights for policy 1, policy_version 59702 (0.0010) [2023-10-08 02:18:14,916][52059] Updated weights for policy 1, policy_version 59712 (0.0009) [2023-10-08 02:18:16,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 121536512. Throughput: 0: 1716.1, 1: 1717.5. Samples: 30393538. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:18:16,211][50642] Avg episode reward: [(0, '18.000'), (1, '23.720')] [2023-10-08 02:18:17,867][52060] Updated weights for policy 0, policy_version 58980 (0.0008) [2023-10-08 02:18:18,252][52060] Updated weights for policy 0, policy_version 58990 (0.0009) [2023-10-08 02:18:18,621][52060] Updated weights for policy 0, policy_version 59000 (0.0008) [2023-10-08 02:18:18,854][52059] Updated weights for policy 1, policy_version 59722 (0.0007) [2023-10-08 02:18:19,220][52059] Updated weights for policy 1, policy_version 59732 (0.0008) [2023-10-08 02:18:19,574][52059] Updated weights for policy 1, policy_version 59742 (0.0007) [2023-10-08 02:18:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13884.8). Total num frames: 121602048. Throughput: 0: 1694.6, 1: 1742.5. Samples: 30403954. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:18:21,211][50642] Avg episode reward: [(0, '18.980'), (1, '20.530')] [2023-10-08 02:18:22,611][52060] Updated weights for policy 0, policy_version 59010 (0.0008) [2023-10-08 02:18:22,980][52060] Updated weights for policy 0, policy_version 59020 (0.0011) [2023-10-08 02:18:23,344][52060] Updated weights for policy 0, policy_version 59030 (0.0010) [2023-10-08 02:18:23,454][52059] Updated weights for policy 1, policy_version 59752 (0.0008) [2023-10-08 02:18:23,716][52060] Updated weights for policy 0, policy_version 59040 (0.0007) [2023-10-08 02:18:23,822][52059] Updated weights for policy 1, policy_version 59762 (0.0010) [2023-10-08 02:18:24,188][52059] Updated weights for policy 1, policy_version 59772 (0.0008) [2023-10-08 02:18:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 121667584. Throughput: 0: 1694.6, 1: 1723.0. Samples: 30424378. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:18:26,211][50642] Avg episode reward: [(0, '19.840'), (1, '21.530')] [2023-10-08 02:18:27,571][52060] Updated weights for policy 0, policy_version 59050 (0.0008) [2023-10-08 02:18:27,876][52059] Updated weights for policy 1, policy_version 59782 (0.0008) [2023-10-08 02:18:27,943][52060] Updated weights for policy 0, policy_version 59060 (0.0007) [2023-10-08 02:18:28,243][52059] Updated weights for policy 1, policy_version 59792 (0.0007) [2023-10-08 02:18:28,301][52060] Updated weights for policy 0, policy_version 59070 (0.0008) [2023-10-08 02:18:28,609][52059] Updated weights for policy 1, policy_version 59802 (0.0007) [2023-10-08 02:18:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 121733120. Throughput: 0: 1721.4, 1: 1739.3. Samples: 30446062. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:18:31,211][50642] Avg episode reward: [(0, '18.360'), (1, '22.980')] [2023-10-08 02:18:32,328][52060] Updated weights for policy 0, policy_version 59080 (0.0007) [2023-10-08 02:18:32,509][52059] Updated weights for policy 1, policy_version 59812 (0.0007) [2023-10-08 02:18:32,687][52060] Updated weights for policy 0, policy_version 59090 (0.0009) [2023-10-08 02:18:32,865][52059] Updated weights for policy 1, policy_version 59822 (0.0008) [2023-10-08 02:18:33,051][52060] Updated weights for policy 0, policy_version 59100 (0.0007) [2023-10-08 02:18:33,233][52059] Updated weights for policy 1, policy_version 59832 (0.0009) [2023-10-08 02:18:36,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 121798656. Throughput: 0: 1690.4, 1: 1727.2. Samples: 30455330. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:18:36,211][50642] Avg episode reward: [(0, '19.320'), (1, '23.690')] [2023-10-08 02:18:37,167][52059] Updated weights for policy 1, policy_version 59842 (0.0010) [2023-10-08 02:18:37,199][52060] Updated weights for policy 0, policy_version 59110 (0.0007) [2023-10-08 02:18:37,539][52059] Updated weights for policy 1, policy_version 59852 (0.0007) [2023-10-08 02:18:37,570][52060] Updated weights for policy 0, policy_version 59120 (0.0007) [2023-10-08 02:18:37,903][52059] Updated weights for policy 1, policy_version 59862 (0.0007) [2023-10-08 02:18:37,931][52060] Updated weights for policy 0, policy_version 59130 (0.0007) [2023-10-08 02:18:38,263][52059] Updated weights for policy 1, policy_version 59872 (0.0007) [2023-10-08 02:18:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 121864192. Throughput: 0: 1706.6, 1: 1728.5. Samples: 30476452. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:18:41,211][50642] Avg episode reward: [(0, '19.630'), (1, '19.960')] [2023-10-08 02:18:42,125][52060] Updated weights for policy 0, policy_version 59140 (0.0007) [2023-10-08 02:18:42,181][52059] Updated weights for policy 1, policy_version 59882 (0.0007) [2023-10-08 02:18:42,496][52060] Updated weights for policy 0, policy_version 59150 (0.0007) [2023-10-08 02:18:42,549][52059] Updated weights for policy 1, policy_version 59892 (0.0008) [2023-10-08 02:18:42,872][52060] Updated weights for policy 0, policy_version 59160 (0.0008) [2023-10-08 02:18:42,919][52059] Updated weights for policy 1, policy_version 59902 (0.0009) [2023-10-08 02:18:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 121929728. Throughput: 0: 1703.7, 1: 1761.5. Samples: 30497780. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:18:46,211][50642] Avg episode reward: [(0, '16.870'), (1, '23.520')] [2023-10-08 02:18:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000059168_60588032.pth... [2023-10-08 02:18:46,220][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000059904_61341696.pth... [2023-10-08 02:18:46,249][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000057568_58949632.pth [2023-10-08 02:18:46,264][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000058304_59703296.pth [2023-10-08 02:18:46,701][52059] Updated weights for policy 1, policy_version 59912 (0.0007) [2023-10-08 02:18:46,885][52060] Updated weights for policy 0, policy_version 59170 (0.0009) [2023-10-08 02:18:47,064][52059] Updated weights for policy 1, policy_version 59922 (0.0008) [2023-10-08 02:18:47,251][52060] Updated weights for policy 0, policy_version 59180 (0.0008) [2023-10-08 02:18:47,427][52059] Updated weights for policy 1, policy_version 59932 (0.0007) [2023-10-08 02:18:47,625][52060] Updated weights for policy 0, policy_version 59190 (0.0008) [2023-10-08 02:18:47,989][52060] Updated weights for policy 0, policy_version 59200 (0.0008) [2023-10-08 02:18:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 121995264. Throughput: 0: 1686.4, 1: 1733.2. Samples: 30507138. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:18:51,211][50642] Avg episode reward: [(0, '18.270'), (1, '20.560')] [2023-10-08 02:18:51,397][52059] Updated weights for policy 1, policy_version 59942 (0.0008) [2023-10-08 02:18:51,762][52059] Updated weights for policy 1, policy_version 59952 (0.0007) [2023-10-08 02:18:52,088][52060] Updated weights for policy 0, policy_version 59210 (0.0008) [2023-10-08 02:18:52,127][52059] Updated weights for policy 1, policy_version 59962 (0.0007) [2023-10-08 02:18:52,462][52060] Updated weights for policy 0, policy_version 59220 (0.0009) [2023-10-08 02:18:52,825][52060] Updated weights for policy 0, policy_version 59230 (0.0009) [2023-10-08 02:18:55,997][52059] Updated weights for policy 1, policy_version 59972 (0.0007) [2023-10-08 02:18:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 122060800. Throughput: 0: 1700.0, 1: 1757.6. Samples: 30528214. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:18:56,211][50642] Avg episode reward: [(0, '21.030'), (1, '20.530')] [2023-10-08 02:18:56,367][52059] Updated weights for policy 1, policy_version 59982 (0.0008) [2023-10-08 02:18:56,721][52059] Updated weights for policy 1, policy_version 59992 (0.0008) [2023-10-08 02:18:56,770][52060] Updated weights for policy 0, policy_version 59240 (0.0007) [2023-10-08 02:18:57,135][52060] Updated weights for policy 0, policy_version 59250 (0.0008) [2023-10-08 02:18:57,499][52060] Updated weights for policy 0, policy_version 59260 (0.0009) [2023-10-08 02:19:00,716][52059] Updated weights for policy 1, policy_version 60002 (0.0008) [2023-10-08 02:19:01,117][52059] Updated weights for policy 1, policy_version 60012 (0.0008) [2023-10-08 02:19:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 122126336. Throughput: 0: 1697.2, 1: 1763.0. Samples: 30549244. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:19:01,211][50642] Avg episode reward: [(0, '17.050'), (1, '21.950')] [2023-10-08 02:19:01,482][52059] Updated weights for policy 1, policy_version 60022 (0.0007) [2023-10-08 02:19:01,642][52060] Updated weights for policy 0, policy_version 59270 (0.0008) [2023-10-08 02:19:01,847][52059] Updated weights for policy 1, policy_version 60032 (0.0007) [2023-10-08 02:19:02,009][52060] Updated weights for policy 0, policy_version 59280 (0.0010) [2023-10-08 02:19:02,379][52060] Updated weights for policy 0, policy_version 59290 (0.0011) [2023-10-08 02:19:05,682][52059] Updated weights for policy 1, policy_version 60042 (0.0007) [2023-10-08 02:19:06,041][52059] Updated weights for policy 1, policy_version 60052 (0.0010) [2023-10-08 02:19:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 122191872. Throughput: 0: 1700.4, 1: 1747.7. Samples: 30559116. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:19:06,211][50642] Avg episode reward: [(0, '17.880'), (1, '22.600')] [2023-10-08 02:19:06,397][52059] Updated weights for policy 1, policy_version 60062 (0.0009) [2023-10-08 02:19:06,442][52060] Updated weights for policy 0, policy_version 59300 (0.0009) [2023-10-08 02:19:06,833][52060] Updated weights for policy 0, policy_version 59310 (0.0007) [2023-10-08 02:19:07,194][52060] Updated weights for policy 0, policy_version 59320 (0.0007) [2023-10-08 02:19:10,228][52059] Updated weights for policy 1, policy_version 60072 (0.0008) [2023-10-08 02:19:10,588][52059] Updated weights for policy 1, policy_version 60082 (0.0009) [2023-10-08 02:19:10,958][52059] Updated weights for policy 1, policy_version 60092 (0.0008) [2023-10-08 02:19:11,116][52060] Updated weights for policy 0, policy_version 59330 (0.0007) [2023-10-08 02:19:11,210][50642] Fps is (10 sec: 16384.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 122290176. Throughput: 0: 1702.5, 1: 1764.4. Samples: 30580384. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:19:11,211][50642] Avg episode reward: [(0, '21.410'), (1, '20.910')] [2023-10-08 02:19:11,485][52060] Updated weights for policy 0, policy_version 59340 (0.0008) [2023-10-08 02:19:11,848][52060] Updated weights for policy 0, policy_version 59350 (0.0007) [2023-10-08 02:19:12,218][52060] Updated weights for policy 0, policy_version 59360 (0.0009) [2023-10-08 02:19:14,947][52059] Updated weights for policy 1, policy_version 60102 (0.0007) [2023-10-08 02:19:15,308][52059] Updated weights for policy 1, policy_version 60112 (0.0008) [2023-10-08 02:19:15,670][52059] Updated weights for policy 1, policy_version 60122 (0.0009) [2023-10-08 02:19:16,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 122355712. Throughput: 0: 1699.5, 1: 1729.1. Samples: 30600348. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:19:16,211][50642] Avg episode reward: [(0, '18.280'), (1, '20.890')] [2023-10-08 02:19:16,226][52060] Updated weights for policy 0, policy_version 59370 (0.0008) [2023-10-08 02:19:16,589][52060] Updated weights for policy 0, policy_version 59380 (0.0007) [2023-10-08 02:19:16,954][52060] Updated weights for policy 0, policy_version 59390 (0.0010) [2023-10-08 02:19:19,698][52059] Updated weights for policy 1, policy_version 60132 (0.0008) [2023-10-08 02:19:20,067][52059] Updated weights for policy 1, policy_version 60142 (0.0008) [2023-10-08 02:19:20,437][52059] Updated weights for policy 1, policy_version 60152 (0.0009) [2023-10-08 02:19:20,788][52060] Updated weights for policy 0, policy_version 59400 (0.0008) [2023-10-08 02:19:21,157][52060] Updated weights for policy 0, policy_version 59410 (0.0007) [2023-10-08 02:19:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 122421248. Throughput: 0: 1702.0, 1: 1758.0. Samples: 30611032. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:19:21,211][50642] Avg episode reward: [(0, '16.900'), (1, '22.560')] [2023-10-08 02:19:21,528][52060] Updated weights for policy 0, policy_version 59420 (0.0007) [2023-10-08 02:19:24,332][52059] Updated weights for policy 1, policy_version 60162 (0.0008) [2023-10-08 02:19:24,700][52059] Updated weights for policy 1, policy_version 60172 (0.0008) [2023-10-08 02:19:25,067][52059] Updated weights for policy 1, policy_version 60182 (0.0007) [2023-10-08 02:19:25,430][52059] Updated weights for policy 1, policy_version 60192 (0.0008) [2023-10-08 02:19:25,500][52060] Updated weights for policy 0, policy_version 59430 (0.0009) [2023-10-08 02:19:25,876][52060] Updated weights for policy 0, policy_version 59440 (0.0008) [2023-10-08 02:19:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 122486784. Throughput: 0: 1711.0, 1: 1744.7. Samples: 30631958. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 02:19:26,211][50642] Avg episode reward: [(0, '20.450'), (1, '20.630')] [2023-10-08 02:19:26,244][52060] Updated weights for policy 0, policy_version 59450 (0.0010) [2023-10-08 02:19:29,281][52059] Updated weights for policy 1, policy_version 60202 (0.0008) [2023-10-08 02:19:29,648][52059] Updated weights for policy 1, policy_version 60212 (0.0008) [2023-10-08 02:19:30,004][52059] Updated weights for policy 1, policy_version 60222 (0.0008) [2023-10-08 02:19:30,189][52060] Updated weights for policy 0, policy_version 59460 (0.0008) [2023-10-08 02:19:30,558][52060] Updated weights for policy 0, policy_version 59470 (0.0008) [2023-10-08 02:19:30,929][52060] Updated weights for policy 0, policy_version 59480 (0.0010) [2023-10-08 02:19:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 122552320. Throughput: 0: 1697.5, 1: 1722.6. Samples: 30651684. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 02:19:31,211][50642] Avg episode reward: [(0, '21.800'), (1, '20.190')] [2023-10-08 02:19:33,926][52059] Updated weights for policy 1, policy_version 60232 (0.0008) [2023-10-08 02:19:34,283][52059] Updated weights for policy 1, policy_version 60242 (0.0008) [2023-10-08 02:19:34,647][52059] Updated weights for policy 1, policy_version 60252 (0.0010) [2023-10-08 02:19:34,917][52060] Updated weights for policy 0, policy_version 59490 (0.0008) [2023-10-08 02:19:35,287][52060] Updated weights for policy 0, policy_version 59500 (0.0008) [2023-10-08 02:19:35,653][52060] Updated weights for policy 0, policy_version 59510 (0.0009) [2023-10-08 02:19:36,030][52060] Updated weights for policy 0, policy_version 59520 (0.0007) [2023-10-08 02:19:36,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 122650624. Throughput: 0: 1713.0, 1: 1746.1. Samples: 30662800. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 02:19:36,211][50642] Avg episode reward: [(0, '15.910'), (1, '19.440')] [2023-10-08 02:19:38,591][52059] Updated weights for policy 1, policy_version 60262 (0.0009) [2023-10-08 02:19:38,964][52059] Updated weights for policy 1, policy_version 60272 (0.0008) [2023-10-08 02:19:39,326][52059] Updated weights for policy 1, policy_version 60282 (0.0008) [2023-10-08 02:19:39,842][52060] Updated weights for policy 0, policy_version 59530 (0.0008) [2023-10-08 02:19:40,210][52060] Updated weights for policy 0, policy_version 59540 (0.0008) [2023-10-08 02:19:40,574][52060] Updated weights for policy 0, policy_version 59550 (0.0010) [2023-10-08 02:19:41,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 122716160. Throughput: 0: 1714.2, 1: 1721.1. Samples: 30682802. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 02:19:41,211][50642] Avg episode reward: [(0, '19.020'), (1, '23.460')] [2023-10-08 02:19:43,195][52059] Updated weights for policy 1, policy_version 60292 (0.0007) [2023-10-08 02:19:43,558][52059] Updated weights for policy 1, policy_version 60302 (0.0009) [2023-10-08 02:19:43,930][52059] Updated weights for policy 1, policy_version 60312 (0.0008) [2023-10-08 02:19:44,620][52060] Updated weights for policy 0, policy_version 59560 (0.0007) [2023-10-08 02:19:44,991][52060] Updated weights for policy 0, policy_version 59570 (0.0008) [2023-10-08 02:19:45,353][52060] Updated weights for policy 0, policy_version 59580 (0.0007) [2023-10-08 02:19:46,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 122781696. Throughput: 0: 1695.6, 1: 1726.5. Samples: 30703236. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 02:19:46,211][50642] Avg episode reward: [(0, '20.960'), (1, '22.710')] [2023-10-08 02:19:47,958][52059] Updated weights for policy 1, policy_version 60322 (0.0009) [2023-10-08 02:19:48,363][52059] Updated weights for policy 1, policy_version 60332 (0.0010) [2023-10-08 02:19:48,731][52059] Updated weights for policy 1, policy_version 60342 (0.0007) [2023-10-08 02:19:49,096][52059] Updated weights for policy 1, policy_version 60352 (0.0008) [2023-10-08 02:19:49,512][52060] Updated weights for policy 0, policy_version 59590 (0.0010) [2023-10-08 02:19:49,887][52060] Updated weights for policy 0, policy_version 59600 (0.0009) [2023-10-08 02:19:50,256][52060] Updated weights for policy 0, policy_version 59610 (0.0010) [2023-10-08 02:19:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 122847232. Throughput: 0: 1720.2, 1: 1720.1. Samples: 30713930. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 02:19:51,211][50642] Avg episode reward: [(0, '15.970'), (1, '21.460')] [2023-10-08 02:19:53,073][52059] Updated weights for policy 1, policy_version 60362 (0.0009) [2023-10-08 02:19:53,436][52059] Updated weights for policy 1, policy_version 60372 (0.0008) [2023-10-08 02:19:53,812][52059] Updated weights for policy 1, policy_version 60382 (0.0008) [2023-10-08 02:19:54,333][52060] Updated weights for policy 0, policy_version 59620 (0.0008) [2023-10-08 02:19:54,699][52060] Updated weights for policy 0, policy_version 59630 (0.0009) [2023-10-08 02:19:55,077][52060] Updated weights for policy 0, policy_version 59640 (0.0008) [2023-10-08 02:19:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 122912768. Throughput: 0: 1706.7, 1: 1708.1. Samples: 30734052. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 02:19:56,211][50642] Avg episode reward: [(0, '18.010'), (1, '20.140')] [2023-10-08 02:19:57,622][52059] Updated weights for policy 1, policy_version 60392 (0.0010) [2023-10-08 02:19:57,984][52059] Updated weights for policy 1, policy_version 60402 (0.0009) [2023-10-08 02:19:58,349][52059] Updated weights for policy 1, policy_version 60412 (0.0008) [2023-10-08 02:19:58,967][52060] Updated weights for policy 0, policy_version 59650 (0.0008) [2023-10-08 02:19:59,332][52060] Updated weights for policy 0, policy_version 59660 (0.0007) [2023-10-08 02:19:59,703][52060] Updated weights for policy 0, policy_version 59670 (0.0007) [2023-10-08 02:20:00,070][52060] Updated weights for policy 0, policy_version 59680 (0.0009) [2023-10-08 02:20:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 122978304. Throughput: 0: 1694.8, 1: 1740.5. Samples: 30754940. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:20:01,211][50642] Avg episode reward: [(0, '22.310'), (1, '22.000')] [2023-10-08 02:20:02,361][52059] Updated weights for policy 1, policy_version 60422 (0.0009) [2023-10-08 02:20:02,711][52059] Updated weights for policy 1, policy_version 60432 (0.0008) [2023-10-08 02:20:03,080][52059] Updated weights for policy 1, policy_version 60442 (0.0010) [2023-10-08 02:20:03,840][52060] Updated weights for policy 0, policy_version 59690 (0.0009) [2023-10-08 02:20:04,209][52060] Updated weights for policy 0, policy_version 59700 (0.0008) [2023-10-08 02:20:04,577][52060] Updated weights for policy 0, policy_version 59710 (0.0007) [2023-10-08 02:20:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 123043840. Throughput: 0: 1714.8, 1: 1711.1. Samples: 30765194. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:20:06,211][50642] Avg episode reward: [(0, '16.930'), (1, '24.480')] [2023-10-08 02:20:07,066][52059] Updated weights for policy 1, policy_version 60452 (0.0008) [2023-10-08 02:20:07,426][52059] Updated weights for policy 1, policy_version 60462 (0.0009) [2023-10-08 02:20:07,791][52059] Updated weights for policy 1, policy_version 60472 (0.0007) [2023-10-08 02:20:08,611][52060] Updated weights for policy 0, policy_version 59720 (0.0009) [2023-10-08 02:20:08,981][52060] Updated weights for policy 0, policy_version 59730 (0.0009) [2023-10-08 02:20:09,352][52060] Updated weights for policy 0, policy_version 59740 (0.0007) [2023-10-08 02:20:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 123109376. Throughput: 0: 1692.7, 1: 1725.7. Samples: 30785786. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:20:11,211][50642] Avg episode reward: [(0, '18.110'), (1, '23.680')] [2023-10-08 02:20:11,594][52059] Updated weights for policy 1, policy_version 60482 (0.0009) [2023-10-08 02:20:11,955][52059] Updated weights for policy 1, policy_version 60492 (0.0007) [2023-10-08 02:20:12,331][52059] Updated weights for policy 1, policy_version 60502 (0.0009) [2023-10-08 02:20:12,696][52059] Updated weights for policy 1, policy_version 60512 (0.0008) [2023-10-08 02:20:13,403][52060] Updated weights for policy 0, policy_version 59750 (0.0011) [2023-10-08 02:20:13,761][52060] Updated weights for policy 0, policy_version 59760 (0.0008) [2023-10-08 02:20:14,146][52060] Updated weights for policy 0, policy_version 59770 (0.0009) [2023-10-08 02:20:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 123174912. Throughput: 0: 1708.7, 1: 1743.6. Samples: 30807034. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:20:16,211][50642] Avg episode reward: [(0, '21.770'), (1, '21.030')] [2023-10-08 02:20:16,552][52059] Updated weights for policy 1, policy_version 60522 (0.0009) [2023-10-08 02:20:16,913][52059] Updated weights for policy 1, policy_version 60532 (0.0009) [2023-10-08 02:20:17,279][52059] Updated weights for policy 1, policy_version 60542 (0.0008) [2023-10-08 02:20:18,180][52060] Updated weights for policy 0, policy_version 59780 (0.0008) [2023-10-08 02:20:18,555][52060] Updated weights for policy 0, policy_version 59790 (0.0008) [2023-10-08 02:20:18,921][52060] Updated weights for policy 0, policy_version 59800 (0.0009) [2023-10-08 02:20:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 123240448. Throughput: 0: 1703.1, 1: 1719.6. Samples: 30816820. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:20:21,211][50642] Avg episode reward: [(0, '17.480'), (1, '21.700')] [2023-10-08 02:20:21,309][52059] Updated weights for policy 1, policy_version 60552 (0.0010) [2023-10-08 02:20:21,675][52059] Updated weights for policy 1, policy_version 60562 (0.0009) [2023-10-08 02:20:22,041][52059] Updated weights for policy 1, policy_version 60572 (0.0007) [2023-10-08 02:20:22,912][52060] Updated weights for policy 0, policy_version 59810 (0.0008) [2023-10-08 02:20:23,281][52060] Updated weights for policy 0, policy_version 59820 (0.0009) [2023-10-08 02:20:23,650][52060] Updated weights for policy 0, policy_version 59830 (0.0008) [2023-10-08 02:20:24,016][52060] Updated weights for policy 0, policy_version 59840 (0.0009) [2023-10-08 02:20:25,865][52059] Updated weights for policy 1, policy_version 60582 (0.0008) [2023-10-08 02:20:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 123305984. Throughput: 0: 1698.3, 1: 1749.5. Samples: 30837954. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:20:26,211][50642] Avg episode reward: [(0, '18.640'), (1, '22.440')] [2023-10-08 02:20:26,232][52059] Updated weights for policy 1, policy_version 60592 (0.0007) [2023-10-08 02:20:26,595][52059] Updated weights for policy 1, policy_version 60602 (0.0010) [2023-10-08 02:20:28,105][52060] Updated weights for policy 0, policy_version 59850 (0.0009) [2023-10-08 02:20:28,473][52060] Updated weights for policy 0, policy_version 59860 (0.0009) [2023-10-08 02:20:28,830][52060] Updated weights for policy 0, policy_version 59870 (0.0007) [2023-10-08 02:20:30,549][52059] Updated weights for policy 1, policy_version 60612 (0.0010) [2023-10-08 02:20:30,919][52059] Updated weights for policy 1, policy_version 60622 (0.0009) [2023-10-08 02:20:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 123371520. Throughput: 0: 1720.5, 1: 1737.9. Samples: 30858862. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:20:31,211][50642] Avg episode reward: [(0, '20.480'), (1, '20.710')] [2023-10-08 02:20:31,280][52059] Updated weights for policy 1, policy_version 60632 (0.0008) [2023-10-08 02:20:32,757][52060] Updated weights for policy 0, policy_version 59880 (0.0008) [2023-10-08 02:20:33,118][52060] Updated weights for policy 0, policy_version 59890 (0.0007) [2023-10-08 02:20:33,485][52060] Updated weights for policy 0, policy_version 59900 (0.0007) [2023-10-08 02:20:35,128][52059] Updated weights for policy 1, policy_version 60642 (0.0009) [2023-10-08 02:20:35,505][52059] Updated weights for policy 1, policy_version 60652 (0.0008) [2023-10-08 02:20:35,870][52059] Updated weights for policy 1, policy_version 60662 (0.0007) [2023-10-08 02:20:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 123437056. Throughput: 0: 1691.5, 1: 1750.3. Samples: 30868810. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:20:36,211][50642] Avg episode reward: [(0, '21.230'), (1, '19.980')] [2023-10-08 02:20:36,240][52059] Updated weights for policy 1, policy_version 60672 (0.0009) [2023-10-08 02:20:37,320][52060] Updated weights for policy 0, policy_version 59910 (0.0008) [2023-10-08 02:20:37,692][52060] Updated weights for policy 0, policy_version 59920 (0.0007) [2023-10-08 02:20:38,065][52060] Updated weights for policy 0, policy_version 59930 (0.0008) [2023-10-08 02:20:40,119][52059] Updated weights for policy 1, policy_version 60682 (0.0007) [2023-10-08 02:20:40,495][52059] Updated weights for policy 1, policy_version 60692 (0.0007) [2023-10-08 02:20:40,855][52059] Updated weights for policy 1, policy_version 60702 (0.0008) [2023-10-08 02:20:41,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 123535360. Throughput: 0: 1708.6, 1: 1755.3. Samples: 30889928. Policy #0 lag: (min: 31.0, avg: 37.6, max: 63.0) [2023-10-08 02:20:41,211][50642] Avg episode reward: [(0, '17.920'), (1, '22.480')] [2023-10-08 02:20:42,164][52060] Updated weights for policy 0, policy_version 59940 (0.0008) [2023-10-08 02:20:42,547][52060] Updated weights for policy 0, policy_version 59950 (0.0010) [2023-10-08 02:20:42,920][52060] Updated weights for policy 0, policy_version 59960 (0.0008) [2023-10-08 02:20:44,718][52059] Updated weights for policy 1, policy_version 60712 (0.0007) [2023-10-08 02:20:45,091][52059] Updated weights for policy 1, policy_version 60722 (0.0010) [2023-10-08 02:20:45,451][52059] Updated weights for policy 1, policy_version 60732 (0.0009) [2023-10-08 02:20:46,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 123600896. Throughput: 0: 1721.2, 1: 1727.8. Samples: 30910146. Policy #0 lag: (min: 31.0, avg: 37.6, max: 63.0) [2023-10-08 02:20:46,211][50642] Avg episode reward: [(0, '19.210'), (1, '23.700')] [2023-10-08 02:20:46,221][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000059968_61407232.pth... [2023-10-08 02:20:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000060736_62193664.pth... [2023-10-08 02:20:46,257][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000059104_60522496.pth [2023-10-08 02:20:46,260][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000058368_59768832.pth [2023-10-08 02:20:46,860][52060] Updated weights for policy 0, policy_version 59970 (0.0008) [2023-10-08 02:20:47,222][52060] Updated weights for policy 0, policy_version 59980 (0.0007) [2023-10-08 02:20:47,586][52060] Updated weights for policy 0, policy_version 59990 (0.0007) [2023-10-08 02:20:47,960][52060] Updated weights for policy 0, policy_version 60000 (0.0008) [2023-10-08 02:20:49,472][52059] Updated weights for policy 1, policy_version 60742 (0.0008) [2023-10-08 02:20:49,835][52059] Updated weights for policy 1, policy_version 60752 (0.0011) [2023-10-08 02:20:50,204][52059] Updated weights for policy 1, policy_version 60762 (0.0011) [2023-10-08 02:20:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 123666432. Throughput: 0: 1699.2, 1: 1760.1. Samples: 30920860. Policy #0 lag: (min: 31.0, avg: 37.6, max: 63.0) [2023-10-08 02:20:51,211][50642] Avg episode reward: [(0, '20.410'), (1, '18.640')] [2023-10-08 02:20:51,873][52060] Updated weights for policy 0, policy_version 60010 (0.0009) [2023-10-08 02:20:52,248][52060] Updated weights for policy 0, policy_version 60020 (0.0007) [2023-10-08 02:20:52,608][52060] Updated weights for policy 0, policy_version 60030 (0.0008) [2023-10-08 02:20:54,227][52059] Updated weights for policy 1, policy_version 60772 (0.0011) [2023-10-08 02:20:54,600][52059] Updated weights for policy 1, policy_version 60782 (0.0008) [2023-10-08 02:20:54,968][52059] Updated weights for policy 1, policy_version 60792 (0.0007) [2023-10-08 02:20:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 123731968. Throughput: 0: 1722.3, 1: 1737.4. Samples: 30941474. Policy #0 lag: (min: 31.0, avg: 37.6, max: 63.0) [2023-10-08 02:20:56,211][50642] Avg episode reward: [(0, '17.860'), (1, '21.060')] [2023-10-08 02:20:56,645][52060] Updated weights for policy 0, policy_version 60040 (0.0010) [2023-10-08 02:20:57,003][52060] Updated weights for policy 0, policy_version 60050 (0.0008) [2023-10-08 02:20:57,375][52060] Updated weights for policy 0, policy_version 60060 (0.0007) [2023-10-08 02:20:58,627][52059] Updated weights for policy 1, policy_version 60802 (0.0007) [2023-10-08 02:20:58,985][52059] Updated weights for policy 1, policy_version 60812 (0.0009) [2023-10-08 02:20:59,352][52059] Updated weights for policy 1, policy_version 60822 (0.0008) [2023-10-08 02:20:59,717][52059] Updated weights for policy 1, policy_version 60832 (0.0007) [2023-10-08 02:21:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 123797504. Throughput: 0: 1727.2, 1: 1728.1. Samples: 30962524. Policy #0 lag: (min: 31.0, avg: 37.6, max: 63.0) [2023-10-08 02:21:01,211][50642] Avg episode reward: [(0, '20.760'), (1, '22.590')] [2023-10-08 02:21:01,290][52060] Updated weights for policy 0, policy_version 60070 (0.0008) [2023-10-08 02:21:01,653][52060] Updated weights for policy 0, policy_version 60080 (0.0009) [2023-10-08 02:21:02,027][52060] Updated weights for policy 0, policy_version 60090 (0.0011) [2023-10-08 02:21:03,626][52059] Updated weights for policy 1, policy_version 60842 (0.0007) [2023-10-08 02:21:03,985][52059] Updated weights for policy 1, policy_version 60852 (0.0007) [2023-10-08 02:21:04,344][52059] Updated weights for policy 1, policy_version 60862 (0.0008) [2023-10-08 02:21:05,891][52060] Updated weights for policy 0, policy_version 60100 (0.0008) [2023-10-08 02:21:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 123863040. Throughput: 0: 1718.9, 1: 1745.1. Samples: 30972704. Policy #0 lag: (min: 31.0, avg: 37.6, max: 63.0) [2023-10-08 02:21:06,211][50642] Avg episode reward: [(0, '22.290'), (1, '20.990')] [2023-10-08 02:21:06,259][52060] Updated weights for policy 0, policy_version 60110 (0.0009) [2023-10-08 02:21:06,633][52060] Updated weights for policy 0, policy_version 60120 (0.0008) [2023-10-08 02:21:08,082][52059] Updated weights for policy 1, policy_version 60872 (0.0007) [2023-10-08 02:21:08,450][52059] Updated weights for policy 1, policy_version 60882 (0.0007) [2023-10-08 02:21:08,812][52059] Updated weights for policy 1, policy_version 60892 (0.0007) [2023-10-08 02:21:10,662][52060] Updated weights for policy 0, policy_version 60130 (0.0009) [2023-10-08 02:21:11,034][52060] Updated weights for policy 0, policy_version 60140 (0.0008) [2023-10-08 02:21:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 123928576. Throughput: 0: 1726.7, 1: 1734.6. Samples: 30993710. Policy #0 lag: (min: 31.0, avg: 37.6, max: 63.0) [2023-10-08 02:21:11,211][50642] Avg episode reward: [(0, '19.470'), (1, '20.580')] [2023-10-08 02:21:11,393][52060] Updated weights for policy 0, policy_version 60150 (0.0007) [2023-10-08 02:21:11,762][52060] Updated weights for policy 0, policy_version 60160 (0.0008) [2023-10-08 02:21:12,677][52059] Updated weights for policy 1, policy_version 60902 (0.0008) [2023-10-08 02:21:13,035][52059] Updated weights for policy 1, policy_version 60912 (0.0007) [2023-10-08 02:21:13,402][52059] Updated weights for policy 1, policy_version 60922 (0.0008) [2023-10-08 02:21:15,627][52060] Updated weights for policy 0, policy_version 60170 (0.0009) [2023-10-08 02:21:15,999][52060] Updated weights for policy 0, policy_version 60180 (0.0009) [2023-10-08 02:21:16,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 123994112. Throughput: 0: 1714.8, 1: 1745.2. Samples: 31014558. Policy #0 lag: (min: 31.0, avg: 37.6, max: 63.0) [2023-10-08 02:21:16,211][50642] Avg episode reward: [(0, '18.420'), (1, '22.900')] [2023-10-08 02:21:16,381][52060] Updated weights for policy 0, policy_version 60190 (0.0010) [2023-10-08 02:21:17,370][52059] Updated weights for policy 1, policy_version 60932 (0.0009) [2023-10-08 02:21:17,730][52059] Updated weights for policy 1, policy_version 60942 (0.0008) [2023-10-08 02:21:18,097][52059] Updated weights for policy 1, policy_version 60952 (0.0008) [2023-10-08 02:21:20,357][52060] Updated weights for policy 0, policy_version 60200 (0.0009) [2023-10-08 02:21:20,725][52060] Updated weights for policy 0, policy_version 60210 (0.0008) [2023-10-08 02:21:21,094][52060] Updated weights for policy 0, policy_version 60220 (0.0010) [2023-10-08 02:21:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 124059648. Throughput: 0: 1732.8, 1: 1733.3. Samples: 31024782. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:21:21,211][50642] Avg episode reward: [(0, '20.410'), (1, '22.030')] [2023-10-08 02:21:21,944][52059] Updated weights for policy 1, policy_version 60962 (0.0009) [2023-10-08 02:21:22,320][52059] Updated weights for policy 1, policy_version 60972 (0.0011) [2023-10-08 02:21:22,682][52059] Updated weights for policy 1, policy_version 60982 (0.0008) [2023-10-08 02:21:23,041][52059] Updated weights for policy 1, policy_version 60992 (0.0007) [2023-10-08 02:21:25,056][52060] Updated weights for policy 0, policy_version 60230 (0.0009) [2023-10-08 02:21:25,425][52060] Updated weights for policy 0, policy_version 60240 (0.0010) [2023-10-08 02:21:25,792][52060] Updated weights for policy 0, policy_version 60250 (0.0011) [2023-10-08 02:21:26,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 124157952. Throughput: 0: 1733.3, 1: 1737.2. Samples: 31046100. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:21:26,211][50642] Avg episode reward: [(0, '18.930'), (1, '20.740')] [2023-10-08 02:21:27,139][52059] Updated weights for policy 1, policy_version 61002 (0.0008) [2023-10-08 02:21:27,507][52059] Updated weights for policy 1, policy_version 61012 (0.0010) [2023-10-08 02:21:27,878][52059] Updated weights for policy 1, policy_version 61022 (0.0011) [2023-10-08 02:21:29,772][52060] Updated weights for policy 0, policy_version 60260 (0.0009) [2023-10-08 02:21:30,161][52060] Updated weights for policy 0, policy_version 60270 (0.0007) [2023-10-08 02:21:30,527][52060] Updated weights for policy 0, policy_version 60280 (0.0009) [2023-10-08 02:21:31,211][50642] Fps is (10 sec: 16383.3, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 124223488. Throughput: 0: 1703.3, 1: 1764.6. Samples: 31066202. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:21:31,212][50642] Avg episode reward: [(0, '17.660'), (1, '19.610')] [2023-10-08 02:21:31,667][52059] Updated weights for policy 1, policy_version 61032 (0.0009) [2023-10-08 02:21:32,027][52059] Updated weights for policy 1, policy_version 61042 (0.0008) [2023-10-08 02:21:32,393][52059] Updated weights for policy 1, policy_version 61052 (0.0007) [2023-10-08 02:21:34,401][52060] Updated weights for policy 0, policy_version 60290 (0.0010) [2023-10-08 02:21:34,762][52060] Updated weights for policy 0, policy_version 60300 (0.0009) [2023-10-08 02:21:35,134][52060] Updated weights for policy 0, policy_version 60310 (0.0009) [2023-10-08 02:21:35,497][52060] Updated weights for policy 0, policy_version 60320 (0.0009) [2023-10-08 02:21:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 124289024. Throughput: 0: 1737.1, 1: 1732.9. Samples: 31077014. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:21:36,211][50642] Avg episode reward: [(0, '19.620'), (1, '22.330')] [2023-10-08 02:21:36,234][52059] Updated weights for policy 1, policy_version 61062 (0.0007) [2023-10-08 02:21:36,600][52059] Updated weights for policy 1, policy_version 61072 (0.0007) [2023-10-08 02:21:36,954][52059] Updated weights for policy 1, policy_version 61082 (0.0009) [2023-10-08 02:21:39,426][52060] Updated weights for policy 0, policy_version 60330 (0.0008) [2023-10-08 02:21:39,786][52060] Updated weights for policy 0, policy_version 60340 (0.0009) [2023-10-08 02:21:40,156][52060] Updated weights for policy 0, policy_version 60350 (0.0007) [2023-10-08 02:21:40,832][52059] Updated weights for policy 1, policy_version 61092 (0.0009) [2023-10-08 02:21:41,193][52059] Updated weights for policy 1, policy_version 61102 (0.0008) [2023-10-08 02:21:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 124354560. Throughput: 0: 1716.2, 1: 1755.5. Samples: 31097698. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:21:41,211][50642] Avg episode reward: [(0, '17.740'), (1, '21.950')] [2023-10-08 02:21:41,566][52059] Updated weights for policy 1, policy_version 61112 (0.0007) [2023-10-08 02:21:44,186][52060] Updated weights for policy 0, policy_version 60360 (0.0008) [2023-10-08 02:21:44,555][52060] Updated weights for policy 0, policy_version 60370 (0.0008) [2023-10-08 02:21:44,927][52060] Updated weights for policy 0, policy_version 60380 (0.0008) [2023-10-08 02:21:45,416][52059] Updated weights for policy 1, policy_version 61122 (0.0008) [2023-10-08 02:21:45,770][52059] Updated weights for policy 1, policy_version 61132 (0.0007) [2023-10-08 02:21:46,146][52059] Updated weights for policy 1, policy_version 61142 (0.0007) [2023-10-08 02:21:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 124420096. Throughput: 0: 1704.0, 1: 1752.6. Samples: 31118070. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:21:46,211][50642] Avg episode reward: [(0, '18.080'), (1, '21.650')] [2023-10-08 02:21:46,507][52059] Updated weights for policy 1, policy_version 61152 (0.0007) [2023-10-08 02:21:48,959][52060] Updated weights for policy 0, policy_version 60390 (0.0009) [2023-10-08 02:21:49,328][52060] Updated weights for policy 0, policy_version 60400 (0.0010) [2023-10-08 02:21:49,694][52060] Updated weights for policy 0, policy_version 60410 (0.0009) [2023-10-08 02:21:50,405][52059] Updated weights for policy 1, policy_version 61162 (0.0008) [2023-10-08 02:21:50,771][52059] Updated weights for policy 1, policy_version 61172 (0.0009) [2023-10-08 02:21:51,127][52059] Updated weights for policy 1, policy_version 61182 (0.0008) [2023-10-08 02:21:51,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 124518400. Throughput: 0: 1732.0, 1: 1750.4. Samples: 31129408. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:21:51,211][50642] Avg episode reward: [(0, '19.300'), (1, '22.700')] [2023-10-08 02:21:53,708][52060] Updated weights for policy 0, policy_version 60420 (0.0007) [2023-10-08 02:21:54,080][52060] Updated weights for policy 0, policy_version 60430 (0.0010) [2023-10-08 02:21:54,444][52060] Updated weights for policy 0, policy_version 60440 (0.0008) [2023-10-08 02:21:55,071][52059] Updated weights for policy 1, policy_version 61192 (0.0008) [2023-10-08 02:21:55,437][52059] Updated weights for policy 1, policy_version 61202 (0.0009) [2023-10-08 02:21:55,812][52059] Updated weights for policy 1, policy_version 61212 (0.0010) [2023-10-08 02:21:56,210][50642] Fps is (10 sec: 16384.4, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 124583936. Throughput: 0: 1706.6, 1: 1755.9. Samples: 31149520. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 02:21:56,211][50642] Avg episode reward: [(0, '22.260'), (1, '22.630')] [2023-10-08 02:21:58,320][52060] Updated weights for policy 0, policy_version 60450 (0.0008) [2023-10-08 02:21:58,689][52060] Updated weights for policy 0, policy_version 60460 (0.0007) [2023-10-08 02:21:59,058][52060] Updated weights for policy 0, policy_version 60470 (0.0007) [2023-10-08 02:21:59,420][52060] Updated weights for policy 0, policy_version 60480 (0.0008) [2023-10-08 02:21:59,631][52059] Updated weights for policy 1, policy_version 61222 (0.0008) [2023-10-08 02:21:59,998][52059] Updated weights for policy 1, policy_version 61232 (0.0008) [2023-10-08 02:22:00,362][52059] Updated weights for policy 1, policy_version 61242 (0.0008) [2023-10-08 02:22:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 124649472. Throughput: 0: 1717.3, 1: 1731.6. Samples: 31169756. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 02:22:01,211][50642] Avg episode reward: [(0, '19.450'), (1, '21.960')] [2023-10-08 02:22:03,445][52060] Updated weights for policy 0, policy_version 60490 (0.0008) [2023-10-08 02:22:03,802][52060] Updated weights for policy 0, policy_version 60500 (0.0007) [2023-10-08 02:22:04,169][52060] Updated weights for policy 0, policy_version 60510 (0.0007) [2023-10-08 02:22:04,236][52059] Updated weights for policy 1, policy_version 61252 (0.0008) [2023-10-08 02:22:04,597][52059] Updated weights for policy 1, policy_version 61262 (0.0008) [2023-10-08 02:22:04,957][52059] Updated weights for policy 1, policy_version 61272 (0.0008) [2023-10-08 02:22:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 124715008. Throughput: 0: 1709.0, 1: 1764.9. Samples: 31181108. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 02:22:06,211][50642] Avg episode reward: [(0, '20.560'), (1, '22.840')] [2023-10-08 02:22:08,009][52060] Updated weights for policy 0, policy_version 60520 (0.0007) [2023-10-08 02:22:08,383][52060] Updated weights for policy 0, policy_version 60530 (0.0009) [2023-10-08 02:22:08,747][52060] Updated weights for policy 0, policy_version 60540 (0.0008) [2023-10-08 02:22:09,059][52059] Updated weights for policy 1, policy_version 61282 (0.0009) [2023-10-08 02:22:09,433][52059] Updated weights for policy 1, policy_version 61292 (0.0009) [2023-10-08 02:22:09,795][52059] Updated weights for policy 1, policy_version 61302 (0.0007) [2023-10-08 02:22:10,160][52059] Updated weights for policy 1, policy_version 61312 (0.0008) [2023-10-08 02:22:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.8). Total num frames: 124780544. Throughput: 0: 1704.7, 1: 1740.1. Samples: 31201114. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 02:22:11,211][50642] Avg episode reward: [(0, '21.850'), (1, '22.810')] [2023-10-08 02:22:12,565][52060] Updated weights for policy 0, policy_version 60550 (0.0010) [2023-10-08 02:22:12,927][52060] Updated weights for policy 0, policy_version 60560 (0.0008) [2023-10-08 02:22:13,294][52060] Updated weights for policy 0, policy_version 60570 (0.0008) [2023-10-08 02:22:14,226][52059] Updated weights for policy 1, policy_version 61322 (0.0007) [2023-10-08 02:22:14,588][52059] Updated weights for policy 1, policy_version 61332 (0.0009) [2023-10-08 02:22:14,962][52059] Updated weights for policy 1, policy_version 61342 (0.0008) [2023-10-08 02:22:16,211][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 124846080. Throughput: 0: 1730.2, 1: 1725.9. Samples: 31221726. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 02:22:16,212][50642] Avg episode reward: [(0, '20.140'), (1, '21.990')] [2023-10-08 02:22:17,372][52060] Updated weights for policy 0, policy_version 60580 (0.0007) [2023-10-08 02:22:17,760][52060] Updated weights for policy 0, policy_version 60590 (0.0007) [2023-10-08 02:22:18,122][52060] Updated weights for policy 0, policy_version 60600 (0.0008) [2023-10-08 02:22:18,943][52059] Updated weights for policy 1, policy_version 61352 (0.0008) [2023-10-08 02:22:19,303][52059] Updated weights for policy 1, policy_version 61362 (0.0010) [2023-10-08 02:22:19,670][52059] Updated weights for policy 1, policy_version 61372 (0.0009) [2023-10-08 02:22:21,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 124911616. Throughput: 0: 1695.0, 1: 1749.9. Samples: 31232036. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 02:22:21,211][50642] Avg episode reward: [(0, '20.100'), (1, '21.870')] [2023-10-08 02:22:22,197][52060] Updated weights for policy 0, policy_version 60610 (0.0009) [2023-10-08 02:22:22,562][52060] Updated weights for policy 0, policy_version 60620 (0.0008) [2023-10-08 02:22:22,936][52060] Updated weights for policy 0, policy_version 60630 (0.0009) [2023-10-08 02:22:23,309][52060] Updated weights for policy 0, policy_version 60640 (0.0008) [2023-10-08 02:22:23,618][52059] Updated weights for policy 1, policy_version 61382 (0.0007) [2023-10-08 02:22:23,981][52059] Updated weights for policy 1, policy_version 61392 (0.0008) [2023-10-08 02:22:24,358][52059] Updated weights for policy 1, policy_version 61402 (0.0009) [2023-10-08 02:22:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 124977152. Throughput: 0: 1712.2, 1: 1723.1. Samples: 31252286. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 02:22:26,211][50642] Avg episode reward: [(0, '20.160'), (1, '25.620')] [2023-10-08 02:22:27,361][52060] Updated weights for policy 0, policy_version 60650 (0.0008) [2023-10-08 02:22:27,726][52060] Updated weights for policy 0, policy_version 60660 (0.0009) [2023-10-08 02:22:28,103][52060] Updated weights for policy 0, policy_version 60670 (0.0009) [2023-10-08 02:22:28,236][52059] Updated weights for policy 1, policy_version 61412 (0.0008) [2023-10-08 02:22:28,599][52059] Updated weights for policy 1, policy_version 61422 (0.0008) [2023-10-08 02:22:28,959][52059] Updated weights for policy 1, policy_version 61432 (0.0009) [2023-10-08 02:22:31,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 125042688. Throughput: 0: 1723.7, 1: 1738.4. Samples: 31273862. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 02:22:31,211][50642] Avg episode reward: [(0, '20.100'), (1, '22.510')] [2023-10-08 02:22:32,122][52060] Updated weights for policy 0, policy_version 60680 (0.0008) [2023-10-08 02:22:32,494][52060] Updated weights for policy 0, policy_version 60690 (0.0007) [2023-10-08 02:22:32,718][52059] Updated weights for policy 1, policy_version 61442 (0.0007) [2023-10-08 02:22:32,859][52060] Updated weights for policy 0, policy_version 60700 (0.0007) [2023-10-08 02:22:33,071][52059] Updated weights for policy 1, policy_version 61452 (0.0010) [2023-10-08 02:22:33,439][52059] Updated weights for policy 1, policy_version 61462 (0.0009) [2023-10-08 02:22:33,811][52059] Updated weights for policy 1, policy_version 61472 (0.0009) [2023-10-08 02:22:36,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 125108224. Throughput: 0: 1693.2, 1: 1725.2. Samples: 31283236. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 02:22:36,211][50642] Avg episode reward: [(0, '20.230'), (1, '20.910')] [2023-10-08 02:22:36,752][52060] Updated weights for policy 0, policy_version 60710 (0.0007) [2023-10-08 02:22:37,117][52060] Updated weights for policy 0, policy_version 60720 (0.0007) [2023-10-08 02:22:37,487][52060] Updated weights for policy 0, policy_version 60730 (0.0007) [2023-10-08 02:22:37,762][52059] Updated weights for policy 1, policy_version 61482 (0.0009) [2023-10-08 02:22:38,133][52059] Updated weights for policy 1, policy_version 61492 (0.0008) [2023-10-08 02:22:38,500][52059] Updated weights for policy 1, policy_version 61502 (0.0009) [2023-10-08 02:22:41,211][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 125173760. Throughput: 0: 1722.8, 1: 1723.5. Samples: 31304604. Policy #0 lag: (min: 21.0, avg: 31.0, max: 53.0) [2023-10-08 02:22:41,212][50642] Avg episode reward: [(0, '20.690'), (1, '22.660')] [2023-10-08 02:22:41,430][52060] Updated weights for policy 0, policy_version 60740 (0.0008) [2023-10-08 02:22:41,794][52060] Updated weights for policy 0, policy_version 60750 (0.0008) [2023-10-08 02:22:42,172][52060] Updated weights for policy 0, policy_version 60760 (0.0010) [2023-10-08 02:22:42,581][52059] Updated weights for policy 1, policy_version 61512 (0.0010) [2023-10-08 02:22:42,942][52059] Updated weights for policy 1, policy_version 61522 (0.0008) [2023-10-08 02:22:43,313][52059] Updated weights for policy 1, policy_version 61532 (0.0008) [2023-10-08 02:22:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 125239296. Throughput: 0: 1721.7, 1: 1749.1. Samples: 31325942. Policy #0 lag: (min: 21.0, avg: 31.0, max: 53.0) [2023-10-08 02:22:46,211][50642] Avg episode reward: [(0, '19.110'), (1, '20.490')] [2023-10-08 02:22:46,220][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000061536_63012864.pth... [2023-10-08 02:22:46,251][52060] Updated weights for policy 0, policy_version 60770 (0.0009) [2023-10-08 02:22:46,255][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000059904_61341696.pth [2023-10-08 02:22:46,625][52060] Updated weights for policy 0, policy_version 60780 (0.0008) [2023-10-08 02:22:46,997][52060] Updated weights for policy 0, policy_version 60790 (0.0009) [2023-10-08 02:22:47,132][52059] Updated weights for policy 1, policy_version 61542 (0.0007) [2023-10-08 02:22:47,359][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000060800_62259200.pth... [2023-10-08 02:22:47,363][52060] Updated weights for policy 0, policy_version 60800 (0.0009) [2023-10-08 02:22:47,396][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000059168_60588032.pth [2023-10-08 02:22:47,500][52059] Updated weights for policy 1, policy_version 61552 (0.0010) [2023-10-08 02:22:47,873][52059] Updated weights for policy 1, policy_version 61562 (0.0010) [2023-10-08 02:22:51,210][50642] Fps is (10 sec: 13107.7, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 125304832. Throughput: 0: 1710.5, 1: 1711.5. Samples: 31335096. Policy #0 lag: (min: 21.0, avg: 31.0, max: 53.0) [2023-10-08 02:22:51,211][50642] Avg episode reward: [(0, '20.940'), (1, '21.310')] [2023-10-08 02:22:51,436][52060] Updated weights for policy 0, policy_version 60810 (0.0007) [2023-10-08 02:22:51,797][52060] Updated weights for policy 0, policy_version 60820 (0.0007) [2023-10-08 02:22:51,872][52059] Updated weights for policy 1, policy_version 61572 (0.0009) [2023-10-08 02:22:52,171][52060] Updated weights for policy 0, policy_version 60830 (0.0008) [2023-10-08 02:22:52,231][52059] Updated weights for policy 1, policy_version 61582 (0.0008) [2023-10-08 02:22:52,604][52059] Updated weights for policy 1, policy_version 61592 (0.0009) [2023-10-08 02:22:56,021][52060] Updated weights for policy 0, policy_version 60840 (0.0007) [2023-10-08 02:22:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 125370368. Throughput: 0: 1718.1, 1: 1735.7. Samples: 31356538. Policy #0 lag: (min: 21.0, avg: 31.0, max: 53.0) [2023-10-08 02:22:56,211][50642] Avg episode reward: [(0, '22.400'), (1, '19.210')] [2023-10-08 02:22:56,385][52060] Updated weights for policy 0, policy_version 60850 (0.0009) [2023-10-08 02:22:56,470][52059] Updated weights for policy 1, policy_version 61602 (0.0009) [2023-10-08 02:22:56,753][52060] Updated weights for policy 0, policy_version 60860 (0.0008) [2023-10-08 02:22:56,842][52059] Updated weights for policy 1, policy_version 61612 (0.0010) [2023-10-08 02:22:57,201][52059] Updated weights for policy 1, policy_version 61622 (0.0010) [2023-10-08 02:22:57,568][52059] Updated weights for policy 1, policy_version 61632 (0.0008) [2023-10-08 02:23:00,553][52060] Updated weights for policy 0, policy_version 60870 (0.0008) [2023-10-08 02:23:00,923][52060] Updated weights for policy 0, policy_version 60880 (0.0008) [2023-10-08 02:23:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 125435904. Throughput: 0: 1707.9, 1: 1748.9. Samples: 31377280. Policy #0 lag: (min: 21.0, avg: 31.0, max: 53.0) [2023-10-08 02:23:01,211][50642] Avg episode reward: [(0, '20.130'), (1, '20.390')] [2023-10-08 02:23:01,290][52060] Updated weights for policy 0, policy_version 60890 (0.0008) [2023-10-08 02:23:01,581][52059] Updated weights for policy 1, policy_version 61642 (0.0007) [2023-10-08 02:23:01,945][52059] Updated weights for policy 1, policy_version 61652 (0.0007) [2023-10-08 02:23:02,310][52059] Updated weights for policy 1, policy_version 61662 (0.0009) [2023-10-08 02:23:05,523][52060] Updated weights for policy 0, policy_version 60900 (0.0009) [2023-10-08 02:23:05,908][52060] Updated weights for policy 0, policy_version 60910 (0.0009) [2023-10-08 02:23:06,148][52059] Updated weights for policy 1, policy_version 61672 (0.0008) [2023-10-08 02:23:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 125501440. Throughput: 0: 1721.1, 1: 1725.0. Samples: 31387110. Policy #0 lag: (min: 21.0, avg: 31.0, max: 53.0) [2023-10-08 02:23:06,211][50642] Avg episode reward: [(0, '19.800'), (1, '20.750')] [2023-10-08 02:23:06,278][52060] Updated weights for policy 0, policy_version 60920 (0.0007) [2023-10-08 02:23:06,506][52059] Updated weights for policy 1, policy_version 61682 (0.0008) [2023-10-08 02:23:06,866][52059] Updated weights for policy 1, policy_version 61692 (0.0008) [2023-10-08 02:23:10,270][52060] Updated weights for policy 0, policy_version 60930 (0.0007) [2023-10-08 02:23:10,643][52060] Updated weights for policy 0, policy_version 60940 (0.0008) [2023-10-08 02:23:10,719][52059] Updated weights for policy 1, policy_version 61702 (0.0010) [2023-10-08 02:23:11,017][52060] Updated weights for policy 0, policy_version 60950 (0.0010) [2023-10-08 02:23:11,082][52059] Updated weights for policy 1, policy_version 61712 (0.0009) [2023-10-08 02:23:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 125566976. Throughput: 0: 1715.9, 1: 1750.8. Samples: 31408286. Policy #0 lag: (min: 21.0, avg: 31.0, max: 53.0) [2023-10-08 02:23:11,211][50642] Avg episode reward: [(0, '21.150'), (1, '21.700')] [2023-10-08 02:23:11,376][52060] Updated weights for policy 0, policy_version 60960 (0.0009) [2023-10-08 02:23:11,446][52059] Updated weights for policy 1, policy_version 61722 (0.0007) [2023-10-08 02:23:15,392][52059] Updated weights for policy 1, policy_version 61732 (0.0007) [2023-10-08 02:23:15,487][52060] Updated weights for policy 0, policy_version 60970 (0.0009) [2023-10-08 02:23:15,751][52059] Updated weights for policy 1, policy_version 61742 (0.0008) [2023-10-08 02:23:15,847][52060] Updated weights for policy 0, policy_version 60980 (0.0009) [2023-10-08 02:23:16,100][52059] Updated weights for policy 1, policy_version 61752 (0.0007) [2023-10-08 02:23:16,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 125632512. Throughput: 0: 1695.3, 1: 1731.5. Samples: 31428068. Policy #0 lag: (min: 21.0, avg: 31.0, max: 53.0) [2023-10-08 02:23:16,211][50642] Avg episode reward: [(0, '20.400'), (1, '18.650')] [2023-10-08 02:23:16,219][52060] Updated weights for policy 0, policy_version 60990 (0.0009) [2023-10-08 02:23:20,134][52059] Updated weights for policy 1, policy_version 61762 (0.0009) [2023-10-08 02:23:20,245][52060] Updated weights for policy 0, policy_version 61000 (0.0008) [2023-10-08 02:23:20,503][52059] Updated weights for policy 1, policy_version 61772 (0.0009) [2023-10-08 02:23:20,618][52060] Updated weights for policy 0, policy_version 61010 (0.0008) [2023-10-08 02:23:20,855][52059] Updated weights for policy 1, policy_version 61782 (0.0009) [2023-10-08 02:23:20,980][52060] Updated weights for policy 0, policy_version 61020 (0.0007) [2023-10-08 02:23:21,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 125730816. Throughput: 0: 1714.2, 1: 1740.0. Samples: 31438676. Policy #0 lag: (min: 30.0, avg: 37.0, max: 62.0) [2023-10-08 02:23:21,211][50642] Avg episode reward: [(0, '21.240'), (1, '20.250')] [2023-10-08 02:23:21,219][52059] Updated weights for policy 1, policy_version 61792 (0.0008) [2023-10-08 02:23:24,934][52060] Updated weights for policy 0, policy_version 61030 (0.0008) [2023-10-08 02:23:25,156][52059] Updated weights for policy 1, policy_version 61802 (0.0010) [2023-10-08 02:23:25,299][52060] Updated weights for policy 0, policy_version 61040 (0.0007) [2023-10-08 02:23:25,517][52059] Updated weights for policy 1, policy_version 61812 (0.0008) [2023-10-08 02:23:25,663][52060] Updated weights for policy 0, policy_version 61050 (0.0009) [2023-10-08 02:23:25,875][52059] Updated weights for policy 1, policy_version 61822 (0.0009) [2023-10-08 02:23:26,210][50642] Fps is (10 sec: 19660.6, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 125829120. Throughput: 0: 1705.3, 1: 1740.3. Samples: 31459652. Policy #0 lag: (min: 30.0, avg: 37.0, max: 62.0) [2023-10-08 02:23:26,211][50642] Avg episode reward: [(0, '20.150'), (1, '20.940')] [2023-10-08 02:23:29,653][52060] Updated weights for policy 0, policy_version 61060 (0.0009) [2023-10-08 02:23:29,787][52059] Updated weights for policy 1, policy_version 61832 (0.0010) [2023-10-08 02:23:30,012][52060] Updated weights for policy 0, policy_version 61070 (0.0008) [2023-10-08 02:23:30,157][52059] Updated weights for policy 1, policy_version 61842 (0.0009) [2023-10-08 02:23:30,385][52060] Updated weights for policy 0, policy_version 61080 (0.0009) [2023-10-08 02:23:30,524][52059] Updated weights for policy 1, policy_version 61852 (0.0008) [2023-10-08 02:23:31,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 125894656. Throughput: 0: 1677.9, 1: 1711.0. Samples: 31478442. Policy #0 lag: (min: 30.0, avg: 37.0, max: 62.0) [2023-10-08 02:23:31,211][50642] Avg episode reward: [(0, '20.050'), (1, '21.660')] [2023-10-08 02:23:34,339][52059] Updated weights for policy 1, policy_version 61862 (0.0007) [2023-10-08 02:23:34,466][52060] Updated weights for policy 0, policy_version 61090 (0.0008) [2023-10-08 02:23:34,718][52059] Updated weights for policy 1, policy_version 61872 (0.0009) [2023-10-08 02:23:34,836][52060] Updated weights for policy 0, policy_version 61100 (0.0008) [2023-10-08 02:23:35,087][52059] Updated weights for policy 1, policy_version 61882 (0.0008) [2023-10-08 02:23:35,211][52060] Updated weights for policy 0, policy_version 61110 (0.0007) [2023-10-08 02:23:35,572][52060] Updated weights for policy 0, policy_version 61120 (0.0008) [2023-10-08 02:23:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 125960192. Throughput: 0: 1708.3, 1: 1746.4. Samples: 31490560. Policy #0 lag: (min: 30.0, avg: 37.0, max: 62.0) [2023-10-08 02:23:36,211][50642] Avg episode reward: [(0, '19.780'), (1, '22.690')] [2023-10-08 02:23:38,990][52059] Updated weights for policy 1, policy_version 61892 (0.0010) [2023-10-08 02:23:39,349][52059] Updated weights for policy 1, policy_version 61902 (0.0008) [2023-10-08 02:23:39,468][52060] Updated weights for policy 0, policy_version 61130 (0.0008) [2023-10-08 02:23:39,714][52059] Updated weights for policy 1, policy_version 61912 (0.0009) [2023-10-08 02:23:39,832][52060] Updated weights for policy 0, policy_version 61140 (0.0007) [2023-10-08 02:23:40,209][52060] Updated weights for policy 0, policy_version 61150 (0.0008) [2023-10-08 02:23:41,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 126025728. Throughput: 0: 1689.3, 1: 1719.5. Samples: 31509932. Policy #0 lag: (min: 30.0, avg: 37.0, max: 62.0) [2023-10-08 02:23:41,211][50642] Avg episode reward: [(0, '19.700'), (1, '20.920')] [2023-10-08 02:23:43,838][52059] Updated weights for policy 1, policy_version 61922 (0.0010) [2023-10-08 02:23:44,208][52059] Updated weights for policy 1, policy_version 61932 (0.0008) [2023-10-08 02:23:44,245][52060] Updated weights for policy 0, policy_version 61160 (0.0009) [2023-10-08 02:23:44,578][52059] Updated weights for policy 1, policy_version 61942 (0.0007) [2023-10-08 02:23:44,612][52060] Updated weights for policy 0, policy_version 61170 (0.0009) [2023-10-08 02:23:44,945][52059] Updated weights for policy 1, policy_version 61952 (0.0008) [2023-10-08 02:23:44,984][52060] Updated weights for policy 0, policy_version 61180 (0.0008) [2023-10-08 02:23:46,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 126091264. Throughput: 0: 1688.1, 1: 1705.7. Samples: 31530002. Policy #0 lag: (min: 30.0, avg: 37.0, max: 62.0) [2023-10-08 02:23:46,211][50642] Avg episode reward: [(0, '20.430'), (1, '23.170')] [2023-10-08 02:23:48,949][52059] Updated weights for policy 1, policy_version 61962 (0.0008) [2023-10-08 02:23:48,978][52060] Updated weights for policy 0, policy_version 61190 (0.0008) [2023-10-08 02:23:49,323][52059] Updated weights for policy 1, policy_version 61972 (0.0009) [2023-10-08 02:23:49,345][52060] Updated weights for policy 0, policy_version 61200 (0.0007) [2023-10-08 02:23:49,687][52059] Updated weights for policy 1, policy_version 61982 (0.0009) [2023-10-08 02:23:49,715][52060] Updated weights for policy 0, policy_version 61210 (0.0008) [2023-10-08 02:23:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 126156800. Throughput: 0: 1702.0, 1: 1723.6. Samples: 31541262. Policy #0 lag: (min: 30.0, avg: 37.0, max: 62.0) [2023-10-08 02:23:51,211][50642] Avg episode reward: [(0, '20.450'), (1, '23.300')] [2023-10-08 02:23:53,537][52059] Updated weights for policy 1, policy_version 61992 (0.0009) [2023-10-08 02:23:53,748][52060] Updated weights for policy 0, policy_version 61220 (0.0010) [2023-10-08 02:23:53,902][52059] Updated weights for policy 1, policy_version 62002 (0.0008) [2023-10-08 02:23:54,121][52060] Updated weights for policy 0, policy_version 61230 (0.0008) [2023-10-08 02:23:54,268][52059] Updated weights for policy 1, policy_version 62012 (0.0008) [2023-10-08 02:23:54,480][52060] Updated weights for policy 0, policy_version 61240 (0.0007) [2023-10-08 02:23:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 126222336. Throughput: 0: 1678.7, 1: 1696.9. Samples: 31560190. Policy #0 lag: (min: 30.0, avg: 37.0, max: 62.0) [2023-10-08 02:23:56,211][50642] Avg episode reward: [(0, '20.860'), (1, '23.690')] [2023-10-08 02:23:58,298][52059] Updated weights for policy 1, policy_version 62022 (0.0008) [2023-10-08 02:23:58,652][52060] Updated weights for policy 0, policy_version 61250 (0.0008) [2023-10-08 02:23:58,659][52059] Updated weights for policy 1, policy_version 62032 (0.0008) [2023-10-08 02:23:59,028][52059] Updated weights for policy 1, policy_version 62042 (0.0007) [2023-10-08 02:23:59,063][52060] Updated weights for policy 0, policy_version 61260 (0.0007) [2023-10-08 02:23:59,422][52060] Updated weights for policy 0, policy_version 61270 (0.0009) [2023-10-08 02:23:59,787][52060] Updated weights for policy 0, policy_version 61280 (0.0010) [2023-10-08 02:24:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 126287872. Throughput: 0: 1687.8, 1: 1711.3. Samples: 31581030. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) [2023-10-08 02:24:01,211][50642] Avg episode reward: [(0, '21.330'), (1, '21.900')] [2023-10-08 02:24:02,822][52059] Updated weights for policy 1, policy_version 62052 (0.0007) [2023-10-08 02:24:03,183][52059] Updated weights for policy 1, policy_version 62062 (0.0008) [2023-10-08 02:24:03,557][52059] Updated weights for policy 1, policy_version 62072 (0.0007) [2023-10-08 02:24:03,746][52060] Updated weights for policy 0, policy_version 61290 (0.0007) [2023-10-08 02:24:04,113][52060] Updated weights for policy 0, policy_version 61300 (0.0007) [2023-10-08 02:24:04,485][52060] Updated weights for policy 0, policy_version 61310 (0.0007) [2023-10-08 02:24:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 126353408. Throughput: 0: 1686.6, 1: 1705.7. Samples: 31591330. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) [2023-10-08 02:24:06,211][50642] Avg episode reward: [(0, '18.900'), (1, '19.710')] [2023-10-08 02:24:07,496][52059] Updated weights for policy 1, policy_version 62082 (0.0007) [2023-10-08 02:24:07,856][52059] Updated weights for policy 1, policy_version 62092 (0.0008) [2023-10-08 02:24:08,221][52059] Updated weights for policy 1, policy_version 62102 (0.0009) [2023-10-08 02:24:08,485][52060] Updated weights for policy 0, policy_version 61320 (0.0007) [2023-10-08 02:24:08,582][52059] Updated weights for policy 1, policy_version 62112 (0.0008) [2023-10-08 02:24:08,858][52060] Updated weights for policy 0, policy_version 61330 (0.0008) [2023-10-08 02:24:09,226][52060] Updated weights for policy 0, policy_version 61340 (0.0009) [2023-10-08 02:24:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 126418944. Throughput: 0: 1674.3, 1: 1707.7. Samples: 31611842. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) [2023-10-08 02:24:11,211][50642] Avg episode reward: [(0, '17.840'), (1, '23.000')] [2023-10-08 02:24:12,736][52059] Updated weights for policy 1, policy_version 62122 (0.0011) [2023-10-08 02:24:13,117][52059] Updated weights for policy 1, policy_version 62132 (0.0009) [2023-10-08 02:24:13,230][52060] Updated weights for policy 0, policy_version 61350 (0.0008) [2023-10-08 02:24:13,481][52059] Updated weights for policy 1, policy_version 62142 (0.0007) [2023-10-08 02:24:13,594][52060] Updated weights for policy 0, policy_version 61360 (0.0009) [2023-10-08 02:24:13,974][52060] Updated weights for policy 0, policy_version 61370 (0.0009) [2023-10-08 02:24:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 126484480. Throughput: 0: 1706.5, 1: 1731.5. Samples: 31633152. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) [2023-10-08 02:24:16,210][50642] Avg episode reward: [(0, '21.870'), (1, '21.710')] [2023-10-08 02:24:17,454][52059] Updated weights for policy 1, policy_version 62152 (0.0010) [2023-10-08 02:24:17,821][52059] Updated weights for policy 1, policy_version 62162 (0.0008) [2023-10-08 02:24:17,886][52060] Updated weights for policy 0, policy_version 61380 (0.0009) [2023-10-08 02:24:18,176][52059] Updated weights for policy 1, policy_version 62172 (0.0008) [2023-10-08 02:24:18,265][52060] Updated weights for policy 0, policy_version 61390 (0.0009) [2023-10-08 02:24:18,631][52060] Updated weights for policy 0, policy_version 61400 (0.0008) [2023-10-08 02:24:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 126550016. Throughput: 0: 1683.9, 1: 1695.3. Samples: 31642626. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) [2023-10-08 02:24:21,211][50642] Avg episode reward: [(0, '19.340'), (1, '20.690')] [2023-10-08 02:24:22,190][52059] Updated weights for policy 1, policy_version 62182 (0.0007) [2023-10-08 02:24:22,552][52059] Updated weights for policy 1, policy_version 62192 (0.0007) [2023-10-08 02:24:22,693][52060] Updated weights for policy 0, policy_version 61410 (0.0010) [2023-10-08 02:24:22,908][52059] Updated weights for policy 1, policy_version 62202 (0.0007) [2023-10-08 02:24:23,070][52060] Updated weights for policy 0, policy_version 61420 (0.0008) [2023-10-08 02:24:23,434][52060] Updated weights for policy 0, policy_version 61430 (0.0009) [2023-10-08 02:24:23,802][52060] Updated weights for policy 0, policy_version 61440 (0.0007) [2023-10-08 02:24:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 126615552. Throughput: 0: 1697.3, 1: 1720.2. Samples: 31663718. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) [2023-10-08 02:24:26,211][50642] Avg episode reward: [(0, '17.830'), (1, '21.470')] [2023-10-08 02:24:26,686][52059] Updated weights for policy 1, policy_version 62212 (0.0008) [2023-10-08 02:24:27,047][52059] Updated weights for policy 1, policy_version 62222 (0.0009) [2023-10-08 02:24:27,415][52059] Updated weights for policy 1, policy_version 62232 (0.0009) [2023-10-08 02:24:27,614][52060] Updated weights for policy 0, policy_version 61450 (0.0008) [2023-10-08 02:24:27,986][52060] Updated weights for policy 0, policy_version 61460 (0.0007) [2023-10-08 02:24:28,342][52060] Updated weights for policy 0, policy_version 61470 (0.0008) [2023-10-08 02:24:31,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 126681088. Throughput: 0: 1712.7, 1: 1731.9. Samples: 31685006. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) [2023-10-08 02:24:31,211][50642] Avg episode reward: [(0, '20.020'), (1, '26.550')] [2023-10-08 02:24:31,424][52059] Updated weights for policy 1, policy_version 62242 (0.0008) [2023-10-08 02:24:31,785][52059] Updated weights for policy 1, policy_version 62252 (0.0007) [2023-10-08 02:24:32,153][52059] Updated weights for policy 1, policy_version 62262 (0.0007) [2023-10-08 02:24:32,432][52060] Updated weights for policy 0, policy_version 61480 (0.0009) [2023-10-08 02:24:32,521][52059] Updated weights for policy 1, policy_version 62272 (0.0008) [2023-10-08 02:24:32,801][52060] Updated weights for policy 0, policy_version 61490 (0.0009) [2023-10-08 02:24:33,185][52060] Updated weights for policy 0, policy_version 61500 (0.0011) [2023-10-08 02:24:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 126746624. Throughput: 0: 1686.1, 1: 1714.8. Samples: 31694302. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) [2023-10-08 02:24:36,211][50642] Avg episode reward: [(0, '24.380'), (1, '23.280')] [2023-10-08 02:24:36,212][51605] Saving new best policy, reward=24.380! [2023-10-08 02:24:36,674][52059] Updated weights for policy 1, policy_version 62282 (0.0009) [2023-10-08 02:24:37,044][52059] Updated weights for policy 1, policy_version 62292 (0.0009) [2023-10-08 02:24:37,143][52060] Updated weights for policy 0, policy_version 61510 (0.0008) [2023-10-08 02:24:37,400][52059] Updated weights for policy 1, policy_version 62302 (0.0008) [2023-10-08 02:24:37,509][52060] Updated weights for policy 0, policy_version 61520 (0.0009) [2023-10-08 02:24:37,881][52060] Updated weights for policy 0, policy_version 61530 (0.0008) [2023-10-08 02:24:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 126812160. Throughput: 0: 1716.3, 1: 1734.4. Samples: 31715470. Policy #0 lag: (min: 31.0, avg: 34.3, max: 63.0) [2023-10-08 02:24:41,211][50642] Avg episode reward: [(0, '17.790'), (1, '21.840')] [2023-10-08 02:24:41,439][52059] Updated weights for policy 1, policy_version 62312 (0.0009) [2023-10-08 02:24:41,803][52059] Updated weights for policy 1, policy_version 62322 (0.0008) [2023-10-08 02:24:41,812][52060] Updated weights for policy 0, policy_version 61540 (0.0008) [2023-10-08 02:24:42,166][52059] Updated weights for policy 1, policy_version 62332 (0.0008) [2023-10-08 02:24:42,189][52060] Updated weights for policy 0, policy_version 61550 (0.0008) [2023-10-08 02:24:42,553][52060] Updated weights for policy 0, policy_version 61560 (0.0007) [2023-10-08 02:24:45,946][52059] Updated weights for policy 1, policy_version 62342 (0.0007) [2023-10-08 02:24:46,211][50642] Fps is (10 sec: 13106.5, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 126877696. Throughput: 0: 1726.9, 1: 1733.3. Samples: 31736738. Policy #0 lag: (min: 31.0, avg: 40.8, max: 63.0) [2023-10-08 02:24:46,212][50642] Avg episode reward: [(0, '18.390'), (1, '21.000')] [2023-10-08 02:24:46,221][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000061568_63045632.pth... [2023-10-08 02:24:46,260][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000059968_61407232.pth [2023-10-08 02:24:46,316][52059] Updated weights for policy 1, policy_version 62352 (0.0009) [2023-10-08 02:24:46,598][52060] Updated weights for policy 0, policy_version 61570 (0.0008) [2023-10-08 02:24:46,680][52059] Updated weights for policy 1, policy_version 62362 (0.0008) [2023-10-08 02:24:46,900][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000062368_63864832.pth... [2023-10-08 02:24:46,928][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000060736_62193664.pth [2023-10-08 02:24:46,982][52060] Updated weights for policy 0, policy_version 61580 (0.0010) [2023-10-08 02:24:47,348][52060] Updated weights for policy 0, policy_version 61590 (0.0009) [2023-10-08 02:24:47,727][52060] Updated weights for policy 0, policy_version 61600 (0.0009) [2023-10-08 02:24:50,734][52059] Updated weights for policy 1, policy_version 62372 (0.0008) [2023-10-08 02:24:51,104][52059] Updated weights for policy 1, policy_version 62382 (0.0008) [2023-10-08 02:24:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 126943232. Throughput: 0: 1707.8, 1: 1731.7. Samples: 31746108. Policy #0 lag: (min: 31.0, avg: 40.8, max: 63.0) [2023-10-08 02:24:51,211][50642] Avg episode reward: [(0, '21.930'), (1, '16.400')] [2023-10-08 02:24:51,469][52059] Updated weights for policy 1, policy_version 62392 (0.0008) [2023-10-08 02:24:51,717][52060] Updated weights for policy 0, policy_version 61610 (0.0009) [2023-10-08 02:24:52,086][52060] Updated weights for policy 0, policy_version 61620 (0.0009) [2023-10-08 02:24:52,456][52060] Updated weights for policy 0, policy_version 61630 (0.0008) [2023-10-08 02:24:55,292][52059] Updated weights for policy 1, policy_version 62402 (0.0009) [2023-10-08 02:24:55,648][52059] Updated weights for policy 1, policy_version 62412 (0.0009) [2023-10-08 02:24:56,015][52059] Updated weights for policy 1, policy_version 62422 (0.0009) [2023-10-08 02:24:56,210][50642] Fps is (10 sec: 13108.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 127008768. Throughput: 0: 1723.5, 1: 1736.7. Samples: 31767550. Policy #0 lag: (min: 31.0, avg: 40.8, max: 63.0) [2023-10-08 02:24:56,211][50642] Avg episode reward: [(0, '21.300'), (1, '17.860')] [2023-10-08 02:24:56,383][52059] Updated weights for policy 1, policy_version 62432 (0.0009) [2023-10-08 02:24:56,461][52060] Updated weights for policy 0, policy_version 61640 (0.0008) [2023-10-08 02:24:56,837][52060] Updated weights for policy 0, policy_version 61650 (0.0008) [2023-10-08 02:24:57,201][52060] Updated weights for policy 0, policy_version 61660 (0.0008) [2023-10-08 02:25:00,236][52059] Updated weights for policy 1, policy_version 62442 (0.0008) [2023-10-08 02:25:00,604][52059] Updated weights for policy 1, policy_version 62452 (0.0007) [2023-10-08 02:25:00,965][52059] Updated weights for policy 1, policy_version 62462 (0.0007) [2023-10-08 02:25:01,156][52060] Updated weights for policy 0, policy_version 61670 (0.0008) [2023-10-08 02:25:01,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 127107072. Throughput: 0: 1721.1, 1: 1719.5. Samples: 31787982. Policy #0 lag: (min: 31.0, avg: 40.8, max: 63.0) [2023-10-08 02:25:01,211][50642] Avg episode reward: [(0, '16.430'), (1, '17.440')] [2023-10-08 02:25:01,522][52060] Updated weights for policy 0, policy_version 61680 (0.0009) [2023-10-08 02:25:01,889][52060] Updated weights for policy 0, policy_version 61690 (0.0009) [2023-10-08 02:25:04,927][52059] Updated weights for policy 1, policy_version 62472 (0.0007) [2023-10-08 02:25:05,294][52059] Updated weights for policy 1, policy_version 62482 (0.0008) [2023-10-08 02:25:05,666][52059] Updated weights for policy 1, policy_version 62492 (0.0008) [2023-10-08 02:25:05,924][52060] Updated weights for policy 0, policy_version 61700 (0.0007) [2023-10-08 02:25:06,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 127172608. Throughput: 0: 1716.8, 1: 1751.0. Samples: 31798674. Policy #0 lag: (min: 31.0, avg: 40.8, max: 63.0) [2023-10-08 02:25:06,211][50642] Avg episode reward: [(0, '20.020'), (1, '16.220')] [2023-10-08 02:25:06,294][52060] Updated weights for policy 0, policy_version 61710 (0.0009) [2023-10-08 02:25:06,671][52060] Updated weights for policy 0, policy_version 61720 (0.0008) [2023-10-08 02:25:09,385][52059] Updated weights for policy 1, policy_version 62502 (0.0008) [2023-10-08 02:25:09,738][52059] Updated weights for policy 1, policy_version 62512 (0.0007) [2023-10-08 02:25:10,111][52059] Updated weights for policy 1, policy_version 62522 (0.0007) [2023-10-08 02:25:10,480][52060] Updated weights for policy 0, policy_version 61730 (0.0008) [2023-10-08 02:25:10,856][52060] Updated weights for policy 0, policy_version 61740 (0.0009) [2023-10-08 02:25:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 127238144. Throughput: 0: 1720.9, 1: 1740.0. Samples: 31819456. Policy #0 lag: (min: 31.0, avg: 40.8, max: 63.0) [2023-10-08 02:25:11,211][50642] Avg episode reward: [(0, '23.700'), (1, '16.650')] [2023-10-08 02:25:11,236][52060] Updated weights for policy 0, policy_version 61750 (0.0008) [2023-10-08 02:25:11,595][52060] Updated weights for policy 0, policy_version 61760 (0.0008) [2023-10-08 02:25:14,012][52059] Updated weights for policy 1, policy_version 62532 (0.0008) [2023-10-08 02:25:14,379][52059] Updated weights for policy 1, policy_version 62542 (0.0007) [2023-10-08 02:25:14,749][52059] Updated weights for policy 1, policy_version 62552 (0.0009) [2023-10-08 02:25:15,435][52060] Updated weights for policy 0, policy_version 61770 (0.0009) [2023-10-08 02:25:15,814][52060] Updated weights for policy 0, policy_version 61780 (0.0008) [2023-10-08 02:25:16,173][52060] Updated weights for policy 0, policy_version 61790 (0.0007) [2023-10-08 02:25:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 127303680. Throughput: 0: 1706.1, 1: 1728.3. Samples: 31839550. Policy #0 lag: (min: 31.0, avg: 40.8, max: 63.0) [2023-10-08 02:25:16,211][50642] Avg episode reward: [(0, '18.440'), (1, '18.360')] [2023-10-08 02:25:18,755][52059] Updated weights for policy 1, policy_version 62562 (0.0007) [2023-10-08 02:25:19,112][52059] Updated weights for policy 1, policy_version 62572 (0.0009) [2023-10-08 02:25:19,477][52059] Updated weights for policy 1, policy_version 62582 (0.0008) [2023-10-08 02:25:19,844][52059] Updated weights for policy 1, policy_version 62592 (0.0009) [2023-10-08 02:25:20,119][52060] Updated weights for policy 0, policy_version 61800 (0.0009) [2023-10-08 02:25:20,486][52060] Updated weights for policy 0, policy_version 61810 (0.0010) [2023-10-08 02:25:20,854][52060] Updated weights for policy 0, policy_version 61820 (0.0010) [2023-10-08 02:25:21,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 127401984. Throughput: 0: 1724.9, 1: 1749.8. Samples: 31850666. Policy #0 lag: (min: 31.0, avg: 40.8, max: 63.0) [2023-10-08 02:25:21,211][50642] Avg episode reward: [(0, '17.730'), (1, '19.730')] [2023-10-08 02:25:23,580][52059] Updated weights for policy 1, policy_version 62602 (0.0008) [2023-10-08 02:25:23,937][52059] Updated weights for policy 1, policy_version 62612 (0.0009) [2023-10-08 02:25:24,304][52059] Updated weights for policy 1, policy_version 62622 (0.0010) [2023-10-08 02:25:24,944][52060] Updated weights for policy 0, policy_version 61830 (0.0010) [2023-10-08 02:25:25,313][52060] Updated weights for policy 0, policy_version 61840 (0.0010) [2023-10-08 02:25:25,684][52060] Updated weights for policy 0, policy_version 61850 (0.0009) [2023-10-08 02:25:26,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 127467520. Throughput: 0: 1718.5, 1: 1735.6. Samples: 31870906. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 02:25:26,211][50642] Avg episode reward: [(0, '21.620'), (1, '20.630')] [2023-10-08 02:25:28,347][52059] Updated weights for policy 1, policy_version 62632 (0.0009) [2023-10-08 02:25:28,722][52059] Updated weights for policy 1, policy_version 62642 (0.0009) [2023-10-08 02:25:29,085][52059] Updated weights for policy 1, policy_version 62652 (0.0010) [2023-10-08 02:25:29,580][52060] Updated weights for policy 0, policy_version 61860 (0.0010) [2023-10-08 02:25:29,945][52060] Updated weights for policy 0, policy_version 61870 (0.0007) [2023-10-08 02:25:30,313][52060] Updated weights for policy 0, policy_version 61880 (0.0009) [2023-10-08 02:25:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 127533056. Throughput: 0: 1692.8, 1: 1737.2. Samples: 31891086. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 02:25:31,211][50642] Avg episode reward: [(0, '23.320'), (1, '20.640')] [2023-10-08 02:25:32,904][52059] Updated weights for policy 1, policy_version 62662 (0.0009) [2023-10-08 02:25:33,262][52059] Updated weights for policy 1, policy_version 62672 (0.0011) [2023-10-08 02:25:33,631][52059] Updated weights for policy 1, policy_version 62682 (0.0007) [2023-10-08 02:25:34,387][52060] Updated weights for policy 0, policy_version 61890 (0.0008) [2023-10-08 02:25:34,796][52060] Updated weights for policy 0, policy_version 61900 (0.0007) [2023-10-08 02:25:35,155][52060] Updated weights for policy 0, policy_version 61910 (0.0008) [2023-10-08 02:25:35,519][52060] Updated weights for policy 0, policy_version 61920 (0.0008) [2023-10-08 02:25:36,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 127598592. Throughput: 0: 1725.2, 1: 1736.3. Samples: 31901876. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 02:25:36,211][50642] Avg episode reward: [(0, '16.930'), (1, '21.500')] [2023-10-08 02:25:37,520][52059] Updated weights for policy 1, policy_version 62692 (0.0008) [2023-10-08 02:25:37,882][52059] Updated weights for policy 1, policy_version 62702 (0.0007) [2023-10-08 02:25:38,248][52059] Updated weights for policy 1, policy_version 62712 (0.0010) [2023-10-08 02:25:39,656][52060] Updated weights for policy 0, policy_version 61930 (0.0007) [2023-10-08 02:25:40,014][52060] Updated weights for policy 0, policy_version 61940 (0.0007) [2023-10-08 02:25:40,384][52060] Updated weights for policy 0, policy_version 61950 (0.0007) [2023-10-08 02:25:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 127664128. Throughput: 0: 1711.9, 1: 1731.0. Samples: 31922484. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 02:25:41,211][50642] Avg episode reward: [(0, '19.370'), (1, '17.160')] [2023-10-08 02:25:42,169][52059] Updated weights for policy 1, policy_version 62722 (0.0007) [2023-10-08 02:25:42,537][52059] Updated weights for policy 1, policy_version 62732 (0.0007) [2023-10-08 02:25:42,899][52059] Updated weights for policy 1, policy_version 62742 (0.0007) [2023-10-08 02:25:43,272][52059] Updated weights for policy 1, policy_version 62752 (0.0007) [2023-10-08 02:25:44,353][52060] Updated weights for policy 0, policy_version 61960 (0.0009) [2023-10-08 02:25:44,716][52060] Updated weights for policy 0, policy_version 61970 (0.0008) [2023-10-08 02:25:45,091][52060] Updated weights for policy 0, policy_version 61980 (0.0007) [2023-10-08 02:25:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.6, 300 sec: 13773.7). Total num frames: 127729664. Throughput: 0: 1697.7, 1: 1752.9. Samples: 31943258. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 02:25:46,211][50642] Avg episode reward: [(0, '23.580'), (1, '18.330')] [2023-10-08 02:25:47,148][52059] Updated weights for policy 1, policy_version 62762 (0.0009) [2023-10-08 02:25:47,519][52059] Updated weights for policy 1, policy_version 62772 (0.0009) [2023-10-08 02:25:47,890][52059] Updated weights for policy 1, policy_version 62782 (0.0008) [2023-10-08 02:25:49,021][52060] Updated weights for policy 0, policy_version 61990 (0.0009) [2023-10-08 02:25:49,382][52060] Updated weights for policy 0, policy_version 62000 (0.0008) [2023-10-08 02:25:49,748][52060] Updated weights for policy 0, policy_version 62010 (0.0008) [2023-10-08 02:25:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 127795200. Throughput: 0: 1723.3, 1: 1725.6. Samples: 31953876. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 02:25:51,211][50642] Avg episode reward: [(0, '18.260'), (1, '17.600')] [2023-10-08 02:25:51,768][52059] Updated weights for policy 1, policy_version 62792 (0.0008) [2023-10-08 02:25:52,132][52059] Updated weights for policy 1, policy_version 62802 (0.0008) [2023-10-08 02:25:52,496][52059] Updated weights for policy 1, policy_version 62812 (0.0008) [2023-10-08 02:25:53,547][52060] Updated weights for policy 0, policy_version 62020 (0.0008) [2023-10-08 02:25:53,923][52060] Updated weights for policy 0, policy_version 62030 (0.0010) [2023-10-08 02:25:54,286][52060] Updated weights for policy 0, policy_version 62040 (0.0011) [2023-10-08 02:25:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 127860736. Throughput: 0: 1694.4, 1: 1743.7. Samples: 31974172. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 02:25:56,211][50642] Avg episode reward: [(0, '17.470'), (1, '21.620')] [2023-10-08 02:25:56,425][52059] Updated weights for policy 1, policy_version 62822 (0.0008) [2023-10-08 02:25:56,786][52059] Updated weights for policy 1, policy_version 62832 (0.0009) [2023-10-08 02:25:57,157][52059] Updated weights for policy 1, policy_version 62842 (0.0008) [2023-10-08 02:25:58,380][52060] Updated weights for policy 0, policy_version 62050 (0.0010) [2023-10-08 02:25:58,749][52060] Updated weights for policy 0, policy_version 62060 (0.0008) [2023-10-08 02:25:59,116][52060] Updated weights for policy 0, policy_version 62070 (0.0008) [2023-10-08 02:25:59,481][52060] Updated weights for policy 0, policy_version 62080 (0.0007) [2023-10-08 02:26:01,113][52059] Updated weights for policy 1, policy_version 62852 (0.0007) [2023-10-08 02:26:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 127926272. Throughput: 0: 1705.8, 1: 1758.4. Samples: 31995440. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 02:26:01,211][50642] Avg episode reward: [(0, '22.590'), (1, '18.690')] [2023-10-08 02:26:01,481][52059] Updated weights for policy 1, policy_version 62862 (0.0007) [2023-10-08 02:26:01,850][52059] Updated weights for policy 1, policy_version 62872 (0.0008) [2023-10-08 02:26:03,341][52060] Updated weights for policy 0, policy_version 62090 (0.0007) [2023-10-08 02:26:03,710][52060] Updated weights for policy 0, policy_version 62100 (0.0007) [2023-10-08 02:26:04,080][52060] Updated weights for policy 0, policy_version 62110 (0.0007) [2023-10-08 02:26:05,857][52059] Updated weights for policy 1, policy_version 62882 (0.0007) [2023-10-08 02:26:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 127991808. Throughput: 0: 1698.5, 1: 1737.3. Samples: 32005280. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 02:26:06,211][50642] Avg episode reward: [(0, '24.710'), (1, '19.890')] [2023-10-08 02:26:06,212][51605] Saving new best policy, reward=24.710! [2023-10-08 02:26:06,225][52059] Updated weights for policy 1, policy_version 62892 (0.0008) [2023-10-08 02:26:06,589][52059] Updated weights for policy 1, policy_version 62902 (0.0010) [2023-10-08 02:26:06,956][52059] Updated weights for policy 1, policy_version 62912 (0.0009) [2023-10-08 02:26:08,270][52060] Updated weights for policy 0, policy_version 62120 (0.0009) [2023-10-08 02:26:08,637][52060] Updated weights for policy 0, policy_version 62130 (0.0009) [2023-10-08 02:26:09,012][52060] Updated weights for policy 0, policy_version 62140 (0.0008) [2023-10-08 02:26:10,691][52059] Updated weights for policy 1, policy_version 62922 (0.0009) [2023-10-08 02:26:11,055][52059] Updated weights for policy 1, policy_version 62932 (0.0010) [2023-10-08 02:26:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 128057344. Throughput: 0: 1697.1, 1: 1761.8. Samples: 32026558. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:26:11,211][50642] Avg episode reward: [(0, '18.140'), (1, '19.930')] [2023-10-08 02:26:11,425][52059] Updated weights for policy 1, policy_version 62942 (0.0008) [2023-10-08 02:26:13,037][52060] Updated weights for policy 0, policy_version 62150 (0.0008) [2023-10-08 02:26:13,406][52060] Updated weights for policy 0, policy_version 62160 (0.0007) [2023-10-08 02:26:13,779][52060] Updated weights for policy 0, policy_version 62170 (0.0008) [2023-10-08 02:26:15,477][52059] Updated weights for policy 1, policy_version 62952 (0.0010) [2023-10-08 02:26:15,860][52059] Updated weights for policy 1, policy_version 62962 (0.0009) [2023-10-08 02:26:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 128122880. Throughput: 0: 1727.3, 1: 1739.6. Samples: 32047096. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:26:16,211][50642] Avg episode reward: [(0, '18.670'), (1, '22.000')] [2023-10-08 02:26:16,235][52059] Updated weights for policy 1, policy_version 62972 (0.0008) [2023-10-08 02:26:17,614][52060] Updated weights for policy 0, policy_version 62180 (0.0009) [2023-10-08 02:26:17,993][52060] Updated weights for policy 0, policy_version 62190 (0.0008) [2023-10-08 02:26:18,355][52060] Updated weights for policy 0, policy_version 62200 (0.0008) [2023-10-08 02:26:20,078][52059] Updated weights for policy 1, policy_version 62982 (0.0010) [2023-10-08 02:26:20,443][52059] Updated weights for policy 1, policy_version 62992 (0.0008) [2023-10-08 02:26:20,812][52059] Updated weights for policy 1, policy_version 63002 (0.0009) [2023-10-08 02:26:21,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 128221184. Throughput: 0: 1694.5, 1: 1756.0. Samples: 32057148. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:26:21,211][50642] Avg episode reward: [(0, '22.990'), (1, '18.350')] [2023-10-08 02:26:22,469][52060] Updated weights for policy 0, policy_version 62210 (0.0010) [2023-10-08 02:26:22,837][52060] Updated weights for policy 0, policy_version 62220 (0.0007) [2023-10-08 02:26:23,203][52060] Updated weights for policy 0, policy_version 62230 (0.0008) [2023-10-08 02:26:23,577][52060] Updated weights for policy 0, policy_version 62240 (0.0008) [2023-10-08 02:26:24,635][52059] Updated weights for policy 1, policy_version 63012 (0.0008) [2023-10-08 02:26:24,994][52059] Updated weights for policy 1, policy_version 63022 (0.0010) [2023-10-08 02:26:25,367][52059] Updated weights for policy 1, policy_version 63032 (0.0008) [2023-10-08 02:26:26,210][50642] Fps is (10 sec: 16384.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 128286720. Throughput: 0: 1709.3, 1: 1748.4. Samples: 32078080. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:26:26,211][50642] Avg episode reward: [(0, '17.260'), (1, '19.610')] [2023-10-08 02:26:27,529][52060] Updated weights for policy 0, policy_version 62250 (0.0008) [2023-10-08 02:26:27,895][52060] Updated weights for policy 0, policy_version 62260 (0.0010) [2023-10-08 02:26:28,261][52060] Updated weights for policy 0, policy_version 62270 (0.0009) [2023-10-08 02:26:29,175][52059] Updated weights for policy 1, policy_version 63042 (0.0009) [2023-10-08 02:26:29,546][52059] Updated weights for policy 1, policy_version 63052 (0.0007) [2023-10-08 02:26:29,908][52059] Updated weights for policy 1, policy_version 63062 (0.0007) [2023-10-08 02:26:30,265][52059] Updated weights for policy 1, policy_version 63072 (0.0010) [2023-10-08 02:26:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 128352256. Throughput: 0: 1726.8, 1: 1726.7. Samples: 32098664. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:26:31,211][50642] Avg episode reward: [(0, '17.820'), (1, '22.980')] [2023-10-08 02:26:31,932][52060] Updated weights for policy 0, policy_version 62280 (0.0007) [2023-10-08 02:26:32,312][52060] Updated weights for policy 0, policy_version 62290 (0.0008) [2023-10-08 02:26:32,679][52060] Updated weights for policy 0, policy_version 62300 (0.0008) [2023-10-08 02:26:34,103][52059] Updated weights for policy 1, policy_version 63082 (0.0009) [2023-10-08 02:26:34,466][52059] Updated weights for policy 1, policy_version 63092 (0.0009) [2023-10-08 02:26:34,825][52059] Updated weights for policy 1, policy_version 63102 (0.0009) [2023-10-08 02:26:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 128417792. Throughput: 0: 1700.3, 1: 1752.4. Samples: 32109250. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:26:36,211][50642] Avg episode reward: [(0, '20.880'), (1, '19.730')] [2023-10-08 02:26:36,623][52060] Updated weights for policy 0, policy_version 62310 (0.0008) [2023-10-08 02:26:36,994][52060] Updated weights for policy 0, policy_version 62320 (0.0009) [2023-10-08 02:26:37,358][52060] Updated weights for policy 0, policy_version 62330 (0.0009) [2023-10-08 02:26:38,609][52059] Updated weights for policy 1, policy_version 63112 (0.0009) [2023-10-08 02:26:38,981][52059] Updated weights for policy 1, policy_version 63122 (0.0009) [2023-10-08 02:26:39,341][52059] Updated weights for policy 1, policy_version 63132 (0.0010) [2023-10-08 02:26:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 128483328. Throughput: 0: 1729.4, 1: 1721.9. Samples: 32129480. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:26:41,211][50642] Avg episode reward: [(0, '21.060'), (1, '18.630')] [2023-10-08 02:26:41,291][52060] Updated weights for policy 0, policy_version 62340 (0.0008) [2023-10-08 02:26:41,660][52060] Updated weights for policy 0, policy_version 62350 (0.0010) [2023-10-08 02:26:42,023][52060] Updated weights for policy 0, policy_version 62360 (0.0010) [2023-10-08 02:26:43,270][52059] Updated weights for policy 1, policy_version 63142 (0.0009) [2023-10-08 02:26:43,631][52059] Updated weights for policy 1, policy_version 63152 (0.0009) [2023-10-08 02:26:43,995][52059] Updated weights for policy 1, policy_version 63162 (0.0010) [2023-10-08 02:26:45,833][52060] Updated weights for policy 0, policy_version 62370 (0.0008) [2023-10-08 02:26:46,200][52060] Updated weights for policy 0, policy_version 62380 (0.0008) [2023-10-08 02:26:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 128548864. Throughput: 0: 1735.3, 1: 1721.1. Samples: 32150980. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:26:46,211][50642] Avg episode reward: [(0, '18.100'), (1, '20.160')] [2023-10-08 02:26:46,220][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000063168_64684032.pth... [2023-10-08 02:26:46,256][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000061536_63012864.pth [2023-10-08 02:26:46,575][52060] Updated weights for policy 0, policy_version 62390 (0.0007) [2023-10-08 02:26:46,935][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000062400_63897600.pth... [2023-10-08 02:26:46,937][52060] Updated weights for policy 0, policy_version 62400 (0.0007) [2023-10-08 02:26:46,974][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000060800_62259200.pth [2023-10-08 02:26:47,943][52059] Updated weights for policy 1, policy_version 63172 (0.0009) [2023-10-08 02:26:48,308][52059] Updated weights for policy 1, policy_version 63182 (0.0008) [2023-10-08 02:26:48,669][52059] Updated weights for policy 1, policy_version 63192 (0.0010) [2023-10-08 02:26:50,919][52060] Updated weights for policy 0, policy_version 62410 (0.0009) [2023-10-08 02:26:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 128614400. Throughput: 0: 1730.2, 1: 1726.3. Samples: 32160820. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:26:51,211][50642] Avg episode reward: [(0, '19.440'), (1, '25.870')] [2023-10-08 02:26:51,284][52060] Updated weights for policy 0, policy_version 62420 (0.0010) [2023-10-08 02:26:51,653][52060] Updated weights for policy 0, policy_version 62430 (0.0009) [2023-10-08 02:26:52,589][52059] Updated weights for policy 1, policy_version 63202 (0.0010) [2023-10-08 02:26:52,966][52059] Updated weights for policy 1, policy_version 63212 (0.0008) [2023-10-08 02:26:53,327][52059] Updated weights for policy 1, policy_version 63222 (0.0010) [2023-10-08 02:26:53,693][52059] Updated weights for policy 1, policy_version 63232 (0.0009) [2023-10-08 02:26:55,724][52060] Updated weights for policy 0, policy_version 62440 (0.0009) [2023-10-08 02:26:56,099][52060] Updated weights for policy 0, policy_version 62450 (0.0008) [2023-10-08 02:26:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 128679936. Throughput: 0: 1741.2, 1: 1717.7. Samples: 32182208. Policy #0 lag: (min: 8.0, avg: 29.1, max: 40.0) [2023-10-08 02:26:56,211][50642] Avg episode reward: [(0, '22.750'), (1, '18.910')] [2023-10-08 02:26:56,466][52060] Updated weights for policy 0, policy_version 62460 (0.0008) [2023-10-08 02:26:57,604][52059] Updated weights for policy 1, policy_version 63242 (0.0010) [2023-10-08 02:26:57,972][52059] Updated weights for policy 1, policy_version 63252 (0.0010) [2023-10-08 02:26:58,333][52059] Updated weights for policy 1, policy_version 63262 (0.0011) [2023-10-08 02:27:00,359][52060] Updated weights for policy 0, policy_version 62470 (0.0009) [2023-10-08 02:27:00,734][52060] Updated weights for policy 0, policy_version 62480 (0.0009) [2023-10-08 02:27:01,093][52060] Updated weights for policy 0, policy_version 62490 (0.0010) [2023-10-08 02:27:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 128745472. Throughput: 0: 1721.0, 1: 1738.7. Samples: 32202782. Policy #0 lag: (min: 8.0, avg: 29.1, max: 40.0) [2023-10-08 02:27:01,211][50642] Avg episode reward: [(0, '19.250'), (1, '20.560')] [2023-10-08 02:27:02,433][52059] Updated weights for policy 1, policy_version 63272 (0.0009) [2023-10-08 02:27:02,793][52059] Updated weights for policy 1, policy_version 63282 (0.0008) [2023-10-08 02:27:03,158][52059] Updated weights for policy 1, policy_version 63292 (0.0007) [2023-10-08 02:27:05,142][52060] Updated weights for policy 0, policy_version 62500 (0.0009) [2023-10-08 02:27:05,517][52060] Updated weights for policy 0, policy_version 62510 (0.0009) [2023-10-08 02:27:05,875][52060] Updated weights for policy 0, policy_version 62520 (0.0007) [2023-10-08 02:27:06,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 128843776. Throughput: 0: 1741.2, 1: 1717.3. Samples: 32212780. Policy #0 lag: (min: 8.0, avg: 29.1, max: 40.0) [2023-10-08 02:27:06,211][50642] Avg episode reward: [(0, '17.620'), (1, '20.850')] [2023-10-08 02:27:07,128][52059] Updated weights for policy 1, policy_version 63302 (0.0007) [2023-10-08 02:27:07,488][52059] Updated weights for policy 1, policy_version 63312 (0.0007) [2023-10-08 02:27:07,853][52059] Updated weights for policy 1, policy_version 63322 (0.0008) [2023-10-08 02:27:09,865][52060] Updated weights for policy 0, policy_version 62530 (0.0009) [2023-10-08 02:27:10,238][52060] Updated weights for policy 0, policy_version 62540 (0.0009) [2023-10-08 02:27:10,605][52060] Updated weights for policy 0, policy_version 62550 (0.0009) [2023-10-08 02:27:10,972][52060] Updated weights for policy 0, policy_version 62560 (0.0010) [2023-10-08 02:27:11,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 128909312. Throughput: 0: 1742.7, 1: 1725.6. Samples: 32234154. Policy #0 lag: (min: 8.0, avg: 29.1, max: 40.0) [2023-10-08 02:27:11,211][50642] Avg episode reward: [(0, '21.540'), (1, '22.180')] [2023-10-08 02:27:11,781][52059] Updated weights for policy 1, policy_version 63332 (0.0008) [2023-10-08 02:27:12,144][52059] Updated weights for policy 1, policy_version 63342 (0.0009) [2023-10-08 02:27:12,501][52059] Updated weights for policy 1, policy_version 63352 (0.0008) [2023-10-08 02:27:15,002][52060] Updated weights for policy 0, policy_version 62570 (0.0007) [2023-10-08 02:27:15,377][52060] Updated weights for policy 0, policy_version 62580 (0.0007) [2023-10-08 02:27:15,734][52060] Updated weights for policy 0, policy_version 62590 (0.0009) [2023-10-08 02:27:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 128974848. Throughput: 0: 1709.4, 1: 1747.2. Samples: 32254208. Policy #0 lag: (min: 8.0, avg: 29.1, max: 40.0) [2023-10-08 02:27:16,211][50642] Avg episode reward: [(0, '22.680'), (1, '18.820')] [2023-10-08 02:27:16,384][52059] Updated weights for policy 1, policy_version 63362 (0.0009) [2023-10-08 02:27:16,758][52059] Updated weights for policy 1, policy_version 63372 (0.0010) [2023-10-08 02:27:17,113][52059] Updated weights for policy 1, policy_version 63382 (0.0010) [2023-10-08 02:27:17,473][52059] Updated weights for policy 1, policy_version 63392 (0.0010) [2023-10-08 02:27:19,597][52060] Updated weights for policy 0, policy_version 62600 (0.0008) [2023-10-08 02:27:19,966][52060] Updated weights for policy 0, policy_version 62610 (0.0007) [2023-10-08 02:27:20,331][52060] Updated weights for policy 0, policy_version 62620 (0.0007) [2023-10-08 02:27:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 129040384. Throughput: 0: 1739.9, 1: 1720.9. Samples: 32264988. Policy #0 lag: (min: 8.0, avg: 29.1, max: 40.0) [2023-10-08 02:27:21,211][50642] Avg episode reward: [(0, '19.260'), (1, '21.460')] [2023-10-08 02:27:21,444][52059] Updated weights for policy 1, policy_version 63402 (0.0010) [2023-10-08 02:27:21,819][52059] Updated weights for policy 1, policy_version 63412 (0.0011) [2023-10-08 02:27:22,190][52059] Updated weights for policy 1, policy_version 63422 (0.0011) [2023-10-08 02:27:24,063][52060] Updated weights for policy 0, policy_version 62630 (0.0008) [2023-10-08 02:27:24,441][52060] Updated weights for policy 0, policy_version 62640 (0.0009) [2023-10-08 02:27:24,803][52060] Updated weights for policy 0, policy_version 62650 (0.0009) [2023-10-08 02:27:26,182][52059] Updated weights for policy 1, policy_version 63432 (0.0009) [2023-10-08 02:27:26,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 129105920. Throughput: 0: 1717.8, 1: 1741.5. Samples: 32285152. Policy #0 lag: (min: 8.0, avg: 29.1, max: 40.0) [2023-10-08 02:27:26,211][50642] Avg episode reward: [(0, '19.310'), (1, '15.500')] [2023-10-08 02:27:26,536][52059] Updated weights for policy 1, policy_version 63442 (0.0008) [2023-10-08 02:27:26,908][52059] Updated weights for policy 1, policy_version 63452 (0.0007) [2023-10-08 02:27:28,747][52060] Updated weights for policy 0, policy_version 62660 (0.0008) [2023-10-08 02:27:29,118][52060] Updated weights for policy 0, policy_version 62670 (0.0009) [2023-10-08 02:27:29,491][52060] Updated weights for policy 0, policy_version 62680 (0.0008) [2023-10-08 02:27:30,853][52059] Updated weights for policy 1, policy_version 63462 (0.0008) [2023-10-08 02:27:31,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 129171456. Throughput: 0: 1710.7, 1: 1734.8. Samples: 32306032. Policy #0 lag: (min: 8.0, avg: 29.1, max: 40.0) [2023-10-08 02:27:31,211][50642] Avg episode reward: [(0, '21.520'), (1, '14.970')] [2023-10-08 02:27:31,213][52059] Updated weights for policy 1, policy_version 63472 (0.0007) [2023-10-08 02:27:31,579][52059] Updated weights for policy 1, policy_version 63482 (0.0010) [2023-10-08 02:27:33,294][52060] Updated weights for policy 0, policy_version 62690 (0.0007) [2023-10-08 02:27:33,666][52060] Updated weights for policy 0, policy_version 62700 (0.0010) [2023-10-08 02:27:34,041][52060] Updated weights for policy 0, policy_version 62710 (0.0010) [2023-10-08 02:27:34,401][52060] Updated weights for policy 0, policy_version 62720 (0.0008) [2023-10-08 02:27:35,425][52059] Updated weights for policy 1, policy_version 63492 (0.0009) [2023-10-08 02:27:35,797][52059] Updated weights for policy 1, policy_version 63502 (0.0009) [2023-10-08 02:27:36,162][52059] Updated weights for policy 1, policy_version 63512 (0.0008) [2023-10-08 02:27:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 129236992. Throughput: 0: 1726.4, 1: 1738.2. Samples: 32316728. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 02:27:36,211][50642] Avg episode reward: [(0, '18.220'), (1, '17.970')] [2023-10-08 02:27:38,468][52060] Updated weights for policy 0, policy_version 62730 (0.0007) [2023-10-08 02:27:38,843][52060] Updated weights for policy 0, policy_version 62740 (0.0009) [2023-10-08 02:27:39,210][52060] Updated weights for policy 0, policy_version 62750 (0.0009) [2023-10-08 02:27:40,180][52059] Updated weights for policy 1, policy_version 63522 (0.0009) [2023-10-08 02:27:40,551][52059] Updated weights for policy 1, policy_version 63532 (0.0008) [2023-10-08 02:27:40,910][52059] Updated weights for policy 1, policy_version 63542 (0.0010) [2023-10-08 02:27:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 129302528. Throughput: 0: 1706.8, 1: 1740.1. Samples: 32337318. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 02:27:41,211][50642] Avg episode reward: [(0, '18.210'), (1, '16.870')] [2023-10-08 02:27:41,272][52059] Updated weights for policy 1, policy_version 63552 (0.0009) [2023-10-08 02:27:43,134][52060] Updated weights for policy 0, policy_version 62760 (0.0008) [2023-10-08 02:27:43,497][52060] Updated weights for policy 0, policy_version 62770 (0.0007) [2023-10-08 02:27:43,866][52060] Updated weights for policy 0, policy_version 62780 (0.0008) [2023-10-08 02:27:45,246][52059] Updated weights for policy 1, policy_version 63562 (0.0010) [2023-10-08 02:27:45,611][52059] Updated weights for policy 1, policy_version 63572 (0.0010) [2023-10-08 02:27:45,972][52059] Updated weights for policy 1, policy_version 63582 (0.0008) [2023-10-08 02:27:46,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 129400832. Throughput: 0: 1727.6, 1: 1718.7. Samples: 32357866. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 02:27:46,211][50642] Avg episode reward: [(0, '21.440'), (1, '17.450')] [2023-10-08 02:27:47,798][52060] Updated weights for policy 0, policy_version 62790 (0.0009) [2023-10-08 02:27:48,162][52060] Updated weights for policy 0, policy_version 62800 (0.0008) [2023-10-08 02:27:48,532][52060] Updated weights for policy 0, policy_version 62810 (0.0008) [2023-10-08 02:27:49,963][52059] Updated weights for policy 1, policy_version 63592 (0.0007) [2023-10-08 02:27:50,332][52059] Updated weights for policy 1, policy_version 63602 (0.0008) [2023-10-08 02:27:50,689][52059] Updated weights for policy 1, policy_version 63612 (0.0008) [2023-10-08 02:27:51,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 129466368. Throughput: 0: 1708.1, 1: 1750.3. Samples: 32368406. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 02:27:51,211][50642] Avg episode reward: [(0, '20.010'), (1, '19.070')] [2023-10-08 02:27:52,452][52060] Updated weights for policy 0, policy_version 62820 (0.0007) [2023-10-08 02:27:52,820][52060] Updated weights for policy 0, policy_version 62830 (0.0008) [2023-10-08 02:27:53,186][52060] Updated weights for policy 0, policy_version 62840 (0.0009) [2023-10-08 02:27:54,703][52059] Updated weights for policy 1, policy_version 63622 (0.0009) [2023-10-08 02:27:55,060][52059] Updated weights for policy 1, policy_version 63632 (0.0009) [2023-10-08 02:27:55,430][52059] Updated weights for policy 1, policy_version 63642 (0.0008) [2023-10-08 02:27:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 129531904. Throughput: 0: 1715.3, 1: 1736.7. Samples: 32389494. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 02:27:56,211][50642] Avg episode reward: [(0, '20.360'), (1, '18.520')] [2023-10-08 02:27:57,149][52060] Updated weights for policy 0, policy_version 62850 (0.0007) [2023-10-08 02:27:57,505][52060] Updated weights for policy 0, policy_version 62860 (0.0007) [2023-10-08 02:27:57,877][52060] Updated weights for policy 0, policy_version 62870 (0.0008) [2023-10-08 02:27:58,246][52060] Updated weights for policy 0, policy_version 62880 (0.0007) [2023-10-08 02:27:59,212][52059] Updated weights for policy 1, policy_version 63652 (0.0009) [2023-10-08 02:27:59,582][52059] Updated weights for policy 1, policy_version 63662 (0.0009) [2023-10-08 02:27:59,943][52059] Updated weights for policy 1, policy_version 63672 (0.0008) [2023-10-08 02:28:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 129597440. Throughput: 0: 1749.3, 1: 1715.6. Samples: 32410130. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 02:28:01,211][50642] Avg episode reward: [(0, '20.670'), (1, '16.990')] [2023-10-08 02:28:02,138][52060] Updated weights for policy 0, policy_version 62890 (0.0007) [2023-10-08 02:28:02,502][52060] Updated weights for policy 0, policy_version 62900 (0.0008) [2023-10-08 02:28:02,862][52060] Updated weights for policy 0, policy_version 62910 (0.0009) [2023-10-08 02:28:03,826][52059] Updated weights for policy 1, policy_version 63682 (0.0010) [2023-10-08 02:28:04,190][52059] Updated weights for policy 1, policy_version 63692 (0.0008) [2023-10-08 02:28:04,559][52059] Updated weights for policy 1, policy_version 63702 (0.0009) [2023-10-08 02:28:04,928][52059] Updated weights for policy 1, policy_version 63712 (0.0008) [2023-10-08 02:28:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13884.7). Total num frames: 129662976. Throughput: 0: 1716.4, 1: 1747.0. Samples: 32420840. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 02:28:06,211][50642] Avg episode reward: [(0, '19.400'), (1, '19.880')] [2023-10-08 02:28:06,684][52060] Updated weights for policy 0, policy_version 62920 (0.0010) [2023-10-08 02:28:07,061][52060] Updated weights for policy 0, policy_version 62930 (0.0012) [2023-10-08 02:28:07,422][52060] Updated weights for policy 0, policy_version 62940 (0.0007) [2023-10-08 02:28:08,894][52059] Updated weights for policy 1, policy_version 63722 (0.0011) [2023-10-08 02:28:09,264][52059] Updated weights for policy 1, policy_version 63732 (0.0008) [2023-10-08 02:28:09,621][52059] Updated weights for policy 1, policy_version 63742 (0.0008) [2023-10-08 02:28:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 129728512. Throughput: 0: 1739.6, 1: 1725.5. Samples: 32441080. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 02:28:11,211][50642] Avg episode reward: [(0, '20.100'), (1, '18.950')] [2023-10-08 02:28:11,488][52060] Updated weights for policy 0, policy_version 62950 (0.0007) [2023-10-08 02:28:11,851][52060] Updated weights for policy 0, policy_version 62960 (0.0008) [2023-10-08 02:28:12,218][52060] Updated weights for policy 0, policy_version 62970 (0.0007) [2023-10-08 02:28:13,462][52059] Updated weights for policy 1, policy_version 63752 (0.0008) [2023-10-08 02:28:13,830][52059] Updated weights for policy 1, policy_version 63762 (0.0007) [2023-10-08 02:28:14,187][52059] Updated weights for policy 1, policy_version 63772 (0.0010) [2023-10-08 02:28:16,100][52060] Updated weights for policy 0, policy_version 62980 (0.0008) [2023-10-08 02:28:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 129794048. Throughput: 0: 1745.3, 1: 1729.5. Samples: 32462400. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 02:28:16,211][50642] Avg episode reward: [(0, '19.980'), (1, '18.940')] [2023-10-08 02:28:16,467][52060] Updated weights for policy 0, policy_version 62990 (0.0010) [2023-10-08 02:28:16,839][52060] Updated weights for policy 0, policy_version 63000 (0.0009) [2023-10-08 02:28:18,086][52059] Updated weights for policy 1, policy_version 63782 (0.0008) [2023-10-08 02:28:18,454][52059] Updated weights for policy 1, policy_version 63792 (0.0007) [2023-10-08 02:28:18,820][52059] Updated weights for policy 1, policy_version 63802 (0.0007) [2023-10-08 02:28:20,840][52060] Updated weights for policy 0, policy_version 63010 (0.0008) [2023-10-08 02:28:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 129859584. Throughput: 0: 1725.1, 1: 1728.1. Samples: 32472120. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:28:21,211][50642] Avg episode reward: [(0, '19.690'), (1, '20.650')] [2023-10-08 02:28:21,215][52060] Updated weights for policy 0, policy_version 63020 (0.0008) [2023-10-08 02:28:21,576][52060] Updated weights for policy 0, policy_version 63030 (0.0009) [2023-10-08 02:28:21,939][52060] Updated weights for policy 0, policy_version 63040 (0.0010) [2023-10-08 02:28:22,712][52059] Updated weights for policy 1, policy_version 63812 (0.0008) [2023-10-08 02:28:23,081][52059] Updated weights for policy 1, policy_version 63822 (0.0009) [2023-10-08 02:28:23,441][52059] Updated weights for policy 1, policy_version 63832 (0.0008) [2023-10-08 02:28:25,923][52060] Updated weights for policy 0, policy_version 63050 (0.0008) [2023-10-08 02:28:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 129925120. Throughput: 0: 1741.0, 1: 1722.9. Samples: 32493194. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:28:26,211][50642] Avg episode reward: [(0, '19.620'), (1, '21.580')] [2023-10-08 02:28:26,293][52060] Updated weights for policy 0, policy_version 63060 (0.0007) [2023-10-08 02:28:26,655][52060] Updated weights for policy 0, policy_version 63070 (0.0007) [2023-10-08 02:28:27,434][52059] Updated weights for policy 1, policy_version 63842 (0.0008) [2023-10-08 02:28:27,799][52059] Updated weights for policy 1, policy_version 63852 (0.0008) [2023-10-08 02:28:28,153][52059] Updated weights for policy 1, policy_version 63862 (0.0008) [2023-10-08 02:28:28,512][52059] Updated weights for policy 1, policy_version 63872 (0.0010) [2023-10-08 02:28:30,608][52060] Updated weights for policy 0, policy_version 63080 (0.0008) [2023-10-08 02:28:30,977][52060] Updated weights for policy 0, policy_version 63090 (0.0010) [2023-10-08 02:28:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 129990656. Throughput: 0: 1721.7, 1: 1747.4. Samples: 32513976. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:28:31,211][50642] Avg episode reward: [(0, '20.790'), (1, '21.110')] [2023-10-08 02:28:31,355][52060] Updated weights for policy 0, policy_version 63100 (0.0007) [2023-10-08 02:28:32,338][52059] Updated weights for policy 1, policy_version 63882 (0.0008) [2023-10-08 02:28:32,702][52059] Updated weights for policy 1, policy_version 63892 (0.0007) [2023-10-08 02:28:33,070][52059] Updated weights for policy 1, policy_version 63902 (0.0008) [2023-10-08 02:28:35,226][52060] Updated weights for policy 0, policy_version 63110 (0.0007) [2023-10-08 02:28:35,590][52060] Updated weights for policy 0, policy_version 63120 (0.0007) [2023-10-08 02:28:35,969][52060] Updated weights for policy 0, policy_version 63130 (0.0007) [2023-10-08 02:28:36,210][50642] Fps is (10 sec: 16384.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 130088960. Throughput: 0: 1740.8, 1: 1718.7. Samples: 32524082. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:28:36,211][50642] Avg episode reward: [(0, '18.900'), (1, '18.940')] [2023-10-08 02:28:37,196][52059] Updated weights for policy 1, policy_version 63912 (0.0008) [2023-10-08 02:28:37,565][52059] Updated weights for policy 1, policy_version 63922 (0.0008) [2023-10-08 02:28:37,928][52059] Updated weights for policy 1, policy_version 63932 (0.0009) [2023-10-08 02:28:39,891][52060] Updated weights for policy 0, policy_version 63140 (0.0007) [2023-10-08 02:28:40,260][52060] Updated weights for policy 0, policy_version 63150 (0.0008) [2023-10-08 02:28:40,627][52060] Updated weights for policy 0, policy_version 63160 (0.0008) [2023-10-08 02:28:41,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 130154496. Throughput: 0: 1735.0, 1: 1727.0. Samples: 32545286. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:28:41,211][50642] Avg episode reward: [(0, '19.300'), (1, '19.820')] [2023-10-08 02:28:41,820][52059] Updated weights for policy 1, policy_version 63942 (0.0009) [2023-10-08 02:28:42,187][52059] Updated weights for policy 1, policy_version 63952 (0.0010) [2023-10-08 02:28:42,548][52059] Updated weights for policy 1, policy_version 63962 (0.0011) [2023-10-08 02:28:44,734][52060] Updated weights for policy 0, policy_version 63170 (0.0007) [2023-10-08 02:28:45,095][52060] Updated weights for policy 0, policy_version 63180 (0.0008) [2023-10-08 02:28:45,472][52060] Updated weights for policy 0, policy_version 63190 (0.0008) [2023-10-08 02:28:45,840][52060] Updated weights for policy 0, policy_version 63200 (0.0008) [2023-10-08 02:28:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 130220032. Throughput: 0: 1708.3, 1: 1746.3. Samples: 32565584. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:28:46,211][50642] Avg episode reward: [(0, '20.280'), (1, '18.980')] [2023-10-08 02:28:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000063200_64716800.pth... [2023-10-08 02:28:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000063968_65503232.pth... [2023-10-08 02:28:46,250][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000061568_63045632.pth [2023-10-08 02:28:46,258][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000062368_63864832.pth [2023-10-08 02:28:46,541][52059] Updated weights for policy 1, policy_version 63972 (0.0009) [2023-10-08 02:28:46,901][52059] Updated weights for policy 1, policy_version 63982 (0.0007) [2023-10-08 02:28:47,260][52059] Updated weights for policy 1, policy_version 63992 (0.0007) [2023-10-08 02:28:49,826][52060] Updated weights for policy 0, policy_version 63210 (0.0010) [2023-10-08 02:28:50,197][52060] Updated weights for policy 0, policy_version 63220 (0.0010) [2023-10-08 02:28:50,577][52060] Updated weights for policy 0, policy_version 63230 (0.0011) [2023-10-08 02:28:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 130285568. Throughput: 0: 1739.6, 1: 1715.1. Samples: 32576298. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:28:51,211][50642] Avg episode reward: [(0, '18.350'), (1, '19.270')] [2023-10-08 02:28:51,252][52059] Updated weights for policy 1, policy_version 64002 (0.0007) [2023-10-08 02:28:51,620][52059] Updated weights for policy 1, policy_version 64012 (0.0009) [2023-10-08 02:28:51,981][52059] Updated weights for policy 1, policy_version 64022 (0.0008) [2023-10-08 02:28:52,343][52059] Updated weights for policy 1, policy_version 64032 (0.0007) [2023-10-08 02:28:54,365][52060] Updated weights for policy 0, policy_version 63240 (0.0010) [2023-10-08 02:28:54,731][52060] Updated weights for policy 0, policy_version 63250 (0.0009) [2023-10-08 02:28:55,105][52060] Updated weights for policy 0, policy_version 63260 (0.0008) [2023-10-08 02:28:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 130351104. Throughput: 0: 1715.8, 1: 1744.8. Samples: 32596804. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:28:56,211][50642] Avg episode reward: [(0, '18.430'), (1, '17.950')] [2023-10-08 02:28:56,237][52059] Updated weights for policy 1, policy_version 64042 (0.0011) [2023-10-08 02:28:56,590][52059] Updated weights for policy 1, policy_version 64052 (0.0009) [2023-10-08 02:28:56,967][52059] Updated weights for policy 1, policy_version 64062 (0.0011) [2023-10-08 02:28:59,061][52060] Updated weights for policy 0, policy_version 63270 (0.0009) [2023-10-08 02:28:59,433][52060] Updated weights for policy 0, policy_version 63280 (0.0010) [2023-10-08 02:28:59,794][52060] Updated weights for policy 0, policy_version 63290 (0.0008) [2023-10-08 02:29:00,825][52059] Updated weights for policy 1, policy_version 64072 (0.0009) [2023-10-08 02:29:01,189][52059] Updated weights for policy 1, policy_version 64082 (0.0010) [2023-10-08 02:29:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 130416640. Throughput: 0: 1700.0, 1: 1737.8. Samples: 32617104. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:29:01,211][50642] Avg episode reward: [(0, '20.490'), (1, '20.630')] [2023-10-08 02:29:01,551][52059] Updated weights for policy 1, policy_version 64092 (0.0010) [2023-10-08 02:29:03,782][52060] Updated weights for policy 0, policy_version 63300 (0.0009) [2023-10-08 02:29:04,156][52060] Updated weights for policy 0, policy_version 63310 (0.0011) [2023-10-08 02:29:04,526][52060] Updated weights for policy 0, policy_version 63320 (0.0009) [2023-10-08 02:29:05,603][52059] Updated weights for policy 1, policy_version 64102 (0.0008) [2023-10-08 02:29:05,971][52059] Updated weights for policy 1, policy_version 64112 (0.0007) [2023-10-08 02:29:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 130482176. Throughput: 0: 1727.4, 1: 1741.8. Samples: 32628232. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:29:06,211][50642] Avg episode reward: [(0, '20.360'), (1, '18.290')] [2023-10-08 02:29:06,342][52059] Updated weights for policy 1, policy_version 64122 (0.0007) [2023-10-08 02:29:08,566][52060] Updated weights for policy 0, policy_version 63330 (0.0010) [2023-10-08 02:29:08,939][52060] Updated weights for policy 0, policy_version 63340 (0.0008) [2023-10-08 02:29:09,313][52060] Updated weights for policy 0, policy_version 63350 (0.0009) [2023-10-08 02:29:09,672][52060] Updated weights for policy 0, policy_version 63360 (0.0009) [2023-10-08 02:29:10,203][52059] Updated weights for policy 1, policy_version 64132 (0.0008) [2023-10-08 02:29:10,574][52059] Updated weights for policy 1, policy_version 64142 (0.0010) [2023-10-08 02:29:10,940][52059] Updated weights for policy 1, policy_version 64152 (0.0010) [2023-10-08 02:29:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 130547712. Throughput: 0: 1698.3, 1: 1752.9. Samples: 32648494. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:29:11,211][50642] Avg episode reward: [(0, '18.830'), (1, '20.550')] [2023-10-08 02:29:13,609][52060] Updated weights for policy 0, policy_version 63370 (0.0007) [2023-10-08 02:29:13,962][52060] Updated weights for policy 0, policy_version 63380 (0.0008) [2023-10-08 02:29:14,331][52060] Updated weights for policy 0, policy_version 63390 (0.0010) [2023-10-08 02:29:14,580][52059] Updated weights for policy 1, policy_version 64162 (0.0009) [2023-10-08 02:29:14,940][52059] Updated weights for policy 1, policy_version 64172 (0.0008) [2023-10-08 02:29:15,313][52059] Updated weights for policy 1, policy_version 64182 (0.0011) [2023-10-08 02:29:15,670][52059] Updated weights for policy 1, policy_version 64192 (0.0009) [2023-10-08 02:29:16,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 130646016. Throughput: 0: 1715.1, 1: 1725.0. Samples: 32668782. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:29:16,211][50642] Avg episode reward: [(0, '20.490'), (1, '17.180')] [2023-10-08 02:29:18,371][52060] Updated weights for policy 0, policy_version 63400 (0.0007) [2023-10-08 02:29:18,743][52060] Updated weights for policy 0, policy_version 63410 (0.0009) [2023-10-08 02:29:19,108][52060] Updated weights for policy 0, policy_version 63420 (0.0009) [2023-10-08 02:29:19,538][52059] Updated weights for policy 1, policy_version 64202 (0.0007) [2023-10-08 02:29:19,903][52059] Updated weights for policy 1, policy_version 64212 (0.0008) [2023-10-08 02:29:20,269][52059] Updated weights for policy 1, policy_version 64222 (0.0008) [2023-10-08 02:29:21,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 130711552. Throughput: 0: 1710.0, 1: 1761.0. Samples: 32680276. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:29:21,211][50642] Avg episode reward: [(0, '20.100'), (1, '21.550')] [2023-10-08 02:29:23,097][52060] Updated weights for policy 0, policy_version 63430 (0.0008) [2023-10-08 02:29:23,467][52060] Updated weights for policy 0, policy_version 63440 (0.0009) [2023-10-08 02:29:23,839][52060] Updated weights for policy 0, policy_version 63450 (0.0009) [2023-10-08 02:29:24,154][52059] Updated weights for policy 1, policy_version 64232 (0.0009) [2023-10-08 02:29:24,539][52059] Updated weights for policy 1, policy_version 64242 (0.0008) [2023-10-08 02:29:24,900][52059] Updated weights for policy 1, policy_version 64252 (0.0008) [2023-10-08 02:29:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 130777088. Throughput: 0: 1698.8, 1: 1740.5. Samples: 32700056. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:29:26,211][50642] Avg episode reward: [(0, '20.480'), (1, '18.050')] [2023-10-08 02:29:27,780][52060] Updated weights for policy 0, policy_version 63460 (0.0007) [2023-10-08 02:29:28,151][52060] Updated weights for policy 0, policy_version 63470 (0.0009) [2023-10-08 02:29:28,525][52060] Updated weights for policy 0, policy_version 63480 (0.0008) [2023-10-08 02:29:28,816][52059] Updated weights for policy 1, policy_version 64262 (0.0007) [2023-10-08 02:29:29,181][52059] Updated weights for policy 1, policy_version 64272 (0.0007) [2023-10-08 02:29:29,547][52059] Updated weights for policy 1, policy_version 64282 (0.0009) [2023-10-08 02:29:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 130842624. Throughput: 0: 1724.1, 1: 1732.8. Samples: 32721146. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:29:31,211][50642] Avg episode reward: [(0, '19.380'), (1, '19.910')] [2023-10-08 02:29:32,420][52060] Updated weights for policy 0, policy_version 63490 (0.0009) [2023-10-08 02:29:32,784][52060] Updated weights for policy 0, policy_version 63500 (0.0008) [2023-10-08 02:29:33,146][52060] Updated weights for policy 0, policy_version 63510 (0.0007) [2023-10-08 02:29:33,504][52059] Updated weights for policy 1, policy_version 64292 (0.0007) [2023-10-08 02:29:33,515][52060] Updated weights for policy 0, policy_version 63520 (0.0008) [2023-10-08 02:29:33,875][52059] Updated weights for policy 1, policy_version 64302 (0.0007) [2023-10-08 02:29:34,232][52059] Updated weights for policy 1, policy_version 64312 (0.0007) [2023-10-08 02:29:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 130908160. Throughput: 0: 1693.9, 1: 1746.5. Samples: 32731116. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:29:36,211][50642] Avg episode reward: [(0, '20.160'), (1, '16.600')] [2023-10-08 02:29:37,679][52060] Updated weights for policy 0, policy_version 63530 (0.0007) [2023-10-08 02:29:38,045][52060] Updated weights for policy 0, policy_version 63540 (0.0007) [2023-10-08 02:29:38,129][52059] Updated weights for policy 1, policy_version 64322 (0.0008) [2023-10-08 02:29:38,411][52060] Updated weights for policy 0, policy_version 63550 (0.0008) [2023-10-08 02:29:38,493][52059] Updated weights for policy 1, policy_version 64332 (0.0008) [2023-10-08 02:29:38,853][52059] Updated weights for policy 1, policy_version 64342 (0.0010) [2023-10-08 02:29:39,218][52059] Updated weights for policy 1, policy_version 64352 (0.0009) [2023-10-08 02:29:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13884.8). Total num frames: 130973696. Throughput: 0: 1718.3, 1: 1729.9. Samples: 32751972. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:29:41,211][50642] Avg episode reward: [(0, '20.380'), (1, '21.280')] [2023-10-08 02:29:42,184][52060] Updated weights for policy 0, policy_version 63560 (0.0008) [2023-10-08 02:29:42,550][52060] Updated weights for policy 0, policy_version 63570 (0.0008) [2023-10-08 02:29:42,926][52060] Updated weights for policy 0, policy_version 63580 (0.0007) [2023-10-08 02:29:43,064][52059] Updated weights for policy 1, policy_version 64362 (0.0008) [2023-10-08 02:29:43,423][52059] Updated weights for policy 1, policy_version 64372 (0.0007) [2023-10-08 02:29:43,794][52059] Updated weights for policy 1, policy_version 64382 (0.0009) [2023-10-08 02:29:46,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 131039232. Throughput: 0: 1737.5, 1: 1736.1. Samples: 32773416. Policy #0 lag: (min: 2.0, avg: 5.5, max: 34.0) [2023-10-08 02:29:46,212][50642] Avg episode reward: [(0, '20.240'), (1, '18.100')] [2023-10-08 02:29:46,913][52060] Updated weights for policy 0, policy_version 63590 (0.0009) [2023-10-08 02:29:47,276][52060] Updated weights for policy 0, policy_version 63600 (0.0011) [2023-10-08 02:29:47,656][52060] Updated weights for policy 0, policy_version 63610 (0.0008) [2023-10-08 02:29:47,811][52059] Updated weights for policy 1, policy_version 64392 (0.0008) [2023-10-08 02:29:48,170][52059] Updated weights for policy 1, policy_version 64402 (0.0008) [2023-10-08 02:29:48,534][52059] Updated weights for policy 1, policy_version 64412 (0.0008) [2023-10-08 02:29:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 131104768. Throughput: 0: 1708.3, 1: 1725.9. Samples: 32782774. Policy #0 lag: (min: 2.0, avg: 5.5, max: 34.0) [2023-10-08 02:29:51,211][50642] Avg episode reward: [(0, '21.560'), (1, '19.100')] [2023-10-08 02:29:51,568][52060] Updated weights for policy 0, policy_version 63620 (0.0008) [2023-10-08 02:29:51,938][52060] Updated weights for policy 0, policy_version 63630 (0.0009) [2023-10-08 02:29:52,305][52060] Updated weights for policy 0, policy_version 63640 (0.0010) [2023-10-08 02:29:52,401][52059] Updated weights for policy 1, policy_version 64422 (0.0007) [2023-10-08 02:29:52,766][52059] Updated weights for policy 1, policy_version 64432 (0.0008) [2023-10-08 02:29:53,137][52059] Updated weights for policy 1, policy_version 64442 (0.0009) [2023-10-08 02:29:56,210][50642] Fps is (10 sec: 13107.7, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 131170304. Throughput: 0: 1734.7, 1: 1720.8. Samples: 32803990. Policy #0 lag: (min: 2.0, avg: 5.5, max: 34.0) [2023-10-08 02:29:56,211][50642] Avg episode reward: [(0, '20.970'), (1, '19.140')] [2023-10-08 02:29:56,224][52060] Updated weights for policy 0, policy_version 63650 (0.0009) [2023-10-08 02:29:56,604][52060] Updated weights for policy 0, policy_version 63660 (0.0009) [2023-10-08 02:29:56,940][52059] Updated weights for policy 1, policy_version 64452 (0.0009) [2023-10-08 02:29:56,981][52060] Updated weights for policy 0, policy_version 63670 (0.0009) [2023-10-08 02:29:57,301][52059] Updated weights for policy 1, policy_version 64462 (0.0008) [2023-10-08 02:29:57,342][52060] Updated weights for policy 0, policy_version 63680 (0.0007) [2023-10-08 02:29:57,660][52059] Updated weights for policy 1, policy_version 64472 (0.0011) [2023-10-08 02:30:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 131235840. Throughput: 0: 1733.6, 1: 1744.7. Samples: 32825304. Policy #0 lag: (min: 2.0, avg: 5.5, max: 34.0) [2023-10-08 02:30:01,211][50642] Avg episode reward: [(0, '19.850'), (1, '20.580')] [2023-10-08 02:30:01,414][52060] Updated weights for policy 0, policy_version 63690 (0.0008) [2023-10-08 02:30:01,754][52059] Updated weights for policy 1, policy_version 64482 (0.0008) [2023-10-08 02:30:01,774][52060] Updated weights for policy 0, policy_version 63700 (0.0009) [2023-10-08 02:30:02,118][52059] Updated weights for policy 1, policy_version 64492 (0.0008) [2023-10-08 02:30:02,144][52060] Updated weights for policy 0, policy_version 63710 (0.0007) [2023-10-08 02:30:02,475][52059] Updated weights for policy 1, policy_version 64502 (0.0007) [2023-10-08 02:30:02,844][52059] Updated weights for policy 1, policy_version 64512 (0.0007) [2023-10-08 02:30:06,035][52060] Updated weights for policy 0, policy_version 63720 (0.0008) [2023-10-08 02:30:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 131301376. Throughput: 0: 1722.0, 1: 1709.7. Samples: 32834704. Policy #0 lag: (min: 2.0, avg: 5.5, max: 34.0) [2023-10-08 02:30:06,211][50642] Avg episode reward: [(0, '19.670'), (1, '18.140')] [2023-10-08 02:30:06,402][52060] Updated weights for policy 0, policy_version 63730 (0.0008) [2023-10-08 02:30:06,769][52060] Updated weights for policy 0, policy_version 63740 (0.0007) [2023-10-08 02:30:06,849][52059] Updated weights for policy 1, policy_version 64522 (0.0007) [2023-10-08 02:30:07,214][52059] Updated weights for policy 1, policy_version 64532 (0.0009) [2023-10-08 02:30:07,574][52059] Updated weights for policy 1, policy_version 64542 (0.0008) [2023-10-08 02:30:10,838][52060] Updated weights for policy 0, policy_version 63750 (0.0007) [2023-10-08 02:30:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 131366912. Throughput: 0: 1726.3, 1: 1733.2. Samples: 32855730. Policy #0 lag: (min: 2.0, avg: 5.5, max: 34.0) [2023-10-08 02:30:11,211][50642] Avg episode reward: [(0, '20.590'), (1, '19.780')] [2023-10-08 02:30:11,213][52060] Updated weights for policy 0, policy_version 63760 (0.0007) [2023-10-08 02:30:11,576][52060] Updated weights for policy 0, policy_version 63770 (0.0007) [2023-10-08 02:30:11,611][52059] Updated weights for policy 1, policy_version 64552 (0.0008) [2023-10-08 02:30:11,983][52059] Updated weights for policy 1, policy_version 64562 (0.0009) [2023-10-08 02:30:12,340][52059] Updated weights for policy 1, policy_version 64572 (0.0008) [2023-10-08 02:30:15,596][52060] Updated weights for policy 0, policy_version 63780 (0.0009) [2023-10-08 02:30:15,959][52060] Updated weights for policy 0, policy_version 63790 (0.0010) [2023-10-08 02:30:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 131432448. Throughput: 0: 1716.5, 1: 1733.6. Samples: 32876400. Policy #0 lag: (min: 2.0, avg: 5.5, max: 34.0) [2023-10-08 02:30:16,211][50642] Avg episode reward: [(0, '21.000'), (1, '18.630')] [2023-10-08 02:30:16,324][52060] Updated weights for policy 0, policy_version 63800 (0.0009) [2023-10-08 02:30:16,333][52059] Updated weights for policy 1, policy_version 64582 (0.0008) [2023-10-08 02:30:16,690][52059] Updated weights for policy 1, policy_version 64592 (0.0008) [2023-10-08 02:30:17,057][52059] Updated weights for policy 1, policy_version 64602 (0.0007) [2023-10-08 02:30:20,071][52060] Updated weights for policy 0, policy_version 63810 (0.0007) [2023-10-08 02:30:20,440][52060] Updated weights for policy 0, policy_version 63820 (0.0009) [2023-10-08 02:30:20,804][52060] Updated weights for policy 0, policy_version 63830 (0.0009) [2023-10-08 02:30:21,092][52059] Updated weights for policy 1, policy_version 64612 (0.0010) [2023-10-08 02:30:21,159][52060] Updated weights for policy 0, policy_version 63840 (0.0010) [2023-10-08 02:30:21,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 131530752. Throughput: 0: 1731.4, 1: 1718.8. Samples: 32886376. Policy #0 lag: (min: 2.0, avg: 5.5, max: 34.0) [2023-10-08 02:30:21,211][50642] Avg episode reward: [(0, '19.380'), (1, '17.550')] [2023-10-08 02:30:21,457][52059] Updated weights for policy 1, policy_version 64622 (0.0008) [2023-10-08 02:30:21,817][52059] Updated weights for policy 1, policy_version 64632 (0.0007) [2023-10-08 02:30:25,090][52060] Updated weights for policy 0, policy_version 63850 (0.0009) [2023-10-08 02:30:25,465][52060] Updated weights for policy 0, policy_version 63860 (0.0008) [2023-10-08 02:30:25,604][52059] Updated weights for policy 1, policy_version 64642 (0.0009) [2023-10-08 02:30:25,826][52060] Updated weights for policy 0, policy_version 63870 (0.0009) [2023-10-08 02:30:25,967][52059] Updated weights for policy 1, policy_version 64652 (0.0008) [2023-10-08 02:30:26,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 131596288. Throughput: 0: 1727.0, 1: 1735.9. Samples: 32907802. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 02:30:26,211][50642] Avg episode reward: [(0, '20.290'), (1, '17.900')] [2023-10-08 02:30:26,329][52059] Updated weights for policy 1, policy_version 64662 (0.0009) [2023-10-08 02:30:26,696][52059] Updated weights for policy 1, policy_version 64672 (0.0008) [2023-10-08 02:30:29,762][52060] Updated weights for policy 0, policy_version 63880 (0.0008) [2023-10-08 02:30:30,132][52060] Updated weights for policy 0, policy_version 63890 (0.0007) [2023-10-08 02:30:30,500][52060] Updated weights for policy 0, policy_version 63900 (0.0007) [2023-10-08 02:30:30,571][52059] Updated weights for policy 1, policy_version 64682 (0.0010) [2023-10-08 02:30:30,937][52059] Updated weights for policy 1, policy_version 64692 (0.0007) [2023-10-08 02:30:31,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 131661824. Throughput: 0: 1696.5, 1: 1723.9. Samples: 32927336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 02:30:31,211][50642] Avg episode reward: [(0, '21.910'), (1, '18.360')] [2023-10-08 02:30:31,297][52059] Updated weights for policy 1, policy_version 64702 (0.0008) [2023-10-08 02:30:34,555][52060] Updated weights for policy 0, policy_version 63910 (0.0009) [2023-10-08 02:30:34,914][52060] Updated weights for policy 0, policy_version 63920 (0.0007) [2023-10-08 02:30:35,275][52060] Updated weights for policy 0, policy_version 63930 (0.0008) [2023-10-08 02:30:35,356][52059] Updated weights for policy 1, policy_version 64712 (0.0008) [2023-10-08 02:30:35,723][52059] Updated weights for policy 1, policy_version 64722 (0.0007) [2023-10-08 02:30:36,088][52059] Updated weights for policy 1, policy_version 64732 (0.0010) [2023-10-08 02:30:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 131727360. Throughput: 0: 1730.0, 1: 1737.7. Samples: 32938820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 02:30:36,211][50642] Avg episode reward: [(0, '21.610'), (1, '19.880')] [2023-10-08 02:30:39,299][52060] Updated weights for policy 0, policy_version 63940 (0.0009) [2023-10-08 02:30:39,675][52060] Updated weights for policy 0, policy_version 63950 (0.0010) [2023-10-08 02:30:40,044][52060] Updated weights for policy 0, policy_version 63960 (0.0007) [2023-10-08 02:30:40,144][52059] Updated weights for policy 1, policy_version 64742 (0.0008) [2023-10-08 02:30:40,507][52059] Updated weights for policy 1, policy_version 64752 (0.0009) [2023-10-08 02:30:40,864][52059] Updated weights for policy 1, policy_version 64762 (0.0011) [2023-10-08 02:30:41,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 131825664. Throughput: 0: 1714.5, 1: 1735.7. Samples: 32959252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 02:30:41,211][50642] Avg episode reward: [(0, '19.050'), (1, '20.030')] [2023-10-08 02:30:43,997][52060] Updated weights for policy 0, policy_version 63970 (0.0009) [2023-10-08 02:30:44,360][52060] Updated weights for policy 0, policy_version 63980 (0.0009) [2023-10-08 02:30:44,738][52059] Updated weights for policy 1, policy_version 64772 (0.0009) [2023-10-08 02:30:44,739][52060] Updated weights for policy 0, policy_version 63990 (0.0008) [2023-10-08 02:30:45,098][52060] Updated weights for policy 0, policy_version 64000 (0.0009) [2023-10-08 02:30:45,101][52059] Updated weights for policy 1, policy_version 64782 (0.0009) [2023-10-08 02:30:45,470][52059] Updated weights for policy 1, policy_version 64792 (0.0008) [2023-10-08 02:30:46,210][50642] Fps is (10 sec: 16383.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 131891200. Throughput: 0: 1699.9, 1: 1707.0. Samples: 32978616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 02:30:46,211][50642] Avg episode reward: [(0, '19.490'), (1, '21.130')] [2023-10-08 02:30:46,221][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000064000_65536000.pth... [2023-10-08 02:30:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000064800_66355200.pth... [2023-10-08 02:30:46,272][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000063168_64684032.pth [2023-10-08 02:30:46,272][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000062400_63897600.pth [2023-10-08 02:30:46,278][51710] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p1/milestones/checkpoint_000064800_66355200.pth [2023-10-08 02:30:46,278][51605] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p0/milestones/checkpoint_000064000_65536000.pth [2023-10-08 02:30:49,174][52060] Updated weights for policy 0, policy_version 64010 (0.0009) [2023-10-08 02:30:49,308][52059] Updated weights for policy 1, policy_version 64802 (0.0008) [2023-10-08 02:30:49,544][52060] Updated weights for policy 0, policy_version 64020 (0.0008) [2023-10-08 02:30:49,680][52059] Updated weights for policy 1, policy_version 64812 (0.0008) [2023-10-08 02:30:49,909][52060] Updated weights for policy 0, policy_version 64030 (0.0010) [2023-10-08 02:30:50,037][52059] Updated weights for policy 1, policy_version 64822 (0.0010) [2023-10-08 02:30:50,404][52059] Updated weights for policy 1, policy_version 64832 (0.0009) [2023-10-08 02:30:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 131956736. Throughput: 0: 1724.9, 1: 1735.6. Samples: 32990430. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 02:30:51,211][50642] Avg episode reward: [(0, '20.000'), (1, '18.110')] [2023-10-08 02:30:53,953][52060] Updated weights for policy 0, policy_version 64040 (0.0009) [2023-10-08 02:30:54,323][52060] Updated weights for policy 0, policy_version 64050 (0.0007) [2023-10-08 02:30:54,467][52059] Updated weights for policy 1, policy_version 64842 (0.0008) [2023-10-08 02:30:54,690][52060] Updated weights for policy 0, policy_version 64060 (0.0008) [2023-10-08 02:30:54,834][52059] Updated weights for policy 1, policy_version 64852 (0.0008) [2023-10-08 02:30:55,191][52059] Updated weights for policy 1, policy_version 64862 (0.0009) [2023-10-08 02:30:56,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.4, 300 sec: 13884.8). Total num frames: 132022272. Throughput: 0: 1698.0, 1: 1718.7. Samples: 33009478. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 02:30:56,211][50642] Avg episode reward: [(0, '17.840'), (1, '20.580')] [2023-10-08 02:30:58,626][52060] Updated weights for policy 0, policy_version 64070 (0.0009) [2023-10-08 02:30:59,000][52060] Updated weights for policy 0, policy_version 64080 (0.0010) [2023-10-08 02:30:59,360][52060] Updated weights for policy 0, policy_version 64090 (0.0007) [2023-10-08 02:30:59,437][52059] Updated weights for policy 1, policy_version 64872 (0.0008) [2023-10-08 02:30:59,818][52059] Updated weights for policy 1, policy_version 64882 (0.0010) [2023-10-08 02:31:00,181][52059] Updated weights for policy 1, policy_version 64892 (0.0009) [2023-10-08 02:31:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 132087808. Throughput: 0: 1703.8, 1: 1709.8. Samples: 33030014. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 02:31:01,211][50642] Avg episode reward: [(0, '17.140'), (1, '17.410')] [2023-10-08 02:31:03,430][52060] Updated weights for policy 0, policy_version 64100 (0.0009) [2023-10-08 02:31:03,798][52060] Updated weights for policy 0, policy_version 64110 (0.0009) [2023-10-08 02:31:04,130][52059] Updated weights for policy 1, policy_version 64902 (0.0009) [2023-10-08 02:31:04,176][52060] Updated weights for policy 0, policy_version 64120 (0.0009) [2023-10-08 02:31:04,482][52059] Updated weights for policy 1, policy_version 64912 (0.0009) [2023-10-08 02:31:04,843][52059] Updated weights for policy 1, policy_version 64922 (0.0011) [2023-10-08 02:31:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 132153344. Throughput: 0: 1704.5, 1: 1738.1. Samples: 33041294. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) [2023-10-08 02:31:06,211][50642] Avg episode reward: [(0, '18.880'), (1, '16.730')] [2023-10-08 02:31:08,245][52060] Updated weights for policy 0, policy_version 64130 (0.0009) [2023-10-08 02:31:08,615][52060] Updated weights for policy 0, policy_version 64140 (0.0010) [2023-10-08 02:31:08,671][52059] Updated weights for policy 1, policy_version 64932 (0.0009) [2023-10-08 02:31:08,974][52060] Updated weights for policy 0, policy_version 64150 (0.0009) [2023-10-08 02:31:09,037][52059] Updated weights for policy 1, policy_version 64942 (0.0007) [2023-10-08 02:31:09,347][52060] Updated weights for policy 0, policy_version 64160 (0.0009) [2023-10-08 02:31:09,403][52059] Updated weights for policy 1, policy_version 64952 (0.0009) [2023-10-08 02:31:11,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 132218880. Throughput: 0: 1691.9, 1: 1707.8. Samples: 33060792. Policy #0 lag: (min: 26.0, avg: 27.9, max: 49.0) [2023-10-08 02:31:11,211][50642] Avg episode reward: [(0, '21.470'), (1, '20.070')] [2023-10-08 02:31:13,324][52060] Updated weights for policy 0, policy_version 64170 (0.0009) [2023-10-08 02:31:13,356][52059] Updated weights for policy 1, policy_version 64962 (0.0008) [2023-10-08 02:31:13,697][52060] Updated weights for policy 0, policy_version 64180 (0.0008) [2023-10-08 02:31:13,716][52059] Updated weights for policy 1, policy_version 64972 (0.0007) [2023-10-08 02:31:14,060][52060] Updated weights for policy 0, policy_version 64190 (0.0010) [2023-10-08 02:31:14,081][52059] Updated weights for policy 1, policy_version 64982 (0.0008) [2023-10-08 02:31:14,447][52059] Updated weights for policy 1, policy_version 64992 (0.0010) [2023-10-08 02:31:16,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 132284416. Throughput: 0: 1715.6, 1: 1722.0. Samples: 33082026. Policy #0 lag: (min: 26.0, avg: 27.9, max: 49.0) [2023-10-08 02:31:16,211][50642] Avg episode reward: [(0, '18.340'), (1, '16.400')] [2023-10-08 02:31:17,897][52060] Updated weights for policy 0, policy_version 64200 (0.0008) [2023-10-08 02:31:18,263][52060] Updated weights for policy 0, policy_version 64210 (0.0007) [2023-10-08 02:31:18,307][52059] Updated weights for policy 1, policy_version 65002 (0.0008) [2023-10-08 02:31:18,627][52060] Updated weights for policy 0, policy_version 64220 (0.0007) [2023-10-08 02:31:18,671][52059] Updated weights for policy 1, policy_version 65012 (0.0010) [2023-10-08 02:31:19,037][52059] Updated weights for policy 1, policy_version 65022 (0.0011) [2023-10-08 02:31:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 132349952. Throughput: 0: 1685.5, 1: 1715.1. Samples: 33091846. Policy #0 lag: (min: 26.0, avg: 27.9, max: 49.0) [2023-10-08 02:31:21,211][50642] Avg episode reward: [(0, '18.870'), (1, '17.490')] [2023-10-08 02:31:22,632][52060] Updated weights for policy 0, policy_version 64230 (0.0010) [2023-10-08 02:31:23,002][52060] Updated weights for policy 0, policy_version 64240 (0.0009) [2023-10-08 02:31:23,145][52059] Updated weights for policy 1, policy_version 65032 (0.0008) [2023-10-08 02:31:23,366][52060] Updated weights for policy 0, policy_version 64250 (0.0007) [2023-10-08 02:31:23,501][52059] Updated weights for policy 1, policy_version 65042 (0.0009) [2023-10-08 02:31:23,863][52059] Updated weights for policy 1, policy_version 65052 (0.0007) [2023-10-08 02:31:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 132415488. Throughput: 0: 1701.7, 1: 1708.7. Samples: 33112720. Policy #0 lag: (min: 26.0, avg: 27.9, max: 49.0) [2023-10-08 02:31:26,211][50642] Avg episode reward: [(0, '21.940'), (1, '18.270')] [2023-10-08 02:31:27,314][52060] Updated weights for policy 0, policy_version 64260 (0.0008) [2023-10-08 02:31:27,585][52059] Updated weights for policy 1, policy_version 65062 (0.0007) [2023-10-08 02:31:27,677][52060] Updated weights for policy 0, policy_version 64270 (0.0007) [2023-10-08 02:31:27,953][52059] Updated weights for policy 1, policy_version 65072 (0.0008) [2023-10-08 02:31:28,046][52060] Updated weights for policy 0, policy_version 64280 (0.0008) [2023-10-08 02:31:28,312][52059] Updated weights for policy 1, policy_version 65082 (0.0008) [2023-10-08 02:31:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 132481024. Throughput: 0: 1714.9, 1: 1743.3. Samples: 33134234. Policy #0 lag: (min: 26.0, avg: 27.9, max: 49.0) [2023-10-08 02:31:31,211][50642] Avg episode reward: [(0, '21.630'), (1, '17.670')] [2023-10-08 02:31:32,018][52060] Updated weights for policy 0, policy_version 64290 (0.0008) [2023-10-08 02:31:32,139][52059] Updated weights for policy 1, policy_version 65092 (0.0007) [2023-10-08 02:31:32,382][52060] Updated weights for policy 0, policy_version 64300 (0.0009) [2023-10-08 02:31:32,500][52059] Updated weights for policy 1, policy_version 65102 (0.0008) [2023-10-08 02:31:32,751][52060] Updated weights for policy 0, policy_version 64310 (0.0008) [2023-10-08 02:31:32,860][52059] Updated weights for policy 1, policy_version 65112 (0.0007) [2023-10-08 02:31:33,117][52060] Updated weights for policy 0, policy_version 64320 (0.0007) [2023-10-08 02:31:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 132546560. Throughput: 0: 1689.4, 1: 1714.8. Samples: 33143620. Policy #0 lag: (min: 26.0, avg: 27.9, max: 49.0) [2023-10-08 02:31:36,211][50642] Avg episode reward: [(0, '19.060'), (1, '17.280')] [2023-10-08 02:31:36,929][52060] Updated weights for policy 0, policy_version 64330 (0.0007) [2023-10-08 02:31:37,015][52059] Updated weights for policy 1, policy_version 65122 (0.0008) [2023-10-08 02:31:37,290][52060] Updated weights for policy 0, policy_version 64340 (0.0008) [2023-10-08 02:31:37,377][52059] Updated weights for policy 1, policy_version 65132 (0.0007) [2023-10-08 02:31:37,657][52060] Updated weights for policy 0, policy_version 64350 (0.0008) [2023-10-08 02:31:37,740][52059] Updated weights for policy 1, policy_version 65142 (0.0007) [2023-10-08 02:31:38,101][52059] Updated weights for policy 1, policy_version 65152 (0.0009) [2023-10-08 02:31:41,210][50642] Fps is (10 sec: 13107.7, 60 sec: 13107.3, 300 sec: 13773.7). Total num frames: 132612096. Throughput: 0: 1723.2, 1: 1730.2. Samples: 33164884. Policy #0 lag: (min: 26.0, avg: 27.9, max: 49.0) [2023-10-08 02:31:41,211][50642] Avg episode reward: [(0, '19.070'), (1, '20.400')] [2023-10-08 02:31:41,565][52060] Updated weights for policy 0, policy_version 64360 (0.0008) [2023-10-08 02:31:41,921][52060] Updated weights for policy 0, policy_version 64370 (0.0007) [2023-10-08 02:31:42,003][52059] Updated weights for policy 1, policy_version 65162 (0.0007) [2023-10-08 02:31:42,293][52060] Updated weights for policy 0, policy_version 64380 (0.0007) [2023-10-08 02:31:42,365][52059] Updated weights for policy 1, policy_version 65172 (0.0009) [2023-10-08 02:31:42,730][52059] Updated weights for policy 1, policy_version 65182 (0.0009) [2023-10-08 02:31:46,164][52060] Updated weights for policy 0, policy_version 64390 (0.0009) [2023-10-08 02:31:46,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 132677632. Throughput: 0: 1726.4, 1: 1741.4. Samples: 33186064. Policy #0 lag: (min: 26.0, avg: 27.9, max: 49.0) [2023-10-08 02:31:46,211][50642] Avg episode reward: [(0, '21.880'), (1, '18.230')] [2023-10-08 02:31:46,531][52060] Updated weights for policy 0, policy_version 64400 (0.0009) [2023-10-08 02:31:46,864][52059] Updated weights for policy 1, policy_version 65192 (0.0008) [2023-10-08 02:31:46,902][52060] Updated weights for policy 0, policy_version 64410 (0.0007) [2023-10-08 02:31:47,245][52059] Updated weights for policy 1, policy_version 65202 (0.0010) [2023-10-08 02:31:47,605][52059] Updated weights for policy 1, policy_version 65212 (0.0010) [2023-10-08 02:31:50,934][52060] Updated weights for policy 0, policy_version 64420 (0.0008) [2023-10-08 02:31:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 132743168. Throughput: 0: 1709.3, 1: 1709.1. Samples: 33195120. Policy #0 lag: (min: 26.0, avg: 27.9, max: 49.0) [2023-10-08 02:31:51,211][50642] Avg episode reward: [(0, '19.450'), (1, '19.080')] [2023-10-08 02:31:51,296][52060] Updated weights for policy 0, policy_version 64430 (0.0010) [2023-10-08 02:31:51,524][52059] Updated weights for policy 1, policy_version 65222 (0.0008) [2023-10-08 02:31:51,665][52060] Updated weights for policy 0, policy_version 64440 (0.0008) [2023-10-08 02:31:51,886][52059] Updated weights for policy 1, policy_version 65232 (0.0008) [2023-10-08 02:31:52,245][52059] Updated weights for policy 1, policy_version 65242 (0.0008) [2023-10-08 02:31:55,838][52060] Updated weights for policy 0, policy_version 64450 (0.0008) [2023-10-08 02:31:56,078][52059] Updated weights for policy 1, policy_version 65252 (0.0008) [2023-10-08 02:31:56,203][52060] Updated weights for policy 0, policy_version 64460 (0.0008) [2023-10-08 02:31:56,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 132808704. Throughput: 0: 1722.7, 1: 1736.5. Samples: 33216454. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 02:31:56,211][50642] Avg episode reward: [(0, '17.990'), (1, '21.870')] [2023-10-08 02:31:56,439][52059] Updated weights for policy 1, policy_version 65262 (0.0007) [2023-10-08 02:31:56,574][52060] Updated weights for policy 0, policy_version 64470 (0.0009) [2023-10-08 02:31:56,797][52059] Updated weights for policy 1, policy_version 65272 (0.0007) [2023-10-08 02:31:56,937][52060] Updated weights for policy 0, policy_version 64480 (0.0008) [2023-10-08 02:32:00,627][52059] Updated weights for policy 1, policy_version 65282 (0.0007) [2023-10-08 02:32:00,981][52060] Updated weights for policy 0, policy_version 64490 (0.0007) [2023-10-08 02:32:00,984][52059] Updated weights for policy 1, policy_version 65292 (0.0007) [2023-10-08 02:32:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 132874240. Throughput: 0: 1717.0, 1: 1729.2. Samples: 33237106. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 02:32:01,211][50642] Avg episode reward: [(0, '20.650'), (1, '19.080')] [2023-10-08 02:32:01,346][52059] Updated weights for policy 1, policy_version 65302 (0.0008) [2023-10-08 02:32:01,346][52060] Updated weights for policy 0, policy_version 64500 (0.0009) [2023-10-08 02:32:01,703][52059] Updated weights for policy 1, policy_version 65312 (0.0010) [2023-10-08 02:32:01,708][52060] Updated weights for policy 0, policy_version 64510 (0.0009) [2023-10-08 02:32:05,764][52059] Updated weights for policy 1, policy_version 65322 (0.0009) [2023-10-08 02:32:05,778][52060] Updated weights for policy 0, policy_version 64520 (0.0009) [2023-10-08 02:32:06,131][52059] Updated weights for policy 1, policy_version 65332 (0.0008) [2023-10-08 02:32:06,153][52060] Updated weights for policy 0, policy_version 64530 (0.0009) [2023-10-08 02:32:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 132939776. Throughput: 0: 1718.3, 1: 1724.3. Samples: 33246762. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 02:32:06,211][50642] Avg episode reward: [(0, '21.830'), (1, '19.440')] [2023-10-08 02:32:06,489][52059] Updated weights for policy 1, policy_version 65342 (0.0008) [2023-10-08 02:32:06,524][52060] Updated weights for policy 0, policy_version 64540 (0.0008) [2023-10-08 02:32:10,365][52059] Updated weights for policy 1, policy_version 65352 (0.0008) [2023-10-08 02:32:10,427][52060] Updated weights for policy 0, policy_version 64550 (0.0008) [2023-10-08 02:32:10,724][52059] Updated weights for policy 1, policy_version 65362 (0.0007) [2023-10-08 02:32:10,790][52060] Updated weights for policy 0, policy_version 64560 (0.0007) [2023-10-08 02:32:11,084][52059] Updated weights for policy 1, policy_version 65372 (0.0008) [2023-10-08 02:32:11,163][52060] Updated weights for policy 0, policy_version 64570 (0.0008) [2023-10-08 02:32:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 133005312. Throughput: 0: 1721.5, 1: 1733.5. Samples: 33268196. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 02:32:11,211][50642] Avg episode reward: [(0, '20.120'), (1, '20.930')] [2023-10-08 02:32:15,031][52059] Updated weights for policy 1, policy_version 65382 (0.0008) [2023-10-08 02:32:15,117][52060] Updated weights for policy 0, policy_version 64580 (0.0007) [2023-10-08 02:32:15,382][52059] Updated weights for policy 1, policy_version 65392 (0.0009) [2023-10-08 02:32:15,486][52060] Updated weights for policy 0, policy_version 64590 (0.0009) [2023-10-08 02:32:15,741][52059] Updated weights for policy 1, policy_version 65402 (0.0009) [2023-10-08 02:32:15,849][52060] Updated weights for policy 0, policy_version 64600 (0.0007) [2023-10-08 02:32:16,210][50642] Fps is (10 sec: 19660.7, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 133136384. Throughput: 0: 1699.2, 1: 1703.9. Samples: 33287372. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 02:32:16,211][50642] Avg episode reward: [(0, '17.400'), (1, '20.500')] [2023-10-08 02:32:19,774][52059] Updated weights for policy 1, policy_version 65412 (0.0009) [2023-10-08 02:32:19,951][52060] Updated weights for policy 0, policy_version 64610 (0.0008) [2023-10-08 02:32:20,147][52059] Updated weights for policy 1, policy_version 65422 (0.0007) [2023-10-08 02:32:20,326][52060] Updated weights for policy 0, policy_version 64620 (0.0007) [2023-10-08 02:32:20,512][52059] Updated weights for policy 1, policy_version 65432 (0.0008) [2023-10-08 02:32:20,692][52060] Updated weights for policy 0, policy_version 64630 (0.0008) [2023-10-08 02:32:21,053][52060] Updated weights for policy 0, policy_version 64640 (0.0010) [2023-10-08 02:32:21,210][50642] Fps is (10 sec: 19660.7, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 133201920. Throughput: 0: 1722.3, 1: 1725.5. Samples: 33298772. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 02:32:21,211][50642] Avg episode reward: [(0, '21.790'), (1, '18.290')] [2023-10-08 02:32:24,537][52059] Updated weights for policy 1, policy_version 65442 (0.0010) [2023-10-08 02:32:24,894][52059] Updated weights for policy 1, policy_version 65452 (0.0007) [2023-10-08 02:32:25,085][52060] Updated weights for policy 0, policy_version 64650 (0.0007) [2023-10-08 02:32:25,272][52059] Updated weights for policy 1, policy_version 65462 (0.0009) [2023-10-08 02:32:25,456][52060] Updated weights for policy 0, policy_version 64660 (0.0007) [2023-10-08 02:32:25,627][52059] Updated weights for policy 1, policy_version 65472 (0.0010) [2023-10-08 02:32:25,828][52060] Updated weights for policy 0, policy_version 64670 (0.0009) [2023-10-08 02:32:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 133267456. Throughput: 0: 1716.5, 1: 1722.9. Samples: 33319656. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 02:32:26,211][50642] Avg episode reward: [(0, '22.400'), (1, '19.440')] [2023-10-08 02:32:29,494][52059] Updated weights for policy 1, policy_version 65482 (0.0007) [2023-10-08 02:32:29,793][52060] Updated weights for policy 0, policy_version 64680 (0.0007) [2023-10-08 02:32:29,847][52059] Updated weights for policy 1, policy_version 65492 (0.0007) [2023-10-08 02:32:30,159][52060] Updated weights for policy 0, policy_version 64690 (0.0007) [2023-10-08 02:32:30,214][52059] Updated weights for policy 1, policy_version 65502 (0.0007) [2023-10-08 02:32:30,531][52060] Updated weights for policy 0, policy_version 64700 (0.0010) [2023-10-08 02:32:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 133332992. Throughput: 0: 1685.0, 1: 1713.1. Samples: 33338978. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 02:32:31,211][50642] Avg episode reward: [(0, '20.130'), (1, '20.340')] [2023-10-08 02:32:34,258][52059] Updated weights for policy 1, policy_version 65512 (0.0008) [2023-10-08 02:32:34,547][52060] Updated weights for policy 0, policy_version 64710 (0.0008) [2023-10-08 02:32:34,633][52059] Updated weights for policy 1, policy_version 65522 (0.0008) [2023-10-08 02:32:34,918][52060] Updated weights for policy 0, policy_version 64720 (0.0008) [2023-10-08 02:32:34,997][52059] Updated weights for policy 1, policy_version 65532 (0.0009) [2023-10-08 02:32:35,280][52060] Updated weights for policy 0, policy_version 64730 (0.0010) [2023-10-08 02:32:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 133398528. Throughput: 0: 1714.1, 1: 1747.5. Samples: 33350896. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:32:36,211][50642] Avg episode reward: [(0, '19.270'), (1, '20.960')] [2023-10-08 02:32:38,942][52059] Updated weights for policy 1, policy_version 65542 (0.0008) [2023-10-08 02:32:39,230][52060] Updated weights for policy 0, policy_version 64740 (0.0008) [2023-10-08 02:32:39,305][52059] Updated weights for policy 1, policy_version 65552 (0.0008) [2023-10-08 02:32:39,603][52060] Updated weights for policy 0, policy_version 64750 (0.0009) [2023-10-08 02:32:39,676][52059] Updated weights for policy 1, policy_version 65562 (0.0008) [2023-10-08 02:32:39,962][52060] Updated weights for policy 0, policy_version 64760 (0.0009) [2023-10-08 02:32:41,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 133464064. Throughput: 0: 1696.1, 1: 1711.9. Samples: 33369816. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:32:41,211][50642] Avg episode reward: [(0, '20.170'), (1, '20.540')] [2023-10-08 02:32:43,526][52059] Updated weights for policy 1, policy_version 65572 (0.0007) [2023-10-08 02:32:43,896][52059] Updated weights for policy 1, policy_version 65582 (0.0009) [2023-10-08 02:32:44,077][52060] Updated weights for policy 0, policy_version 64770 (0.0010) [2023-10-08 02:32:44,269][52059] Updated weights for policy 1, policy_version 65592 (0.0008) [2023-10-08 02:32:44,439][52060] Updated weights for policy 0, policy_version 64780 (0.0008) [2023-10-08 02:32:44,812][52060] Updated weights for policy 0, policy_version 64790 (0.0007) [2023-10-08 02:32:45,170][52060] Updated weights for policy 0, policy_version 64800 (0.0009) [2023-10-08 02:32:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 133529600. Throughput: 0: 1686.0, 1: 1720.7. Samples: 33390408. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:32:46,211][50642] Avg episode reward: [(0, '20.490'), (1, '20.370')] [2023-10-08 02:32:46,220][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000065600_67174400.pth... [2023-10-08 02:32:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000064800_66355200.pth... [2023-10-08 02:32:46,250][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000063968_65503232.pth [2023-10-08 02:32:46,258][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000063200_64716800.pth [2023-10-08 02:32:48,068][52059] Updated weights for policy 1, policy_version 65602 (0.0009) [2023-10-08 02:32:48,431][52059] Updated weights for policy 1, policy_version 65612 (0.0007) [2023-10-08 02:32:48,794][52059] Updated weights for policy 1, policy_version 65622 (0.0007) [2023-10-08 02:32:49,161][52059] Updated weights for policy 1, policy_version 65632 (0.0007) [2023-10-08 02:32:49,255][52060] Updated weights for policy 0, policy_version 64810 (0.0007) [2023-10-08 02:32:49,613][52060] Updated weights for policy 0, policy_version 64820 (0.0007) [2023-10-08 02:32:49,991][52060] Updated weights for policy 0, policy_version 64830 (0.0008) [2023-10-08 02:32:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 133595136. Throughput: 0: 1713.9, 1: 1729.6. Samples: 33401722. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:32:51,211][50642] Avg episode reward: [(0, '18.750'), (1, '20.620')] [2023-10-08 02:32:52,975][52059] Updated weights for policy 1, policy_version 65642 (0.0008) [2023-10-08 02:32:53,326][52059] Updated weights for policy 1, policy_version 65652 (0.0010) [2023-10-08 02:32:53,692][52059] Updated weights for policy 1, policy_version 65662 (0.0009) [2023-10-08 02:32:53,946][52060] Updated weights for policy 0, policy_version 64840 (0.0009) [2023-10-08 02:32:54,316][52060] Updated weights for policy 0, policy_version 64850 (0.0009) [2023-10-08 02:32:54,691][52060] Updated weights for policy 0, policy_version 64860 (0.0010) [2023-10-08 02:32:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 133660672. Throughput: 0: 1681.2, 1: 1723.1. Samples: 33421390. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:32:56,211][50642] Avg episode reward: [(0, '18.230'), (1, '19.810')] [2023-10-08 02:32:57,530][52059] Updated weights for policy 1, policy_version 65672 (0.0010) [2023-10-08 02:32:57,891][52059] Updated weights for policy 1, policy_version 65682 (0.0008) [2023-10-08 02:32:58,250][52059] Updated weights for policy 1, policy_version 65692 (0.0008) [2023-10-08 02:32:58,678][52060] Updated weights for policy 0, policy_version 64870 (0.0010) [2023-10-08 02:32:59,040][52060] Updated weights for policy 0, policy_version 64880 (0.0007) [2023-10-08 02:32:59,413][52060] Updated weights for policy 0, policy_version 64890 (0.0008) [2023-10-08 02:33:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 133726208. Throughput: 0: 1701.3, 1: 1754.6. Samples: 33442888. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:33:01,211][50642] Avg episode reward: [(0, '20.850'), (1, '20.210')] [2023-10-08 02:33:02,189][52059] Updated weights for policy 1, policy_version 65702 (0.0008) [2023-10-08 02:33:02,560][52059] Updated weights for policy 1, policy_version 65712 (0.0007) [2023-10-08 02:33:02,927][52059] Updated weights for policy 1, policy_version 65722 (0.0009) [2023-10-08 02:33:03,373][52060] Updated weights for policy 0, policy_version 64900 (0.0009) [2023-10-08 02:33:03,744][52060] Updated weights for policy 0, policy_version 64910 (0.0009) [2023-10-08 02:33:04,103][52060] Updated weights for policy 0, policy_version 64920 (0.0008) [2023-10-08 02:33:06,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 133791744. Throughput: 0: 1692.8, 1: 1729.3. Samples: 33452768. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:33:06,212][50642] Avg episode reward: [(0, '18.970'), (1, '19.270')] [2023-10-08 02:33:06,860][52059] Updated weights for policy 1, policy_version 65732 (0.0010) [2023-10-08 02:33:07,224][52059] Updated weights for policy 1, policy_version 65742 (0.0009) [2023-10-08 02:33:07,592][52059] Updated weights for policy 1, policy_version 65752 (0.0008) [2023-10-08 02:33:08,043][52060] Updated weights for policy 0, policy_version 64930 (0.0010) [2023-10-08 02:33:08,408][52060] Updated weights for policy 0, policy_version 64940 (0.0010) [2023-10-08 02:33:08,772][52060] Updated weights for policy 0, policy_version 64950 (0.0011) [2023-10-08 02:33:09,133][52060] Updated weights for policy 0, policy_version 64960 (0.0009) [2023-10-08 02:33:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 133857280. Throughput: 0: 1684.3, 1: 1736.8. Samples: 33473606. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:33:11,211][50642] Avg episode reward: [(0, '16.870'), (1, '20.940')] [2023-10-08 02:33:11,531][52059] Updated weights for policy 1, policy_version 65762 (0.0007) [2023-10-08 02:33:11,901][52059] Updated weights for policy 1, policy_version 65772 (0.0009) [2023-10-08 02:33:12,265][52059] Updated weights for policy 1, policy_version 65782 (0.0008) [2023-10-08 02:33:12,618][52059] Updated weights for policy 1, policy_version 65792 (0.0007) [2023-10-08 02:33:13,142][52060] Updated weights for policy 0, policy_version 64970 (0.0007) [2023-10-08 02:33:13,510][52060] Updated weights for policy 0, policy_version 64980 (0.0007) [2023-10-08 02:33:13,870][52060] Updated weights for policy 0, policy_version 64990 (0.0008) [2023-10-08 02:33:16,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 133922816. Throughput: 0: 1714.0, 1: 1751.9. Samples: 33494944. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:33:16,211][50642] Avg episode reward: [(0, '19.560'), (1, '23.160')] [2023-10-08 02:33:16,647][52059] Updated weights for policy 1, policy_version 65802 (0.0007) [2023-10-08 02:33:17,005][52059] Updated weights for policy 1, policy_version 65812 (0.0009) [2023-10-08 02:33:17,367][52059] Updated weights for policy 1, policy_version 65822 (0.0011) [2023-10-08 02:33:17,861][52060] Updated weights for policy 0, policy_version 65000 (0.0008) [2023-10-08 02:33:18,238][52060] Updated weights for policy 0, policy_version 65010 (0.0011) [2023-10-08 02:33:18,604][52060] Updated weights for policy 0, policy_version 65020 (0.0010) [2023-10-08 02:33:21,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 133988352. Throughput: 0: 1688.1, 1: 1723.7. Samples: 33504424. Policy #0 lag: (min: 31.0, avg: 32.5, max: 57.0) [2023-10-08 02:33:21,211][50642] Avg episode reward: [(0, '22.650'), (1, '19.820')] [2023-10-08 02:33:21,541][52059] Updated weights for policy 1, policy_version 65832 (0.0007) [2023-10-08 02:33:21,908][52059] Updated weights for policy 1, policy_version 65842 (0.0009) [2023-10-08 02:33:22,273][52059] Updated weights for policy 1, policy_version 65852 (0.0008) [2023-10-08 02:33:22,534][52060] Updated weights for policy 0, policy_version 65030 (0.0010) [2023-10-08 02:33:22,897][52060] Updated weights for policy 0, policy_version 65040 (0.0010) [2023-10-08 02:33:23,263][52060] Updated weights for policy 0, policy_version 65050 (0.0010) [2023-10-08 02:33:26,158][52059] Updated weights for policy 1, policy_version 65862 (0.0009) [2023-10-08 02:33:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 134053888. Throughput: 0: 1703.7, 1: 1748.4. Samples: 33525162. Policy #0 lag: (min: 31.0, avg: 32.5, max: 57.0) [2023-10-08 02:33:26,211][50642] Avg episode reward: [(0, '16.820'), (1, '21.410')] [2023-10-08 02:33:26,515][52059] Updated weights for policy 1, policy_version 65872 (0.0008) [2023-10-08 02:33:26,882][52059] Updated weights for policy 1, policy_version 65882 (0.0008) [2023-10-08 02:33:27,208][52060] Updated weights for policy 0, policy_version 65060 (0.0008) [2023-10-08 02:33:27,571][52060] Updated weights for policy 0, policy_version 65070 (0.0007) [2023-10-08 02:33:27,937][52060] Updated weights for policy 0, policy_version 65080 (0.0009) [2023-10-08 02:33:30,769][52059] Updated weights for policy 1, policy_version 65892 (0.0010) [2023-10-08 02:33:31,139][52059] Updated weights for policy 1, policy_version 65902 (0.0007) [2023-10-08 02:33:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 134119424. Throughput: 0: 1729.4, 1: 1742.2. Samples: 33546628. Policy #0 lag: (min: 31.0, avg: 32.5, max: 57.0) [2023-10-08 02:33:31,211][50642] Avg episode reward: [(0, '18.160'), (1, '21.680')] [2023-10-08 02:33:31,514][52059] Updated weights for policy 1, policy_version 65912 (0.0007) [2023-10-08 02:33:31,708][52060] Updated weights for policy 0, policy_version 65090 (0.0009) [2023-10-08 02:33:32,077][52060] Updated weights for policy 0, policy_version 65100 (0.0007) [2023-10-08 02:33:32,451][52060] Updated weights for policy 0, policy_version 65110 (0.0008) [2023-10-08 02:33:32,812][52060] Updated weights for policy 0, policy_version 65120 (0.0008) [2023-10-08 02:33:35,441][52059] Updated weights for policy 1, policy_version 65922 (0.0009) [2023-10-08 02:33:35,801][52059] Updated weights for policy 1, policy_version 65932 (0.0010) [2023-10-08 02:33:36,166][52059] Updated weights for policy 1, policy_version 65942 (0.0009) [2023-10-08 02:33:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 134184960. Throughput: 0: 1698.8, 1: 1737.9. Samples: 33556370. Policy #0 lag: (min: 31.0, avg: 32.5, max: 57.0) [2023-10-08 02:33:36,211][50642] Avg episode reward: [(0, '21.660'), (1, '24.410')] [2023-10-08 02:33:36,533][52059] Updated weights for policy 1, policy_version 65952 (0.0008) [2023-10-08 02:33:36,797][52060] Updated weights for policy 0, policy_version 65130 (0.0010) [2023-10-08 02:33:37,160][52060] Updated weights for policy 0, policy_version 65140 (0.0010) [2023-10-08 02:33:37,528][52060] Updated weights for policy 0, policy_version 65150 (0.0011) [2023-10-08 02:33:40,366][52059] Updated weights for policy 1, policy_version 65962 (0.0008) [2023-10-08 02:33:40,728][52059] Updated weights for policy 1, policy_version 65972 (0.0008) [2023-10-08 02:33:41,092][52059] Updated weights for policy 1, policy_version 65982 (0.0007) [2023-10-08 02:33:41,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 134283264. Throughput: 0: 1727.2, 1: 1744.4. Samples: 33577608. Policy #0 lag: (min: 31.0, avg: 32.5, max: 57.0) [2023-10-08 02:33:41,211][50642] Avg episode reward: [(0, '18.350'), (1, '20.820')] [2023-10-08 02:33:41,539][52060] Updated weights for policy 0, policy_version 65160 (0.0007) [2023-10-08 02:33:41,902][52060] Updated weights for policy 0, policy_version 65170 (0.0010) [2023-10-08 02:33:42,264][52060] Updated weights for policy 0, policy_version 65180 (0.0007) [2023-10-08 02:33:44,902][52059] Updated weights for policy 1, policy_version 65992 (0.0010) [2023-10-08 02:33:45,270][52059] Updated weights for policy 1, policy_version 66002 (0.0008) [2023-10-08 02:33:45,641][52059] Updated weights for policy 1, policy_version 66012 (0.0009) [2023-10-08 02:33:46,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 134348800. Throughput: 0: 1732.4, 1: 1712.5. Samples: 33597906. Policy #0 lag: (min: 31.0, avg: 32.5, max: 57.0) [2023-10-08 02:33:46,211][50642] Avg episode reward: [(0, '17.630'), (1, '20.790')] [2023-10-08 02:33:46,248][52060] Updated weights for policy 0, policy_version 65190 (0.0010) [2023-10-08 02:33:46,624][52060] Updated weights for policy 0, policy_version 65200 (0.0007) [2023-10-08 02:33:46,991][52060] Updated weights for policy 0, policy_version 65210 (0.0009) [2023-10-08 02:33:49,537][52059] Updated weights for policy 1, policy_version 66022 (0.0010) [2023-10-08 02:33:49,903][52059] Updated weights for policy 1, policy_version 66032 (0.0010) [2023-10-08 02:33:50,269][52059] Updated weights for policy 1, policy_version 66042 (0.0009) [2023-10-08 02:33:51,056][52060] Updated weights for policy 0, policy_version 65220 (0.0009) [2023-10-08 02:33:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 134414336. Throughput: 0: 1716.0, 1: 1747.1. Samples: 33608608. Policy #0 lag: (min: 31.0, avg: 32.5, max: 57.0) [2023-10-08 02:33:51,212][50642] Avg episode reward: [(0, '19.770'), (1, '22.240')] [2023-10-08 02:33:51,421][52060] Updated weights for policy 0, policy_version 65230 (0.0007) [2023-10-08 02:33:51,790][52060] Updated weights for policy 0, policy_version 65240 (0.0008) [2023-10-08 02:33:54,270][52059] Updated weights for policy 1, policy_version 66052 (0.0007) [2023-10-08 02:33:54,632][52059] Updated weights for policy 1, policy_version 66062 (0.0007) [2023-10-08 02:33:54,992][52059] Updated weights for policy 1, policy_version 66072 (0.0007) [2023-10-08 02:33:55,820][52060] Updated weights for policy 0, policy_version 65250 (0.0008) [2023-10-08 02:33:56,186][52060] Updated weights for policy 0, policy_version 65260 (0.0008) [2023-10-08 02:33:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 134479872. Throughput: 0: 1730.1, 1: 1725.6. Samples: 33629114. Policy #0 lag: (min: 31.0, avg: 32.5, max: 57.0) [2023-10-08 02:33:56,211][50642] Avg episode reward: [(0, '22.190'), (1, '21.520')] [2023-10-08 02:33:56,548][52060] Updated weights for policy 0, policy_version 65270 (0.0007) [2023-10-08 02:33:56,919][52060] Updated weights for policy 0, policy_version 65280 (0.0007) [2023-10-08 02:33:58,925][52059] Updated weights for policy 1, policy_version 66082 (0.0007) [2023-10-08 02:33:59,294][52059] Updated weights for policy 1, policy_version 66092 (0.0008) [2023-10-08 02:33:59,647][52059] Updated weights for policy 1, policy_version 66102 (0.0007) [2023-10-08 02:34:00,008][52059] Updated weights for policy 1, policy_version 66112 (0.0008) [2023-10-08 02:34:00,666][52060] Updated weights for policy 0, policy_version 65290 (0.0010) [2023-10-08 02:34:01,035][52060] Updated weights for policy 0, policy_version 65300 (0.0008) [2023-10-08 02:34:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 134545408. Throughput: 0: 1719.6, 1: 1720.6. Samples: 33649752. Policy #0 lag: (min: 31.0, avg: 32.5, max: 57.0) [2023-10-08 02:34:01,211][50642] Avg episode reward: [(0, '16.000'), (1, '17.660')] [2023-10-08 02:34:01,401][52060] Updated weights for policy 0, policy_version 65310 (0.0010) [2023-10-08 02:34:03,880][52059] Updated weights for policy 1, policy_version 66122 (0.0009) [2023-10-08 02:34:04,245][52059] Updated weights for policy 1, policy_version 66132 (0.0008) [2023-10-08 02:34:04,616][52059] Updated weights for policy 1, policy_version 66142 (0.0009) [2023-10-08 02:34:05,310][52060] Updated weights for policy 0, policy_version 65320 (0.0008) [2023-10-08 02:34:05,676][52060] Updated weights for policy 0, policy_version 65330 (0.0008) [2023-10-08 02:34:06,055][52060] Updated weights for policy 0, policy_version 65340 (0.0007) [2023-10-08 02:34:06,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 134643712. Throughput: 0: 1732.7, 1: 1737.6. Samples: 33660584. Policy #0 lag: (min: 27.0, avg: 35.0, max: 59.0) [2023-10-08 02:34:06,211][50642] Avg episode reward: [(0, '15.720'), (1, '19.870')] [2023-10-08 02:34:08,611][52059] Updated weights for policy 1, policy_version 66152 (0.0010) [2023-10-08 02:34:08,983][52059] Updated weights for policy 1, policy_version 66162 (0.0010) [2023-10-08 02:34:09,343][52059] Updated weights for policy 1, policy_version 66172 (0.0007) [2023-10-08 02:34:10,070][52060] Updated weights for policy 0, policy_version 65350 (0.0009) [2023-10-08 02:34:10,438][52060] Updated weights for policy 0, policy_version 65360 (0.0008) [2023-10-08 02:34:10,803][52060] Updated weights for policy 0, policy_version 65370 (0.0009) [2023-10-08 02:34:11,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 134709248. Throughput: 0: 1742.3, 1: 1722.7. Samples: 33681086. Policy #0 lag: (min: 27.0, avg: 35.0, max: 59.0) [2023-10-08 02:34:11,211][50642] Avg episode reward: [(0, '17.120'), (1, '21.920')] [2023-10-08 02:34:13,260][52059] Updated weights for policy 1, policy_version 66182 (0.0007) [2023-10-08 02:34:13,615][52059] Updated weights for policy 1, policy_version 66192 (0.0008) [2023-10-08 02:34:13,986][52059] Updated weights for policy 1, policy_version 66202 (0.0009) [2023-10-08 02:34:14,932][52060] Updated weights for policy 0, policy_version 65380 (0.0011) [2023-10-08 02:34:15,302][52060] Updated weights for policy 0, policy_version 65390 (0.0011) [2023-10-08 02:34:15,660][52060] Updated weights for policy 0, policy_version 65400 (0.0009) [2023-10-08 02:34:16,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 134774784. Throughput: 0: 1703.8, 1: 1729.0. Samples: 33701104. Policy #0 lag: (min: 27.0, avg: 35.0, max: 59.0) [2023-10-08 02:34:16,211][50642] Avg episode reward: [(0, '14.610'), (1, '16.460')] [2023-10-08 02:34:17,851][52059] Updated weights for policy 1, policy_version 66212 (0.0010) [2023-10-08 02:34:18,226][52059] Updated weights for policy 1, policy_version 66222 (0.0010) [2023-10-08 02:34:18,578][52059] Updated weights for policy 1, policy_version 66232 (0.0010) [2023-10-08 02:34:19,575][52060] Updated weights for policy 0, policy_version 65410 (0.0009) [2023-10-08 02:34:19,949][52060] Updated weights for policy 0, policy_version 65420 (0.0007) [2023-10-08 02:34:20,311][52060] Updated weights for policy 0, policy_version 65430 (0.0007) [2023-10-08 02:34:20,687][52060] Updated weights for policy 0, policy_version 65440 (0.0007) [2023-10-08 02:34:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 134840320. Throughput: 0: 1729.3, 1: 1725.3. Samples: 33711830. Policy #0 lag: (min: 27.0, avg: 35.0, max: 59.0) [2023-10-08 02:34:21,211][50642] Avg episode reward: [(0, '14.740'), (1, '18.380')] [2023-10-08 02:34:22,544][52059] Updated weights for policy 1, policy_version 66242 (0.0010) [2023-10-08 02:34:22,913][52059] Updated weights for policy 1, policy_version 66252 (0.0009) [2023-10-08 02:34:23,269][52059] Updated weights for policy 1, policy_version 66262 (0.0008) [2023-10-08 02:34:23,638][52059] Updated weights for policy 1, policy_version 66272 (0.0008) [2023-10-08 02:34:24,694][52060] Updated weights for policy 0, policy_version 65450 (0.0009) [2023-10-08 02:34:25,066][52060] Updated weights for policy 0, policy_version 65460 (0.0010) [2023-10-08 02:34:25,433][52060] Updated weights for policy 0, policy_version 65470 (0.0009) [2023-10-08 02:34:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 134905856. Throughput: 0: 1720.7, 1: 1722.0. Samples: 33732526. Policy #0 lag: (min: 27.0, avg: 35.0, max: 59.0) [2023-10-08 02:34:26,211][50642] Avg episode reward: [(0, '17.360'), (1, '19.700')] [2023-10-08 02:34:27,476][52059] Updated weights for policy 1, policy_version 66282 (0.0007) [2023-10-08 02:34:27,842][52059] Updated weights for policy 1, policy_version 66292 (0.0007) [2023-10-08 02:34:28,207][52059] Updated weights for policy 1, policy_version 66302 (0.0010) [2023-10-08 02:34:29,344][52060] Updated weights for policy 0, policy_version 65480 (0.0007) [2023-10-08 02:34:29,712][52060] Updated weights for policy 0, policy_version 65490 (0.0008) [2023-10-08 02:34:30,069][52060] Updated weights for policy 0, policy_version 65500 (0.0007) [2023-10-08 02:34:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 134971392. Throughput: 0: 1698.6, 1: 1751.0. Samples: 33753136. Policy #0 lag: (min: 27.0, avg: 35.0, max: 59.0) [2023-10-08 02:34:31,211][50642] Avg episode reward: [(0, '20.680'), (1, '21.230')] [2023-10-08 02:34:32,119][52059] Updated weights for policy 1, policy_version 66312 (0.0010) [2023-10-08 02:34:32,480][52059] Updated weights for policy 1, policy_version 66322 (0.0007) [2023-10-08 02:34:32,847][52059] Updated weights for policy 1, policy_version 66332 (0.0007) [2023-10-08 02:34:33,959][52060] Updated weights for policy 0, policy_version 65510 (0.0009) [2023-10-08 02:34:34,333][52060] Updated weights for policy 0, policy_version 65520 (0.0009) [2023-10-08 02:34:34,706][52060] Updated weights for policy 0, policy_version 65530 (0.0010) [2023-10-08 02:34:36,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 135036928. Throughput: 0: 1727.8, 1: 1720.0. Samples: 33763758. Policy #0 lag: (min: 27.0, avg: 35.0, max: 59.0) [2023-10-08 02:34:36,211][50642] Avg episode reward: [(0, '15.270'), (1, '21.130')] [2023-10-08 02:34:36,757][52059] Updated weights for policy 1, policy_version 66342 (0.0007) [2023-10-08 02:34:37,122][52059] Updated weights for policy 1, policy_version 66352 (0.0008) [2023-10-08 02:34:37,484][52059] Updated weights for policy 1, policy_version 66362 (0.0008) [2023-10-08 02:34:38,562][52060] Updated weights for policy 0, policy_version 65540 (0.0008) [2023-10-08 02:34:38,927][52060] Updated weights for policy 0, policy_version 65550 (0.0008) [2023-10-08 02:34:39,291][52060] Updated weights for policy 0, policy_version 65560 (0.0008) [2023-10-08 02:34:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 135102464. Throughput: 0: 1703.0, 1: 1743.4. Samples: 33784204. Policy #0 lag: (min: 27.0, avg: 35.0, max: 59.0) [2023-10-08 02:34:41,211][50642] Avg episode reward: [(0, '17.750'), (1, '20.320')] [2023-10-08 02:34:41,420][52059] Updated weights for policy 1, policy_version 66372 (0.0008) [2023-10-08 02:34:41,772][52059] Updated weights for policy 1, policy_version 66382 (0.0008) [2023-10-08 02:34:42,139][52059] Updated weights for policy 1, policy_version 66392 (0.0008) [2023-10-08 02:34:43,382][52060] Updated weights for policy 0, policy_version 65570 (0.0008) [2023-10-08 02:34:43,747][52060] Updated weights for policy 0, policy_version 65580 (0.0008) [2023-10-08 02:34:44,113][52060] Updated weights for policy 0, policy_version 65590 (0.0007) [2023-10-08 02:34:44,477][52060] Updated weights for policy 0, policy_version 65600 (0.0008) [2023-10-08 02:34:46,165][52059] Updated weights for policy 1, policy_version 66402 (0.0007) [2023-10-08 02:34:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 135168000. Throughput: 0: 1711.5, 1: 1747.5. Samples: 33805406. Policy #0 lag: (min: 6.0, avg: 13.6, max: 38.0) [2023-10-08 02:34:46,211][50642] Avg episode reward: [(0, '20.190'), (1, '21.430')] [2023-10-08 02:34:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000065600_67174400.pth... [2023-10-08 02:34:46,259][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000064000_65536000.pth [2023-10-08 02:34:46,529][52059] Updated weights for policy 1, policy_version 66412 (0.0008) [2023-10-08 02:34:46,888][52059] Updated weights for policy 1, policy_version 66422 (0.0010) [2023-10-08 02:34:47,254][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000066432_68026368.pth... [2023-10-08 02:34:47,259][52059] Updated weights for policy 1, policy_version 66432 (0.0008) [2023-10-08 02:34:47,285][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000064800_66355200.pth [2023-10-08 02:34:48,445][52060] Updated weights for policy 0, policy_version 65610 (0.0009) [2023-10-08 02:34:48,808][52060] Updated weights for policy 0, policy_version 65620 (0.0010) [2023-10-08 02:34:49,175][52060] Updated weights for policy 0, policy_version 65630 (0.0009) [2023-10-08 02:34:51,051][52059] Updated weights for policy 1, policy_version 66442 (0.0010) [2023-10-08 02:34:51,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 135233536. Throughput: 0: 1711.4, 1: 1726.5. Samples: 33815292. Policy #0 lag: (min: 6.0, avg: 13.6, max: 38.0) [2023-10-08 02:34:51,211][50642] Avg episode reward: [(0, '18.940'), (1, '18.510')] [2023-10-08 02:34:51,419][52059] Updated weights for policy 1, policy_version 66452 (0.0008) [2023-10-08 02:34:51,779][52059] Updated weights for policy 1, policy_version 66462 (0.0008) [2023-10-08 02:34:53,282][52060] Updated weights for policy 0, policy_version 65640 (0.0011) [2023-10-08 02:34:53,647][52060] Updated weights for policy 0, policy_version 65650 (0.0010) [2023-10-08 02:34:54,011][52060] Updated weights for policy 0, policy_version 65660 (0.0010) [2023-10-08 02:34:55,847][52059] Updated weights for policy 1, policy_version 66472 (0.0010) [2023-10-08 02:34:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 135299072. Throughput: 0: 1686.5, 1: 1754.4. Samples: 33835926. Policy #0 lag: (min: 6.0, avg: 13.6, max: 38.0) [2023-10-08 02:34:56,211][50642] Avg episode reward: [(0, '17.050'), (1, '19.310')] [2023-10-08 02:34:56,221][52059] Updated weights for policy 1, policy_version 66482 (0.0008) [2023-10-08 02:34:56,586][52059] Updated weights for policy 1, policy_version 66492 (0.0008) [2023-10-08 02:34:58,080][52060] Updated weights for policy 0, policy_version 65670 (0.0009) [2023-10-08 02:34:58,450][52060] Updated weights for policy 0, policy_version 65680 (0.0009) [2023-10-08 02:34:58,823][52060] Updated weights for policy 0, policy_version 65690 (0.0010) [2023-10-08 02:35:00,277][52059] Updated weights for policy 1, policy_version 66502 (0.0010) [2023-10-08 02:35:00,647][52059] Updated weights for policy 1, policy_version 66512 (0.0009) [2023-10-08 02:35:01,011][52059] Updated weights for policy 1, policy_version 66522 (0.0010) [2023-10-08 02:35:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 135364608. Throughput: 0: 1713.9, 1: 1737.5. Samples: 33856416. Policy #0 lag: (min: 6.0, avg: 13.6, max: 38.0) [2023-10-08 02:35:01,211][50642] Avg episode reward: [(0, '19.760'), (1, '18.620')] [2023-10-08 02:35:02,858][52060] Updated weights for policy 0, policy_version 65700 (0.0010) [2023-10-08 02:35:03,224][52060] Updated weights for policy 0, policy_version 65710 (0.0008) [2023-10-08 02:35:03,600][52060] Updated weights for policy 0, policy_version 65720 (0.0008) [2023-10-08 02:35:04,891][52059] Updated weights for policy 1, policy_version 66532 (0.0008) [2023-10-08 02:35:05,266][52059] Updated weights for policy 1, policy_version 66542 (0.0010) [2023-10-08 02:35:05,627][52059] Updated weights for policy 1, policy_version 66552 (0.0010) [2023-10-08 02:35:06,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 135462912. Throughput: 0: 1690.5, 1: 1755.7. Samples: 33866910. Policy #0 lag: (min: 6.0, avg: 13.6, max: 38.0) [2023-10-08 02:35:06,211][50642] Avg episode reward: [(0, '20.670'), (1, '21.610')] [2023-10-08 02:35:07,449][52060] Updated weights for policy 0, policy_version 65730 (0.0009) [2023-10-08 02:35:07,823][52060] Updated weights for policy 0, policy_version 65740 (0.0008) [2023-10-08 02:35:08,190][52060] Updated weights for policy 0, policy_version 65750 (0.0007) [2023-10-08 02:35:08,566][52060] Updated weights for policy 0, policy_version 65760 (0.0007) [2023-10-08 02:35:09,521][52059] Updated weights for policy 1, policy_version 66562 (0.0010) [2023-10-08 02:35:09,878][52059] Updated weights for policy 1, policy_version 66572 (0.0010) [2023-10-08 02:35:10,247][52059] Updated weights for policy 1, policy_version 66582 (0.0008) [2023-10-08 02:35:10,605][52059] Updated weights for policy 1, policy_version 66592 (0.0007) [2023-10-08 02:35:11,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 135528448. Throughput: 0: 1696.0, 1: 1748.6. Samples: 33887534. Policy #0 lag: (min: 6.0, avg: 13.6, max: 38.0) [2023-10-08 02:35:11,211][50642] Avg episode reward: [(0, '16.640'), (1, '22.340')] [2023-10-08 02:35:12,748][52060] Updated weights for policy 0, policy_version 65770 (0.0008) [2023-10-08 02:35:13,117][52060] Updated weights for policy 0, policy_version 65780 (0.0008) [2023-10-08 02:35:13,476][52060] Updated weights for policy 0, policy_version 65790 (0.0007) [2023-10-08 02:35:14,609][52059] Updated weights for policy 1, policy_version 66602 (0.0008) [2023-10-08 02:35:14,974][52059] Updated weights for policy 1, policy_version 66612 (0.0007) [2023-10-08 02:35:15,344][52059] Updated weights for policy 1, policy_version 66622 (0.0007) [2023-10-08 02:35:16,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 135593984. Throughput: 0: 1714.6, 1: 1723.5. Samples: 33907848. Policy #0 lag: (min: 6.0, avg: 13.6, max: 38.0) [2023-10-08 02:35:16,211][50642] Avg episode reward: [(0, '19.910'), (1, '19.640')] [2023-10-08 02:35:17,355][52060] Updated weights for policy 0, policy_version 65800 (0.0009) [2023-10-08 02:35:17,719][52060] Updated weights for policy 0, policy_version 65810 (0.0008) [2023-10-08 02:35:18,088][52060] Updated weights for policy 0, policy_version 65820 (0.0008) [2023-10-08 02:35:19,247][52059] Updated weights for policy 1, policy_version 66632 (0.0009) [2023-10-08 02:35:19,609][52059] Updated weights for policy 1, policy_version 66642 (0.0009) [2023-10-08 02:35:19,974][52059] Updated weights for policy 1, policy_version 66652 (0.0008) [2023-10-08 02:35:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 135659520. Throughput: 0: 1685.7, 1: 1756.5. Samples: 33918656. Policy #0 lag: (min: 6.0, avg: 13.6, max: 38.0) [2023-10-08 02:35:21,211][50642] Avg episode reward: [(0, '19.900'), (1, '20.910')] [2023-10-08 02:35:22,045][52060] Updated weights for policy 0, policy_version 65830 (0.0008) [2023-10-08 02:35:22,413][52060] Updated weights for policy 0, policy_version 65840 (0.0009) [2023-10-08 02:35:22,783][52060] Updated weights for policy 0, policy_version 65850 (0.0008) [2023-10-08 02:35:23,924][52059] Updated weights for policy 1, policy_version 66662 (0.0009) [2023-10-08 02:35:24,299][52059] Updated weights for policy 1, policy_version 66672 (0.0009) [2023-10-08 02:35:24,672][52059] Updated weights for policy 1, policy_version 66682 (0.0010) [2023-10-08 02:35:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 135725056. Throughput: 0: 1710.9, 1: 1724.1. Samples: 33938780. Policy #0 lag: (min: 6.0, avg: 13.6, max: 38.0) [2023-10-08 02:35:26,211][50642] Avg episode reward: [(0, '18.380'), (1, '20.730')] [2023-10-08 02:35:26,740][52060] Updated weights for policy 0, policy_version 65860 (0.0008) [2023-10-08 02:35:27,101][52060] Updated weights for policy 0, policy_version 65870 (0.0009) [2023-10-08 02:35:27,471][52060] Updated weights for policy 0, policy_version 65880 (0.0009) [2023-10-08 02:35:28,642][52059] Updated weights for policy 1, policy_version 66692 (0.0010) [2023-10-08 02:35:29,011][52059] Updated weights for policy 1, policy_version 66702 (0.0010) [2023-10-08 02:35:29,373][52059] Updated weights for policy 1, policy_version 66712 (0.0010) [2023-10-08 02:35:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 135790592. Throughput: 0: 1714.2, 1: 1721.4. Samples: 33960008. Policy #0 lag: (min: 10.0, avg: 10.0, max: 14.0) [2023-10-08 02:35:31,211][50642] Avg episode reward: [(0, '18.130'), (1, '20.880')] [2023-10-08 02:35:31,485][52060] Updated weights for policy 0, policy_version 65890 (0.0010) [2023-10-08 02:35:31,851][52060] Updated weights for policy 0, policy_version 65900 (0.0010) [2023-10-08 02:35:32,212][52060] Updated weights for policy 0, policy_version 65910 (0.0008) [2023-10-08 02:35:32,581][52060] Updated weights for policy 0, policy_version 65920 (0.0007) [2023-10-08 02:35:33,265][52059] Updated weights for policy 1, policy_version 66722 (0.0008) [2023-10-08 02:35:33,630][52059] Updated weights for policy 1, policy_version 66732 (0.0007) [2023-10-08 02:35:33,998][52059] Updated weights for policy 1, policy_version 66742 (0.0009) [2023-10-08 02:35:34,355][52059] Updated weights for policy 1, policy_version 66752 (0.0009) [2023-10-08 02:35:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 135856128. Throughput: 0: 1699.2, 1: 1741.2. Samples: 33970112. Policy #0 lag: (min: 10.0, avg: 10.0, max: 14.0) [2023-10-08 02:35:36,211][50642] Avg episode reward: [(0, '17.170'), (1, '20.040')] [2023-10-08 02:35:36,610][52060] Updated weights for policy 0, policy_version 65930 (0.0010) [2023-10-08 02:35:36,980][52060] Updated weights for policy 0, policy_version 65940 (0.0009) [2023-10-08 02:35:37,352][52060] Updated weights for policy 0, policy_version 65950 (0.0007) [2023-10-08 02:35:38,128][52059] Updated weights for policy 1, policy_version 66762 (0.0008) [2023-10-08 02:35:38,502][52059] Updated weights for policy 1, policy_version 66772 (0.0010) [2023-10-08 02:35:38,858][52059] Updated weights for policy 1, policy_version 66782 (0.0009) [2023-10-08 02:35:41,205][52060] Updated weights for policy 0, policy_version 65960 (0.0008) [2023-10-08 02:35:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 135921664. Throughput: 0: 1716.1, 1: 1726.9. Samples: 33990864. Policy #0 lag: (min: 10.0, avg: 10.0, max: 14.0) [2023-10-08 02:35:41,211][50642] Avg episode reward: [(0, '17.680'), (1, '18.690')] [2023-10-08 02:35:41,561][52060] Updated weights for policy 0, policy_version 65970 (0.0008) [2023-10-08 02:35:41,936][52060] Updated weights for policy 0, policy_version 65980 (0.0008) [2023-10-08 02:35:42,947][52059] Updated weights for policy 1, policy_version 66792 (0.0009) [2023-10-08 02:35:43,318][52059] Updated weights for policy 1, policy_version 66802 (0.0007) [2023-10-08 02:35:43,682][52059] Updated weights for policy 1, policy_version 66812 (0.0007) [2023-10-08 02:35:45,981][52060] Updated weights for policy 0, policy_version 65990 (0.0007) [2023-10-08 02:35:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 135987200. Throughput: 0: 1719.6, 1: 1741.9. Samples: 34012186. Policy #0 lag: (min: 10.0, avg: 10.0, max: 14.0) [2023-10-08 02:35:46,211][50642] Avg episode reward: [(0, '17.060'), (1, '21.350')] [2023-10-08 02:35:46,348][52060] Updated weights for policy 0, policy_version 66000 (0.0007) [2023-10-08 02:35:46,725][52060] Updated weights for policy 0, policy_version 66010 (0.0008) [2023-10-08 02:35:47,583][52059] Updated weights for policy 1, policy_version 66822 (0.0010) [2023-10-08 02:35:47,950][52059] Updated weights for policy 1, policy_version 66832 (0.0008) [2023-10-08 02:35:48,307][52059] Updated weights for policy 1, policy_version 66842 (0.0009) [2023-10-08 02:35:50,782][52060] Updated weights for policy 0, policy_version 66020 (0.0009) [2023-10-08 02:35:51,142][52060] Updated weights for policy 0, policy_version 66030 (0.0009) [2023-10-08 02:35:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 136052736. Throughput: 0: 1717.6, 1: 1721.2. Samples: 34021654. Policy #0 lag: (min: 10.0, avg: 10.0, max: 14.0) [2023-10-08 02:35:51,211][50642] Avg episode reward: [(0, '18.930'), (1, '19.300')] [2023-10-08 02:35:51,517][52060] Updated weights for policy 0, policy_version 66040 (0.0008) [2023-10-08 02:35:52,206][52059] Updated weights for policy 1, policy_version 66852 (0.0009) [2023-10-08 02:35:52,580][52059] Updated weights for policy 1, policy_version 66862 (0.0009) [2023-10-08 02:35:52,949][52059] Updated weights for policy 1, policy_version 66872 (0.0010) [2023-10-08 02:35:55,542][52060] Updated weights for policy 0, policy_version 66050 (0.0010) [2023-10-08 02:35:55,905][52060] Updated weights for policy 0, policy_version 66060 (0.0008) [2023-10-08 02:35:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 136118272. Throughput: 0: 1724.7, 1: 1729.6. Samples: 34042976. Policy #0 lag: (min: 10.0, avg: 10.0, max: 14.0) [2023-10-08 02:35:56,211][50642] Avg episode reward: [(0, '18.280'), (1, '19.390')] [2023-10-08 02:35:56,283][52060] Updated weights for policy 0, policy_version 66070 (0.0009) [2023-10-08 02:35:56,655][52060] Updated weights for policy 0, policy_version 66080 (0.0009) [2023-10-08 02:35:56,874][52059] Updated weights for policy 1, policy_version 66882 (0.0010) [2023-10-08 02:35:57,236][52059] Updated weights for policy 1, policy_version 66892 (0.0009) [2023-10-08 02:35:57,595][52059] Updated weights for policy 1, policy_version 66902 (0.0007) [2023-10-08 02:35:57,961][52059] Updated weights for policy 1, policy_version 66912 (0.0007) [2023-10-08 02:36:00,409][52060] Updated weights for policy 0, policy_version 66090 (0.0008) [2023-10-08 02:36:00,770][52060] Updated weights for policy 0, policy_version 66100 (0.0009) [2023-10-08 02:36:01,137][52060] Updated weights for policy 0, policy_version 66110 (0.0010) [2023-10-08 02:36:01,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 136216576. Throughput: 0: 1708.2, 1: 1752.5. Samples: 34063578. Policy #0 lag: (min: 10.0, avg: 10.0, max: 14.0) [2023-10-08 02:36:01,211][50642] Avg episode reward: [(0, '18.550'), (1, '20.890')] [2023-10-08 02:36:02,023][52059] Updated weights for policy 1, policy_version 66922 (0.0008) [2023-10-08 02:36:02,386][52059] Updated weights for policy 1, policy_version 66932 (0.0008) [2023-10-08 02:36:02,741][52059] Updated weights for policy 1, policy_version 66942 (0.0010) [2023-10-08 02:36:04,926][52060] Updated weights for policy 0, policy_version 66120 (0.0008) [2023-10-08 02:36:05,297][52060] Updated weights for policy 0, policy_version 66130 (0.0008) [2023-10-08 02:36:05,666][52060] Updated weights for policy 0, policy_version 66140 (0.0007) [2023-10-08 02:36:06,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 136282112. Throughput: 0: 1732.4, 1: 1717.1. Samples: 34073882. Policy #0 lag: (min: 10.0, avg: 10.0, max: 14.0) [2023-10-08 02:36:06,211][50642] Avg episode reward: [(0, '20.170'), (1, '22.580')] [2023-10-08 02:36:06,607][52059] Updated weights for policy 1, policy_version 66952 (0.0009) [2023-10-08 02:36:06,969][52059] Updated weights for policy 1, policy_version 66962 (0.0010) [2023-10-08 02:36:07,342][52059] Updated weights for policy 1, policy_version 66972 (0.0007) [2023-10-08 02:36:09,623][52060] Updated weights for policy 0, policy_version 66150 (0.0007) [2023-10-08 02:36:10,001][52060] Updated weights for policy 0, policy_version 66160 (0.0007) [2023-10-08 02:36:10,367][52060] Updated weights for policy 0, policy_version 66170 (0.0007) [2023-10-08 02:36:11,171][52059] Updated weights for policy 1, policy_version 66982 (0.0008) [2023-10-08 02:36:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 136347648. Throughput: 0: 1721.1, 1: 1750.0. Samples: 34094982. Policy #0 lag: (min: 0.0, avg: 22.6, max: 32.0) [2023-10-08 02:36:11,211][50642] Avg episode reward: [(0, '18.210'), (1, '20.990')] [2023-10-08 02:36:11,542][52059] Updated weights for policy 1, policy_version 66992 (0.0008) [2023-10-08 02:36:11,915][52059] Updated weights for policy 1, policy_version 67002 (0.0008) [2023-10-08 02:36:14,338][52060] Updated weights for policy 0, policy_version 66180 (0.0008) [2023-10-08 02:36:14,710][52060] Updated weights for policy 0, policy_version 66190 (0.0011) [2023-10-08 02:36:15,077][52060] Updated weights for policy 0, policy_version 66200 (0.0010) [2023-10-08 02:36:15,755][52059] Updated weights for policy 1, policy_version 67012 (0.0009) [2023-10-08 02:36:16,116][52059] Updated weights for policy 1, policy_version 67022 (0.0009) [2023-10-08 02:36:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 136413184. Throughput: 0: 1702.3, 1: 1749.5. Samples: 34115338. Policy #0 lag: (min: 0.0, avg: 22.6, max: 32.0) [2023-10-08 02:36:16,211][50642] Avg episode reward: [(0, '19.170'), (1, '21.200')] [2023-10-08 02:36:16,477][52059] Updated weights for policy 1, policy_version 67032 (0.0008) [2023-10-08 02:36:18,924][52060] Updated weights for policy 0, policy_version 66210 (0.0011) [2023-10-08 02:36:19,284][52060] Updated weights for policy 0, policy_version 66220 (0.0007) [2023-10-08 02:36:19,653][52060] Updated weights for policy 0, policy_version 66230 (0.0007) [2023-10-08 02:36:20,018][52060] Updated weights for policy 0, policy_version 66240 (0.0010) [2023-10-08 02:36:20,372][52059] Updated weights for policy 1, policy_version 67042 (0.0008) [2023-10-08 02:36:20,739][52059] Updated weights for policy 1, policy_version 67052 (0.0007) [2023-10-08 02:36:21,113][52059] Updated weights for policy 1, policy_version 67062 (0.0008) [2023-10-08 02:36:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 136478720. Throughput: 0: 1735.7, 1: 1738.0. Samples: 34126426. Policy #0 lag: (min: 0.0, avg: 22.6, max: 32.0) [2023-10-08 02:36:21,211][50642] Avg episode reward: [(0, '19.550'), (1, '20.680')] [2023-10-08 02:36:21,477][52059] Updated weights for policy 1, policy_version 67072 (0.0008) [2023-10-08 02:36:24,065][52060] Updated weights for policy 0, policy_version 66250 (0.0008) [2023-10-08 02:36:24,438][52060] Updated weights for policy 0, policy_version 66260 (0.0010) [2023-10-08 02:36:24,802][52060] Updated weights for policy 0, policy_version 66270 (0.0011) [2023-10-08 02:36:25,346][52059] Updated weights for policy 1, policy_version 67082 (0.0008) [2023-10-08 02:36:25,709][52059] Updated weights for policy 1, policy_version 67092 (0.0008) [2023-10-08 02:36:26,071][52059] Updated weights for policy 1, policy_version 67102 (0.0009) [2023-10-08 02:36:26,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 136577024. Throughput: 0: 1706.6, 1: 1756.0. Samples: 34146682. Policy #0 lag: (min: 0.0, avg: 22.6, max: 32.0) [2023-10-08 02:36:26,211][50642] Avg episode reward: [(0, '16.860'), (1, '23.670')] [2023-10-08 02:36:28,729][52060] Updated weights for policy 0, policy_version 66280 (0.0008) [2023-10-08 02:36:29,092][52060] Updated weights for policy 0, policy_version 66290 (0.0009) [2023-10-08 02:36:29,454][52060] Updated weights for policy 0, policy_version 66300 (0.0007) [2023-10-08 02:36:30,153][52059] Updated weights for policy 1, policy_version 67112 (0.0007) [2023-10-08 02:36:30,531][52059] Updated weights for policy 1, policy_version 67122 (0.0008) [2023-10-08 02:36:30,888][52059] Updated weights for policy 1, policy_version 67132 (0.0008) [2023-10-08 02:36:31,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 136642560. Throughput: 0: 1707.5, 1: 1728.4. Samples: 34166802. Policy #0 lag: (min: 0.0, avg: 22.6, max: 32.0) [2023-10-08 02:36:31,211][50642] Avg episode reward: [(0, '18.290'), (1, '19.200')] [2023-10-08 02:36:33,469][52060] Updated weights for policy 0, policy_version 66310 (0.0008) [2023-10-08 02:36:33,852][52060] Updated weights for policy 0, policy_version 66320 (0.0008) [2023-10-08 02:36:34,214][52060] Updated weights for policy 0, policy_version 66330 (0.0010) [2023-10-08 02:36:34,821][52059] Updated weights for policy 1, policy_version 67142 (0.0010) [2023-10-08 02:36:35,188][52059] Updated weights for policy 1, policy_version 67152 (0.0007) [2023-10-08 02:36:35,547][52059] Updated weights for policy 1, policy_version 67162 (0.0012) [2023-10-08 02:36:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 136708096. Throughput: 0: 1725.4, 1: 1750.9. Samples: 34178088. Policy #0 lag: (min: 0.0, avg: 22.6, max: 32.0) [2023-10-08 02:36:36,211][50642] Avg episode reward: [(0, '21.890'), (1, '19.730')] [2023-10-08 02:36:38,357][52060] Updated weights for policy 0, policy_version 66340 (0.0009) [2023-10-08 02:36:38,729][52060] Updated weights for policy 0, policy_version 66350 (0.0007) [2023-10-08 02:36:39,087][52060] Updated weights for policy 0, policy_version 66360 (0.0008) [2023-10-08 02:36:39,549][52059] Updated weights for policy 1, policy_version 67172 (0.0009) [2023-10-08 02:36:39,905][52059] Updated weights for policy 1, policy_version 67182 (0.0009) [2023-10-08 02:36:40,276][52059] Updated weights for policy 1, policy_version 67192 (0.0007) [2023-10-08 02:36:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 136773632. Throughput: 0: 1707.2, 1: 1740.6. Samples: 34198128. Policy #0 lag: (min: 0.0, avg: 22.6, max: 32.0) [2023-10-08 02:36:41,211][50642] Avg episode reward: [(0, '17.530'), (1, '20.980')] [2023-10-08 02:36:43,104][52060] Updated weights for policy 0, policy_version 66370 (0.0009) [2023-10-08 02:36:43,467][52060] Updated weights for policy 0, policy_version 66380 (0.0010) [2023-10-08 02:36:43,831][52060] Updated weights for policy 0, policy_version 66390 (0.0008) [2023-10-08 02:36:44,195][52060] Updated weights for policy 0, policy_version 66400 (0.0009) [2023-10-08 02:36:44,213][52059] Updated weights for policy 1, policy_version 67202 (0.0009) [2023-10-08 02:36:44,588][52059] Updated weights for policy 1, policy_version 67212 (0.0008) [2023-10-08 02:36:44,960][52059] Updated weights for policy 1, policy_version 67222 (0.0007) [2023-10-08 02:36:45,329][52059] Updated weights for policy 1, policy_version 67232 (0.0008) [2023-10-08 02:36:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 136839168. Throughput: 0: 1721.8, 1: 1721.3. Samples: 34218518. Policy #0 lag: (min: 0.0, avg: 22.6, max: 32.0) [2023-10-08 02:36:46,211][50642] Avg episode reward: [(0, '18.230'), (1, '20.840')] [2023-10-08 02:36:46,222][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000067232_68845568.pth... [2023-10-08 02:36:46,222][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000066400_67993600.pth... [2023-10-08 02:36:46,254][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000065600_67174400.pth [2023-10-08 02:36:46,257][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000064800_66355200.pth [2023-10-08 02:36:48,409][52060] Updated weights for policy 0, policy_version 66410 (0.0009) [2023-10-08 02:36:48,789][52060] Updated weights for policy 0, policy_version 66420 (0.0009) [2023-10-08 02:36:49,065][52059] Updated weights for policy 1, policy_version 67242 (0.0007) [2023-10-08 02:36:49,152][52060] Updated weights for policy 0, policy_version 66430 (0.0009) [2023-10-08 02:36:49,448][52059] Updated weights for policy 1, policy_version 67252 (0.0009) [2023-10-08 02:36:49,802][52059] Updated weights for policy 1, policy_version 67262 (0.0010) [2023-10-08 02:36:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 136904704. Throughput: 0: 1704.9, 1: 1754.1. Samples: 34229540. Policy #0 lag: (min: 0.0, avg: 22.6, max: 32.0) [2023-10-08 02:36:51,211][50642] Avg episode reward: [(0, '22.230'), (1, '18.610')] [2023-10-08 02:36:53,022][52060] Updated weights for policy 0, policy_version 66440 (0.0008) [2023-10-08 02:36:53,394][52060] Updated weights for policy 0, policy_version 66450 (0.0010) [2023-10-08 02:36:53,665][52059] Updated weights for policy 1, policy_version 67272 (0.0008) [2023-10-08 02:36:53,765][52060] Updated weights for policy 0, policy_version 66460 (0.0009) [2023-10-08 02:36:54,026][52059] Updated weights for policy 1, policy_version 67282 (0.0010) [2023-10-08 02:36:54,393][52059] Updated weights for policy 1, policy_version 67292 (0.0010) [2023-10-08 02:36:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 136970240. Throughput: 0: 1708.4, 1: 1725.4. Samples: 34249502. Policy #0 lag: (min: 16.0, avg: 39.6, max: 48.0) [2023-10-08 02:36:56,211][50642] Avg episode reward: [(0, '18.730'), (1, '20.930')] [2023-10-08 02:36:57,674][52060] Updated weights for policy 0, policy_version 66470 (0.0009) [2023-10-08 02:36:58,048][52060] Updated weights for policy 0, policy_version 66480 (0.0010) [2023-10-08 02:36:58,264][52059] Updated weights for policy 1, policy_version 67302 (0.0008) [2023-10-08 02:36:58,413][52060] Updated weights for policy 0, policy_version 66490 (0.0007) [2023-10-08 02:36:58,628][52059] Updated weights for policy 1, policy_version 67312 (0.0008) [2023-10-08 02:36:58,992][52059] Updated weights for policy 1, policy_version 67322 (0.0009) [2023-10-08 02:37:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 137035776. Throughput: 0: 1725.0, 1: 1732.7. Samples: 34270934. Policy #0 lag: (min: 16.0, avg: 39.6, max: 48.0) [2023-10-08 02:37:01,211][50642] Avg episode reward: [(0, '17.310'), (1, '17.040')] [2023-10-08 02:37:02,480][52060] Updated weights for policy 0, policy_version 66500 (0.0008) [2023-10-08 02:37:02,849][52060] Updated weights for policy 0, policy_version 66510 (0.0007) [2023-10-08 02:37:02,907][52059] Updated weights for policy 1, policy_version 67332 (0.0007) [2023-10-08 02:37:03,222][52060] Updated weights for policy 0, policy_version 66520 (0.0007) [2023-10-08 02:37:03,272][52059] Updated weights for policy 1, policy_version 67342 (0.0007) [2023-10-08 02:37:03,642][52059] Updated weights for policy 1, policy_version 67352 (0.0008) [2023-10-08 02:37:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 137101312. Throughput: 0: 1688.4, 1: 1728.5. Samples: 34280188. Policy #0 lag: (min: 16.0, avg: 39.6, max: 48.0) [2023-10-08 02:37:06,211][50642] Avg episode reward: [(0, '19.470'), (1, '18.720')] [2023-10-08 02:37:07,144][52060] Updated weights for policy 0, policy_version 66530 (0.0008) [2023-10-08 02:37:07,362][52059] Updated weights for policy 1, policy_version 67362 (0.0008) [2023-10-08 02:37:07,514][52060] Updated weights for policy 0, policy_version 66540 (0.0008) [2023-10-08 02:37:07,732][52059] Updated weights for policy 1, policy_version 67372 (0.0009) [2023-10-08 02:37:07,890][52060] Updated weights for policy 0, policy_version 66550 (0.0009) [2023-10-08 02:37:08,086][52059] Updated weights for policy 1, policy_version 67382 (0.0010) [2023-10-08 02:37:08,261][52060] Updated weights for policy 0, policy_version 66560 (0.0009) [2023-10-08 02:37:08,453][52059] Updated weights for policy 1, policy_version 67392 (0.0008) [2023-10-08 02:37:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 137166848. Throughput: 0: 1715.6, 1: 1723.0. Samples: 34301420. Policy #0 lag: (min: 16.0, avg: 39.6, max: 48.0) [2023-10-08 02:37:11,211][50642] Avg episode reward: [(0, '20.380'), (1, '17.880')] [2023-10-08 02:37:12,250][52060] Updated weights for policy 0, policy_version 66570 (0.0007) [2023-10-08 02:37:12,455][52059] Updated weights for policy 1, policy_version 67402 (0.0007) [2023-10-08 02:37:12,622][52060] Updated weights for policy 0, policy_version 66580 (0.0007) [2023-10-08 02:37:12,815][52059] Updated weights for policy 1, policy_version 67412 (0.0009) [2023-10-08 02:37:13,000][52060] Updated weights for policy 0, policy_version 66590 (0.0008) [2023-10-08 02:37:13,190][52059] Updated weights for policy 1, policy_version 67422 (0.0007) [2023-10-08 02:37:16,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 137232384. Throughput: 0: 1715.1, 1: 1752.7. Samples: 34322854. Policy #0 lag: (min: 16.0, avg: 39.6, max: 48.0) [2023-10-08 02:37:16,211][50642] Avg episode reward: [(0, '17.600'), (1, '21.910')] [2023-10-08 02:37:16,878][52060] Updated weights for policy 0, policy_version 66600 (0.0008) [2023-10-08 02:37:17,147][52059] Updated weights for policy 1, policy_version 67432 (0.0008) [2023-10-08 02:37:17,250][52060] Updated weights for policy 0, policy_version 66610 (0.0008) [2023-10-08 02:37:17,528][52059] Updated weights for policy 1, policy_version 67442 (0.0009) [2023-10-08 02:37:17,614][52060] Updated weights for policy 0, policy_version 66620 (0.0008) [2023-10-08 02:37:17,890][52059] Updated weights for policy 1, policy_version 67452 (0.0010) [2023-10-08 02:37:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 137297920. Throughput: 0: 1697.6, 1: 1724.1. Samples: 34332066. Policy #0 lag: (min: 16.0, avg: 39.6, max: 48.0) [2023-10-08 02:37:21,211][50642] Avg episode reward: [(0, '18.950'), (1, '20.620')] [2023-10-08 02:37:21,483][52060] Updated weights for policy 0, policy_version 66630 (0.0009) [2023-10-08 02:37:21,845][52060] Updated weights for policy 0, policy_version 66640 (0.0008) [2023-10-08 02:37:21,913][52059] Updated weights for policy 1, policy_version 67462 (0.0008) [2023-10-08 02:37:22,223][52060] Updated weights for policy 0, policy_version 66650 (0.0008) [2023-10-08 02:37:22,276][52059] Updated weights for policy 1, policy_version 67472 (0.0008) [2023-10-08 02:37:22,645][52059] Updated weights for policy 1, policy_version 67482 (0.0007) [2023-10-08 02:37:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 137363456. Throughput: 0: 1717.9, 1: 1733.1. Samples: 34353422. Policy #0 lag: (min: 16.0, avg: 39.6, max: 48.0) [2023-10-08 02:37:26,211][50642] Avg episode reward: [(0, '20.750'), (1, '20.220')] [2023-10-08 02:37:26,237][52060] Updated weights for policy 0, policy_version 66660 (0.0010) [2023-10-08 02:37:26,589][52059] Updated weights for policy 1, policy_version 67492 (0.0008) [2023-10-08 02:37:26,601][52060] Updated weights for policy 0, policy_version 66670 (0.0008) [2023-10-08 02:37:26,952][52059] Updated weights for policy 1, policy_version 67502 (0.0008) [2023-10-08 02:37:26,962][52060] Updated weights for policy 0, policy_version 66680 (0.0007) [2023-10-08 02:37:27,309][52059] Updated weights for policy 1, policy_version 67512 (0.0008) [2023-10-08 02:37:30,916][52060] Updated weights for policy 0, policy_version 66690 (0.0007) [2023-10-08 02:37:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 137428992. Throughput: 0: 1720.0, 1: 1752.7. Samples: 34374790. Policy #0 lag: (min: 16.0, avg: 39.6, max: 48.0) [2023-10-08 02:37:31,211][50642] Avg episode reward: [(0, '17.820'), (1, '22.540')] [2023-10-08 02:37:31,241][52059] Updated weights for policy 1, policy_version 67522 (0.0007) [2023-10-08 02:37:31,285][52060] Updated weights for policy 0, policy_version 66700 (0.0010) [2023-10-08 02:37:31,605][52059] Updated weights for policy 1, policy_version 67532 (0.0008) [2023-10-08 02:37:31,655][52060] Updated weights for policy 0, policy_version 66710 (0.0009) [2023-10-08 02:37:31,973][52059] Updated weights for policy 1, policy_version 67542 (0.0009) [2023-10-08 02:37:32,020][52060] Updated weights for policy 0, policy_version 66720 (0.0009) [2023-10-08 02:37:32,347][52059] Updated weights for policy 1, policy_version 67552 (0.0009) [2023-10-08 02:37:36,157][52059] Updated weights for policy 1, policy_version 67562 (0.0009) [2023-10-08 02:37:36,177][52060] Updated weights for policy 0, policy_version 66730 (0.0009) [2023-10-08 02:37:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 137494528. Throughput: 0: 1712.8, 1: 1724.0. Samples: 34384192. Policy #0 lag: (min: 16.0, avg: 39.6, max: 48.0) [2023-10-08 02:37:36,211][50642] Avg episode reward: [(0, '18.130'), (1, '22.980')] [2023-10-08 02:37:36,517][52059] Updated weights for policy 1, policy_version 67572 (0.0008) [2023-10-08 02:37:36,552][52060] Updated weights for policy 0, policy_version 66740 (0.0009) [2023-10-08 02:37:36,876][52059] Updated weights for policy 1, policy_version 67582 (0.0008) [2023-10-08 02:37:36,919][52060] Updated weights for policy 0, policy_version 66750 (0.0009) [2023-10-08 02:37:40,927][52060] Updated weights for policy 0, policy_version 66760 (0.0008) [2023-10-08 02:37:40,972][52059] Updated weights for policy 1, policy_version 67592 (0.0007) [2023-10-08 02:37:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 137560064. Throughput: 0: 1713.6, 1: 1747.2. Samples: 34405240. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 02:37:41,211][50642] Avg episode reward: [(0, '20.170'), (1, '19.490')] [2023-10-08 02:37:41,297][52060] Updated weights for policy 0, policy_version 66770 (0.0008) [2023-10-08 02:37:41,335][52059] Updated weights for policy 1, policy_version 67602 (0.0007) [2023-10-08 02:37:41,663][52060] Updated weights for policy 0, policy_version 66780 (0.0007) [2023-10-08 02:37:41,702][52059] Updated weights for policy 1, policy_version 67612 (0.0010) [2023-10-08 02:37:45,676][52059] Updated weights for policy 1, policy_version 67622 (0.0009) [2023-10-08 02:37:45,713][52060] Updated weights for policy 0, policy_version 66790 (0.0009) [2023-10-08 02:37:46,035][52059] Updated weights for policy 1, policy_version 67632 (0.0007) [2023-10-08 02:37:46,079][52060] Updated weights for policy 0, policy_version 66800 (0.0009) [2023-10-08 02:37:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 137625600. Throughput: 0: 1703.6, 1: 1730.8. Samples: 34425484. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 02:37:46,211][50642] Avg episode reward: [(0, '17.430'), (1, '21.410')] [2023-10-08 02:37:46,409][52059] Updated weights for policy 1, policy_version 67642 (0.0007) [2023-10-08 02:37:46,453][52060] Updated weights for policy 0, policy_version 66810 (0.0007) [2023-10-08 02:37:50,410][52059] Updated weights for policy 1, policy_version 67652 (0.0007) [2023-10-08 02:37:50,423][52060] Updated weights for policy 0, policy_version 66820 (0.0007) [2023-10-08 02:37:50,778][52059] Updated weights for policy 1, policy_version 67662 (0.0007) [2023-10-08 02:37:50,793][52060] Updated weights for policy 0, policy_version 66830 (0.0008) [2023-10-08 02:37:51,139][52059] Updated weights for policy 1, policy_version 67672 (0.0007) [2023-10-08 02:37:51,151][52060] Updated weights for policy 0, policy_version 66840 (0.0009) [2023-10-08 02:37:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 137691136. Throughput: 0: 1716.0, 1: 1735.8. Samples: 34435520. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 02:37:51,211][50642] Avg episode reward: [(0, '17.500'), (1, '18.820')] [2023-10-08 02:37:55,094][52059] Updated weights for policy 1, policy_version 67682 (0.0007) [2023-10-08 02:37:55,184][52060] Updated weights for policy 0, policy_version 66850 (0.0007) [2023-10-08 02:37:55,450][52059] Updated weights for policy 1, policy_version 67692 (0.0009) [2023-10-08 02:37:55,543][52060] Updated weights for policy 0, policy_version 66860 (0.0009) [2023-10-08 02:37:55,813][52059] Updated weights for policy 1, policy_version 67702 (0.0009) [2023-10-08 02:37:55,913][52060] Updated weights for policy 0, policy_version 66870 (0.0007) [2023-10-08 02:37:56,176][52059] Updated weights for policy 1, policy_version 67712 (0.0008) [2023-10-08 02:37:56,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 137789440. Throughput: 0: 1720.4, 1: 1738.2. Samples: 34457054. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 02:37:56,211][50642] Avg episode reward: [(0, '20.760'), (1, '21.350')] [2023-10-08 02:37:56,274][52060] Updated weights for policy 0, policy_version 66880 (0.0009) [2023-10-08 02:38:00,019][52059] Updated weights for policy 1, policy_version 67722 (0.0009) [2023-10-08 02:38:00,385][52059] Updated weights for policy 1, policy_version 67732 (0.0010) [2023-10-08 02:38:00,400][52060] Updated weights for policy 0, policy_version 66890 (0.0008) [2023-10-08 02:38:00,753][52059] Updated weights for policy 1, policy_version 67742 (0.0007) [2023-10-08 02:38:00,763][52060] Updated weights for policy 0, policy_version 66900 (0.0009) [2023-10-08 02:38:01,134][52060] Updated weights for policy 0, policy_version 66910 (0.0009) [2023-10-08 02:38:01,210][50642] Fps is (10 sec: 19660.8, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 137887744. Throughput: 0: 1695.0, 1: 1712.0. Samples: 34476168. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 02:38:01,211][50642] Avg episode reward: [(0, '16.840'), (1, '19.290')] [2023-10-08 02:38:04,761][52059] Updated weights for policy 1, policy_version 67752 (0.0007) [2023-10-08 02:38:05,137][52059] Updated weights for policy 1, policy_version 67762 (0.0007) [2023-10-08 02:38:05,165][52060] Updated weights for policy 0, policy_version 66920 (0.0007) [2023-10-08 02:38:05,508][52059] Updated weights for policy 1, policy_version 67772 (0.0008) [2023-10-08 02:38:05,539][52060] Updated weights for policy 0, policy_version 66930 (0.0008) [2023-10-08 02:38:05,908][52060] Updated weights for policy 0, policy_version 66940 (0.0008) [2023-10-08 02:38:06,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 137953280. Throughput: 0: 1713.0, 1: 1748.5. Samples: 34487836. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 02:38:06,211][50642] Avg episode reward: [(0, '17.730'), (1, '19.920')] [2023-10-08 02:38:09,340][52059] Updated weights for policy 1, policy_version 67782 (0.0008) [2023-10-08 02:38:09,691][52059] Updated weights for policy 1, policy_version 67792 (0.0009) [2023-10-08 02:38:09,881][52060] Updated weights for policy 0, policy_version 66950 (0.0009) [2023-10-08 02:38:10,059][52059] Updated weights for policy 1, policy_version 67802 (0.0010) [2023-10-08 02:38:10,252][52060] Updated weights for policy 0, policy_version 66960 (0.0009) [2023-10-08 02:38:10,618][52060] Updated weights for policy 0, policy_version 66970 (0.0008) [2023-10-08 02:38:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 138018816. Throughput: 0: 1703.4, 1: 1729.8. Samples: 34507918. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 02:38:11,211][50642] Avg episode reward: [(0, '20.050'), (1, '21.560')] [2023-10-08 02:38:13,948][52059] Updated weights for policy 1, policy_version 67812 (0.0009) [2023-10-08 02:38:14,307][52059] Updated weights for policy 1, policy_version 67822 (0.0008) [2023-10-08 02:38:14,576][52060] Updated weights for policy 0, policy_version 66980 (0.0009) [2023-10-08 02:38:14,668][52059] Updated weights for policy 1, policy_version 67832 (0.0007) [2023-10-08 02:38:14,953][52060] Updated weights for policy 0, policy_version 66990 (0.0007) [2023-10-08 02:38:15,321][52060] Updated weights for policy 0, policy_version 67000 (0.0008) [2023-10-08 02:38:16,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 138084352. Throughput: 0: 1672.7, 1: 1719.6. Samples: 34527442. Policy #0 lag: (min: 14.0, avg: 14.0, max: 14.0) [2023-10-08 02:38:16,211][50642] Avg episode reward: [(0, '19.370'), (1, '22.040')] [2023-10-08 02:38:18,728][52059] Updated weights for policy 1, policy_version 67842 (0.0007) [2023-10-08 02:38:19,093][52059] Updated weights for policy 1, policy_version 67852 (0.0010) [2023-10-08 02:38:19,190][52060] Updated weights for policy 0, policy_version 67010 (0.0010) [2023-10-08 02:38:19,447][52059] Updated weights for policy 1, policy_version 67862 (0.0009) [2023-10-08 02:38:19,552][52060] Updated weights for policy 0, policy_version 67020 (0.0009) [2023-10-08 02:38:19,818][52059] Updated weights for policy 1, policy_version 67872 (0.0007) [2023-10-08 02:38:19,923][52060] Updated weights for policy 0, policy_version 67030 (0.0008) [2023-10-08 02:38:20,292][52060] Updated weights for policy 0, policy_version 67040 (0.0010) [2023-10-08 02:38:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 138149888. Throughput: 0: 1707.2, 1: 1738.4. Samples: 34539242. Policy #0 lag: (min: 10.0, avg: 10.3, max: 21.0) [2023-10-08 02:38:21,211][50642] Avg episode reward: [(0, '16.840'), (1, '21.340')] [2023-10-08 02:38:23,686][52059] Updated weights for policy 1, policy_version 67882 (0.0008) [2023-10-08 02:38:24,055][52059] Updated weights for policy 1, policy_version 67892 (0.0007) [2023-10-08 02:38:24,400][52060] Updated weights for policy 0, policy_version 67050 (0.0008) [2023-10-08 02:38:24,417][52059] Updated weights for policy 1, policy_version 67902 (0.0007) [2023-10-08 02:38:24,776][52060] Updated weights for policy 0, policy_version 67060 (0.0008) [2023-10-08 02:38:25,153][52060] Updated weights for policy 0, policy_version 67070 (0.0007) [2023-10-08 02:38:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 138215424. Throughput: 0: 1691.5, 1: 1716.0. Samples: 34558576. Policy #0 lag: (min: 10.0, avg: 10.3, max: 21.0) [2023-10-08 02:38:26,211][50642] Avg episode reward: [(0, '19.050'), (1, '19.710')] [2023-10-08 02:38:28,472][52059] Updated weights for policy 1, policy_version 67912 (0.0008) [2023-10-08 02:38:28,835][52059] Updated weights for policy 1, policy_version 67922 (0.0009) [2023-10-08 02:38:29,193][52059] Updated weights for policy 1, policy_version 67932 (0.0008) [2023-10-08 02:38:29,198][52060] Updated weights for policy 0, policy_version 67080 (0.0008) [2023-10-08 02:38:29,564][52060] Updated weights for policy 0, policy_version 67090 (0.0010) [2023-10-08 02:38:29,937][52060] Updated weights for policy 0, policy_version 67100 (0.0010) [2023-10-08 02:38:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 138280960. Throughput: 0: 1692.0, 1: 1727.0. Samples: 34579338. Policy #0 lag: (min: 10.0, avg: 10.3, max: 21.0) [2023-10-08 02:38:31,211][50642] Avg episode reward: [(0, '17.980'), (1, '22.200')] [2023-10-08 02:38:33,013][52059] Updated weights for policy 1, policy_version 67942 (0.0008) [2023-10-08 02:38:33,376][52059] Updated weights for policy 1, policy_version 67952 (0.0010) [2023-10-08 02:38:33,739][52059] Updated weights for policy 1, policy_version 67962 (0.0009) [2023-10-08 02:38:33,972][52060] Updated weights for policy 0, policy_version 67110 (0.0008) [2023-10-08 02:38:34,346][52060] Updated weights for policy 0, policy_version 67120 (0.0010) [2023-10-08 02:38:34,709][52060] Updated weights for policy 0, policy_version 67130 (0.0008) [2023-10-08 02:38:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 138346496. Throughput: 0: 1712.3, 1: 1727.2. Samples: 34590300. Policy #0 lag: (min: 10.0, avg: 10.3, max: 21.0) [2023-10-08 02:38:36,211][50642] Avg episode reward: [(0, '19.560'), (1, '22.000')] [2023-10-08 02:38:37,605][52059] Updated weights for policy 1, policy_version 67972 (0.0008) [2023-10-08 02:38:37,975][52059] Updated weights for policy 1, policy_version 67982 (0.0007) [2023-10-08 02:38:38,336][52059] Updated weights for policy 1, policy_version 67992 (0.0007) [2023-10-08 02:38:38,768][52060] Updated weights for policy 0, policy_version 67140 (0.0008) [2023-10-08 02:38:39,141][52060] Updated weights for policy 0, policy_version 67150 (0.0008) [2023-10-08 02:38:39,519][52060] Updated weights for policy 0, policy_version 67160 (0.0008) [2023-10-08 02:38:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 138412032. Throughput: 0: 1681.1, 1: 1724.6. Samples: 34610310. Policy #0 lag: (min: 10.0, avg: 10.3, max: 21.0) [2023-10-08 02:38:41,211][50642] Avg episode reward: [(0, '18.880'), (1, '20.600')] [2023-10-08 02:38:42,244][52059] Updated weights for policy 1, policy_version 68002 (0.0007) [2023-10-08 02:38:42,614][52059] Updated weights for policy 1, policy_version 68012 (0.0009) [2023-10-08 02:38:42,974][52059] Updated weights for policy 1, policy_version 68022 (0.0009) [2023-10-08 02:38:43,337][52059] Updated weights for policy 1, policy_version 68032 (0.0010) [2023-10-08 02:38:43,534][52060] Updated weights for policy 0, policy_version 67170 (0.0010) [2023-10-08 02:38:43,910][52060] Updated weights for policy 0, policy_version 67180 (0.0008) [2023-10-08 02:38:44,286][52060] Updated weights for policy 0, policy_version 67190 (0.0007) [2023-10-08 02:38:44,649][52060] Updated weights for policy 0, policy_version 67200 (0.0008) [2023-10-08 02:38:46,211][50642] Fps is (10 sec: 13106.5, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 138477568. Throughput: 0: 1702.5, 1: 1749.6. Samples: 34631514. Policy #0 lag: (min: 10.0, avg: 10.3, max: 21.0) [2023-10-08 02:38:46,212][50642] Avg episode reward: [(0, '20.920'), (1, '22.280')] [2023-10-08 02:38:46,224][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000067200_68812800.pth... [2023-10-08 02:38:46,224][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000068032_69664768.pth... [2023-10-08 02:38:46,259][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000065600_67174400.pth [2023-10-08 02:38:46,260][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000066432_68026368.pth [2023-10-08 02:38:47,258][52059] Updated weights for policy 1, policy_version 68042 (0.0010) [2023-10-08 02:38:47,627][52059] Updated weights for policy 1, policy_version 68052 (0.0009) [2023-10-08 02:38:47,994][52059] Updated weights for policy 1, policy_version 68062 (0.0011) [2023-10-08 02:38:48,479][52060] Updated weights for policy 0, policy_version 67210 (0.0007) [2023-10-08 02:38:48,840][52060] Updated weights for policy 0, policy_version 67220 (0.0010) [2023-10-08 02:38:49,217][52060] Updated weights for policy 0, policy_version 67230 (0.0010) [2023-10-08 02:38:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 138543104. Throughput: 0: 1698.1, 1: 1714.2. Samples: 34641392. Policy #0 lag: (min: 10.0, avg: 10.3, max: 21.0) [2023-10-08 02:38:51,211][50642] Avg episode reward: [(0, '20.060'), (1, '19.820')] [2023-10-08 02:38:52,035][52059] Updated weights for policy 1, policy_version 68072 (0.0008) [2023-10-08 02:38:52,406][52059] Updated weights for policy 1, policy_version 68082 (0.0007) [2023-10-08 02:38:52,773][52059] Updated weights for policy 1, policy_version 68092 (0.0007) [2023-10-08 02:38:53,035][52060] Updated weights for policy 0, policy_version 67240 (0.0008) [2023-10-08 02:38:53,394][52060] Updated weights for policy 0, policy_version 67250 (0.0008) [2023-10-08 02:38:53,767][52060] Updated weights for policy 0, policy_version 67260 (0.0007) [2023-10-08 02:38:56,210][50642] Fps is (10 sec: 13107.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 138608640. Throughput: 0: 1692.5, 1: 1733.2. Samples: 34662078. Policy #0 lag: (min: 10.0, avg: 10.3, max: 21.0) [2023-10-08 02:38:56,211][50642] Avg episode reward: [(0, '19.890'), (1, '19.710')] [2023-10-08 02:38:56,655][52059] Updated weights for policy 1, policy_version 68102 (0.0009) [2023-10-08 02:38:57,021][52059] Updated weights for policy 1, policy_version 68112 (0.0010) [2023-10-08 02:38:57,377][52059] Updated weights for policy 1, policy_version 68122 (0.0010) [2023-10-08 02:38:57,809][52060] Updated weights for policy 0, policy_version 67270 (0.0010) [2023-10-08 02:38:58,169][52060] Updated weights for policy 0, policy_version 67280 (0.0010) [2023-10-08 02:38:58,535][52060] Updated weights for policy 0, policy_version 67290 (0.0009) [2023-10-08 02:39:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 138674176. Throughput: 0: 1718.9, 1: 1741.1. Samples: 34683142. Policy #0 lag: (min: 10.0, avg: 10.3, max: 21.0) [2023-10-08 02:39:01,211][50642] Avg episode reward: [(0, '19.280'), (1, '20.610')] [2023-10-08 02:39:01,384][52059] Updated weights for policy 1, policy_version 68132 (0.0009) [2023-10-08 02:39:01,755][52059] Updated weights for policy 1, policy_version 68142 (0.0008) [2023-10-08 02:39:02,117][52059] Updated weights for policy 1, policy_version 68152 (0.0009) [2023-10-08 02:39:02,482][52060] Updated weights for policy 0, policy_version 67300 (0.0010) [2023-10-08 02:39:02,847][52060] Updated weights for policy 0, policy_version 67310 (0.0007) [2023-10-08 02:39:03,218][52060] Updated weights for policy 0, policy_version 67320 (0.0007) [2023-10-08 02:39:06,103][52059] Updated weights for policy 1, policy_version 68162 (0.0008) [2023-10-08 02:39:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 138739712. Throughput: 0: 1687.2, 1: 1722.9. Samples: 34692696. Policy #0 lag: (min: 31.0, avg: 37.2, max: 63.0) [2023-10-08 02:39:06,211][50642] Avg episode reward: [(0, '18.210'), (1, '21.660')] [2023-10-08 02:39:06,464][52059] Updated weights for policy 1, policy_version 68172 (0.0009) [2023-10-08 02:39:06,836][52059] Updated weights for policy 1, policy_version 68182 (0.0008) [2023-10-08 02:39:07,202][52059] Updated weights for policy 1, policy_version 68192 (0.0008) [2023-10-08 02:39:07,382][52060] Updated weights for policy 0, policy_version 67330 (0.0009) [2023-10-08 02:39:07,739][52060] Updated weights for policy 0, policy_version 67340 (0.0010) [2023-10-08 02:39:08,106][52060] Updated weights for policy 0, policy_version 67350 (0.0011) [2023-10-08 02:39:08,474][52060] Updated weights for policy 0, policy_version 67360 (0.0008) [2023-10-08 02:39:10,819][52059] Updated weights for policy 1, policy_version 68202 (0.0010) [2023-10-08 02:39:11,181][52059] Updated weights for policy 1, policy_version 68212 (0.0011) [2023-10-08 02:39:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 138805248. Throughput: 0: 1703.4, 1: 1752.3. Samples: 34714080. Policy #0 lag: (min: 31.0, avg: 37.2, max: 63.0) [2023-10-08 02:39:11,211][50642] Avg episode reward: [(0, '21.260'), (1, '20.940')] [2023-10-08 02:39:11,550][52059] Updated weights for policy 1, policy_version 68222 (0.0009) [2023-10-08 02:39:12,735][52060] Updated weights for policy 0, policy_version 67370 (0.0008) [2023-10-08 02:39:13,120][52060] Updated weights for policy 0, policy_version 67380 (0.0009) [2023-10-08 02:39:13,481][52060] Updated weights for policy 0, policy_version 67390 (0.0010) [2023-10-08 02:39:15,449][52059] Updated weights for policy 1, policy_version 68232 (0.0008) [2023-10-08 02:39:15,812][52059] Updated weights for policy 1, policy_version 68242 (0.0009) [2023-10-08 02:39:16,184][52059] Updated weights for policy 1, policy_version 68252 (0.0007) [2023-10-08 02:39:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 138870784. Throughput: 0: 1708.6, 1: 1740.5. Samples: 34734550. Policy #0 lag: (min: 31.0, avg: 37.2, max: 63.0) [2023-10-08 02:39:16,211][50642] Avg episode reward: [(0, '19.740'), (1, '19.840')] [2023-10-08 02:39:17,361][52060] Updated weights for policy 0, policy_version 67400 (0.0008) [2023-10-08 02:39:17,739][52060] Updated weights for policy 0, policy_version 67410 (0.0007) [2023-10-08 02:39:18,104][52060] Updated weights for policy 0, policy_version 67420 (0.0007) [2023-10-08 02:39:20,069][52059] Updated weights for policy 1, policy_version 68262 (0.0009) [2023-10-08 02:39:20,432][52059] Updated weights for policy 1, policy_version 68272 (0.0008) [2023-10-08 02:39:20,800][52059] Updated weights for policy 1, policy_version 68282 (0.0008) [2023-10-08 02:39:21,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 138969088. Throughput: 0: 1680.4, 1: 1754.8. Samples: 34744880. Policy #0 lag: (min: 31.0, avg: 37.2, max: 63.0) [2023-10-08 02:39:21,211][50642] Avg episode reward: [(0, '19.250'), (1, '22.560')] [2023-10-08 02:39:22,063][52060] Updated weights for policy 0, policy_version 67430 (0.0010) [2023-10-08 02:39:22,439][52060] Updated weights for policy 0, policy_version 67440 (0.0009) [2023-10-08 02:39:22,806][52060] Updated weights for policy 0, policy_version 67450 (0.0008) [2023-10-08 02:39:24,768][52059] Updated weights for policy 1, policy_version 68292 (0.0008) [2023-10-08 02:39:25,125][52059] Updated weights for policy 1, policy_version 68302 (0.0008) [2023-10-08 02:39:25,497][52059] Updated weights for policy 1, policy_version 68312 (0.0009) [2023-10-08 02:39:26,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 139034624. Throughput: 0: 1711.1, 1: 1749.2. Samples: 34766022. Policy #0 lag: (min: 31.0, avg: 37.2, max: 63.0) [2023-10-08 02:39:26,211][50642] Avg episode reward: [(0, '21.550'), (1, '21.890')] [2023-10-08 02:39:26,827][52060] Updated weights for policy 0, policy_version 67460 (0.0009) [2023-10-08 02:39:27,198][52060] Updated weights for policy 0, policy_version 67470 (0.0007) [2023-10-08 02:39:27,576][52060] Updated weights for policy 0, policy_version 67480 (0.0008) [2023-10-08 02:39:29,360][52059] Updated weights for policy 1, policy_version 68322 (0.0008) [2023-10-08 02:39:29,728][52059] Updated weights for policy 1, policy_version 68332 (0.0009) [2023-10-08 02:39:30,097][52059] Updated weights for policy 1, policy_version 68342 (0.0008) [2023-10-08 02:39:30,469][52059] Updated weights for policy 1, policy_version 68352 (0.0008) [2023-10-08 02:39:31,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 139100160. Throughput: 0: 1719.8, 1: 1724.3. Samples: 34786500. Policy #0 lag: (min: 31.0, avg: 37.2, max: 63.0) [2023-10-08 02:39:31,211][50642] Avg episode reward: [(0, '19.170'), (1, '20.500')] [2023-10-08 02:39:31,463][52060] Updated weights for policy 0, policy_version 67490 (0.0008) [2023-10-08 02:39:31,839][52060] Updated weights for policy 0, policy_version 67500 (0.0010) [2023-10-08 02:39:32,212][52060] Updated weights for policy 0, policy_version 67510 (0.0010) [2023-10-08 02:39:32,581][52060] Updated weights for policy 0, policy_version 67520 (0.0009) [2023-10-08 02:39:34,173][52059] Updated weights for policy 1, policy_version 68362 (0.0009) [2023-10-08 02:39:34,537][52059] Updated weights for policy 1, policy_version 68372 (0.0008) [2023-10-08 02:39:34,899][52059] Updated weights for policy 1, policy_version 68382 (0.0007) [2023-10-08 02:39:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 139165696. Throughput: 0: 1704.0, 1: 1761.9. Samples: 34797360. Policy #0 lag: (min: 31.0, avg: 37.2, max: 63.0) [2023-10-08 02:39:36,211][50642] Avg episode reward: [(0, '18.420'), (1, '20.480')] [2023-10-08 02:39:36,395][52060] Updated weights for policy 0, policy_version 67530 (0.0008) [2023-10-08 02:39:36,767][52060] Updated weights for policy 0, policy_version 67540 (0.0010) [2023-10-08 02:39:37,144][52060] Updated weights for policy 0, policy_version 67550 (0.0007) [2023-10-08 02:39:38,879][52059] Updated weights for policy 1, policy_version 68392 (0.0009) [2023-10-08 02:39:39,247][52059] Updated weights for policy 1, policy_version 68402 (0.0010) [2023-10-08 02:39:39,612][52059] Updated weights for policy 1, policy_version 68412 (0.0007) [2023-10-08 02:39:40,999][52060] Updated weights for policy 0, policy_version 67560 (0.0008) [2023-10-08 02:39:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 139231232. Throughput: 0: 1713.4, 1: 1732.2. Samples: 34817128. Policy #0 lag: (min: 31.0, avg: 37.2, max: 63.0) [2023-10-08 02:39:41,211][50642] Avg episode reward: [(0, '19.960'), (1, '20.630')] [2023-10-08 02:39:41,378][52060] Updated weights for policy 0, policy_version 67570 (0.0010) [2023-10-08 02:39:41,744][52060] Updated weights for policy 0, policy_version 67580 (0.0009) [2023-10-08 02:39:43,464][52059] Updated weights for policy 1, policy_version 68422 (0.0008) [2023-10-08 02:39:43,828][52059] Updated weights for policy 1, policy_version 68432 (0.0007) [2023-10-08 02:39:44,198][52059] Updated weights for policy 1, policy_version 68442 (0.0009) [2023-10-08 02:39:45,534][52060] Updated weights for policy 0, policy_version 67590 (0.0010) [2023-10-08 02:39:45,904][52060] Updated weights for policy 0, policy_version 67600 (0.0008) [2023-10-08 02:39:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 139296768. Throughput: 0: 1714.0, 1: 1734.7. Samples: 34838334. Policy #0 lag: (min: 31.0, avg: 37.2, max: 63.0) [2023-10-08 02:39:46,211][50642] Avg episode reward: [(0, '19.500'), (1, '22.450')] [2023-10-08 02:39:46,276][52060] Updated weights for policy 0, policy_version 67610 (0.0007) [2023-10-08 02:39:48,132][52059] Updated weights for policy 1, policy_version 68452 (0.0008) [2023-10-08 02:39:48,509][52059] Updated weights for policy 1, policy_version 68462 (0.0009) [2023-10-08 02:39:48,868][52059] Updated weights for policy 1, policy_version 68472 (0.0008) [2023-10-08 02:39:50,266][52060] Updated weights for policy 0, policy_version 67620 (0.0008) [2023-10-08 02:39:50,636][52060] Updated weights for policy 0, policy_version 67630 (0.0011) [2023-10-08 02:39:51,004][52060] Updated weights for policy 0, policy_version 67640 (0.0007) [2023-10-08 02:39:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 139362304. Throughput: 0: 1726.8, 1: 1743.2. Samples: 34848848. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:39:51,211][50642] Avg episode reward: [(0, '20.440'), (1, '22.510')] [2023-10-08 02:39:52,798][52059] Updated weights for policy 1, policy_version 68482 (0.0008) [2023-10-08 02:39:53,175][52059] Updated weights for policy 1, policy_version 68492 (0.0010) [2023-10-08 02:39:53,535][52059] Updated weights for policy 1, policy_version 68502 (0.0007) [2023-10-08 02:39:53,903][52059] Updated weights for policy 1, policy_version 68512 (0.0008) [2023-10-08 02:39:54,983][52060] Updated weights for policy 0, policy_version 67650 (0.0008) [2023-10-08 02:39:55,359][52060] Updated weights for policy 0, policy_version 67660 (0.0010) [2023-10-08 02:39:55,720][52060] Updated weights for policy 0, policy_version 67670 (0.0010) [2023-10-08 02:39:56,084][52060] Updated weights for policy 0, policy_version 67680 (0.0008) [2023-10-08 02:39:56,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 139460608. Throughput: 0: 1730.8, 1: 1725.3. Samples: 34869608. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:39:56,211][50642] Avg episode reward: [(0, '19.760'), (1, '20.080')] [2023-10-08 02:39:57,745][52059] Updated weights for policy 1, policy_version 68522 (0.0008) [2023-10-08 02:39:58,111][52059] Updated weights for policy 1, policy_version 68532 (0.0009) [2023-10-08 02:39:58,477][52059] Updated weights for policy 1, policy_version 68542 (0.0007) [2023-10-08 02:40:00,106][52060] Updated weights for policy 0, policy_version 67690 (0.0008) [2023-10-08 02:40:00,471][52060] Updated weights for policy 0, policy_version 67700 (0.0009) [2023-10-08 02:40:00,845][52060] Updated weights for policy 0, policy_version 67710 (0.0010) [2023-10-08 02:40:01,210][50642] Fps is (10 sec: 16383.4, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 139526144. Throughput: 0: 1708.1, 1: 1742.5. Samples: 34889828. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:40:01,211][50642] Avg episode reward: [(0, '19.190'), (1, '21.290')] [2023-10-08 02:40:02,505][52059] Updated weights for policy 1, policy_version 68552 (0.0009) [2023-10-08 02:40:02,869][52059] Updated weights for policy 1, policy_version 68562 (0.0007) [2023-10-08 02:40:03,228][52059] Updated weights for policy 1, policy_version 68572 (0.0007) [2023-10-08 02:40:04,814][52060] Updated weights for policy 0, policy_version 67720 (0.0009) [2023-10-08 02:40:05,196][52060] Updated weights for policy 0, policy_version 67730 (0.0009) [2023-10-08 02:40:05,551][52060] Updated weights for policy 0, policy_version 67740 (0.0007) [2023-10-08 02:40:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 139591680. Throughput: 0: 1735.4, 1: 1721.4. Samples: 34900434. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:40:06,211][50642] Avg episode reward: [(0, '17.890'), (1, '22.640')] [2023-10-08 02:40:07,140][52059] Updated weights for policy 1, policy_version 68582 (0.0009) [2023-10-08 02:40:07,503][52059] Updated weights for policy 1, policy_version 68592 (0.0008) [2023-10-08 02:40:07,863][52059] Updated weights for policy 1, policy_version 68602 (0.0007) [2023-10-08 02:40:09,704][52060] Updated weights for policy 0, policy_version 67750 (0.0007) [2023-10-08 02:40:10,075][52060] Updated weights for policy 0, policy_version 67760 (0.0009) [2023-10-08 02:40:10,443][52060] Updated weights for policy 0, policy_version 67770 (0.0009) [2023-10-08 02:40:11,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 139657216. Throughput: 0: 1725.3, 1: 1725.6. Samples: 34921316. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:40:11,211][50642] Avg episode reward: [(0, '18.630'), (1, '22.680')] [2023-10-08 02:40:11,801][52059] Updated weights for policy 1, policy_version 68612 (0.0008) [2023-10-08 02:40:12,162][52059] Updated weights for policy 1, policy_version 68622 (0.0009) [2023-10-08 02:40:12,532][52059] Updated weights for policy 1, policy_version 68632 (0.0009) [2023-10-08 02:40:14,320][52060] Updated weights for policy 0, policy_version 67780 (0.0007) [2023-10-08 02:40:14,682][52060] Updated weights for policy 0, policy_version 67790 (0.0007) [2023-10-08 02:40:15,053][52060] Updated weights for policy 0, policy_version 67800 (0.0008) [2023-10-08 02:40:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 139722752. Throughput: 0: 1701.5, 1: 1751.8. Samples: 34941898. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:40:16,211][50642] Avg episode reward: [(0, '21.060'), (1, '20.130')] [2023-10-08 02:40:16,503][52059] Updated weights for policy 1, policy_version 68642 (0.0008) [2023-10-08 02:40:16,875][52059] Updated weights for policy 1, policy_version 68652 (0.0008) [2023-10-08 02:40:17,253][52059] Updated weights for policy 1, policy_version 68662 (0.0009) [2023-10-08 02:40:17,618][52059] Updated weights for policy 1, policy_version 68672 (0.0009) [2023-10-08 02:40:18,874][52060] Updated weights for policy 0, policy_version 67810 (0.0009) [2023-10-08 02:40:19,244][52060] Updated weights for policy 0, policy_version 67820 (0.0008) [2023-10-08 02:40:19,604][52060] Updated weights for policy 0, policy_version 67830 (0.0011) [2023-10-08 02:40:19,973][52060] Updated weights for policy 0, policy_version 67840 (0.0010) [2023-10-08 02:40:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 139788288. Throughput: 0: 1735.8, 1: 1715.7. Samples: 34952678. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:40:21,211][50642] Avg episode reward: [(0, '18.680'), (1, '24.060')] [2023-10-08 02:40:21,487][52059] Updated weights for policy 1, policy_version 68682 (0.0008) [2023-10-08 02:40:21,848][52059] Updated weights for policy 1, policy_version 68692 (0.0009) [2023-10-08 02:40:22,213][52059] Updated weights for policy 1, policy_version 68702 (0.0008) [2023-10-08 02:40:23,979][52060] Updated weights for policy 0, policy_version 67850 (0.0008) [2023-10-08 02:40:24,358][52060] Updated weights for policy 0, policy_version 67860 (0.0010) [2023-10-08 02:40:24,718][52060] Updated weights for policy 0, policy_version 67870 (0.0010) [2023-10-08 02:40:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 139853824. Throughput: 0: 1708.0, 1: 1750.0. Samples: 34972742. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:40:26,211][50642] Avg episode reward: [(0, '20.180'), (1, '23.300')] [2023-10-08 02:40:26,253][52059] Updated weights for policy 1, policy_version 68712 (0.0008) [2023-10-08 02:40:26,631][52059] Updated weights for policy 1, policy_version 68722 (0.0007) [2023-10-08 02:40:26,998][52059] Updated weights for policy 1, policy_version 68732 (0.0008) [2023-10-08 02:40:28,605][52060] Updated weights for policy 0, policy_version 67880 (0.0007) [2023-10-08 02:40:28,972][52060] Updated weights for policy 0, policy_version 67890 (0.0008) [2023-10-08 02:40:29,331][52060] Updated weights for policy 0, policy_version 67900 (0.0009) [2023-10-08 02:40:30,860][52059] Updated weights for policy 1, policy_version 68742 (0.0010) [2023-10-08 02:40:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 139919360. Throughput: 0: 1716.3, 1: 1742.4. Samples: 34993974. Policy #0 lag: (min: 10.0, avg: 10.0, max: 11.0) [2023-10-08 02:40:31,211][50642] Avg episode reward: [(0, '20.020'), (1, '21.710')] [2023-10-08 02:40:31,233][52059] Updated weights for policy 1, policy_version 68752 (0.0010) [2023-10-08 02:40:31,601][52059] Updated weights for policy 1, policy_version 68762 (0.0007) [2023-10-08 02:40:33,420][52060] Updated weights for policy 0, policy_version 67910 (0.0009) [2023-10-08 02:40:33,781][52060] Updated weights for policy 0, policy_version 67920 (0.0008) [2023-10-08 02:40:34,155][52060] Updated weights for policy 0, policy_version 67930 (0.0010) [2023-10-08 02:40:35,445][52059] Updated weights for policy 1, policy_version 68772 (0.0007) [2023-10-08 02:40:35,802][52059] Updated weights for policy 1, policy_version 68782 (0.0007) [2023-10-08 02:40:36,163][52059] Updated weights for policy 1, policy_version 68792 (0.0007) [2023-10-08 02:40:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 139984896. Throughput: 0: 1714.1, 1: 1735.9. Samples: 35004098. Policy #0 lag: (min: 10.0, avg: 10.0, max: 11.0) [2023-10-08 02:40:36,211][50642] Avg episode reward: [(0, '20.350'), (1, '20.250')] [2023-10-08 02:40:38,132][52060] Updated weights for policy 0, policy_version 67940 (0.0008) [2023-10-08 02:40:38,495][52060] Updated weights for policy 0, policy_version 67950 (0.0009) [2023-10-08 02:40:38,866][52060] Updated weights for policy 0, policy_version 67960 (0.0009) [2023-10-08 02:40:40,138][52059] Updated weights for policy 1, policy_version 68802 (0.0008) [2023-10-08 02:40:40,505][52059] Updated weights for policy 1, policy_version 68812 (0.0011) [2023-10-08 02:40:40,862][52059] Updated weights for policy 1, policy_version 68822 (0.0010) [2023-10-08 02:40:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 140050432. Throughput: 0: 1699.3, 1: 1751.2. Samples: 35024880. Policy #0 lag: (min: 10.0, avg: 10.0, max: 11.0) [2023-10-08 02:40:41,211][50642] Avg episode reward: [(0, '18.840'), (1, '21.020')] [2023-10-08 02:40:41,226][52059] Updated weights for policy 1, policy_version 68832 (0.0010) [2023-10-08 02:40:42,930][52060] Updated weights for policy 0, policy_version 67970 (0.0010) [2023-10-08 02:40:43,297][52060] Updated weights for policy 0, policy_version 67980 (0.0008) [2023-10-08 02:40:43,666][52060] Updated weights for policy 0, policy_version 67990 (0.0008) [2023-10-08 02:40:44,039][52060] Updated weights for policy 0, policy_version 68000 (0.0008) [2023-10-08 02:40:45,181][52059] Updated weights for policy 1, policy_version 68842 (0.0008) [2023-10-08 02:40:45,555][52059] Updated weights for policy 1, policy_version 68852 (0.0009) [2023-10-08 02:40:45,923][52059] Updated weights for policy 1, policy_version 68862 (0.0010) [2023-10-08 02:40:46,210][50642] Fps is (10 sec: 16383.4, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 140148736. Throughput: 0: 1723.8, 1: 1719.9. Samples: 35044796. Policy #0 lag: (min: 10.0, avg: 10.0, max: 11.0) [2023-10-08 02:40:46,211][50642] Avg episode reward: [(0, '18.140'), (1, '21.740')] [2023-10-08 02:40:46,225][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000068000_69632000.pth... [2023-10-08 02:40:46,225][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000068864_70516736.pth... [2023-10-08 02:40:46,259][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000067232_68845568.pth [2023-10-08 02:40:46,270][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000066400_67993600.pth [2023-10-08 02:40:47,987][52060] Updated weights for policy 0, policy_version 68010 (0.0010) [2023-10-08 02:40:48,350][52060] Updated weights for policy 0, policy_version 68020 (0.0011) [2023-10-08 02:40:48,717][52060] Updated weights for policy 0, policy_version 68030 (0.0009) [2023-10-08 02:40:49,769][52059] Updated weights for policy 1, policy_version 68872 (0.0011) [2023-10-08 02:40:50,143][52059] Updated weights for policy 1, policy_version 68882 (0.0011) [2023-10-08 02:40:50,503][52059] Updated weights for policy 1, policy_version 68892 (0.0010) [2023-10-08 02:40:51,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 140214272. Throughput: 0: 1697.5, 1: 1744.2. Samples: 35055312. Policy #0 lag: (min: 10.0, avg: 10.0, max: 11.0) [2023-10-08 02:40:51,211][50642] Avg episode reward: [(0, '21.490'), (1, '21.130')] [2023-10-08 02:40:52,688][52060] Updated weights for policy 0, policy_version 68040 (0.0008) [2023-10-08 02:40:53,052][52060] Updated weights for policy 0, policy_version 68050 (0.0008) [2023-10-08 02:40:53,425][52060] Updated weights for policy 0, policy_version 68060 (0.0008) [2023-10-08 02:40:54,525][52059] Updated weights for policy 1, policy_version 68902 (0.0010) [2023-10-08 02:40:54,879][52059] Updated weights for policy 1, policy_version 68912 (0.0007) [2023-10-08 02:40:55,249][52059] Updated weights for policy 1, policy_version 68922 (0.0008) [2023-10-08 02:40:56,210][50642] Fps is (10 sec: 13107.7, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 140279808. Throughput: 0: 1708.3, 1: 1729.7. Samples: 35076024. Policy #0 lag: (min: 10.0, avg: 10.0, max: 11.0) [2023-10-08 02:40:56,211][50642] Avg episode reward: [(0, '21.010'), (1, '21.550')] [2023-10-08 02:40:57,412][52060] Updated weights for policy 0, policy_version 68070 (0.0009) [2023-10-08 02:40:57,769][52060] Updated weights for policy 0, policy_version 68080 (0.0008) [2023-10-08 02:40:58,146][52060] Updated weights for policy 0, policy_version 68090 (0.0007) [2023-10-08 02:40:59,099][52059] Updated weights for policy 1, policy_version 68932 (0.0009) [2023-10-08 02:40:59,468][52059] Updated weights for policy 1, policy_version 68942 (0.0010) [2023-10-08 02:40:59,826][52059] Updated weights for policy 1, policy_version 68952 (0.0010) [2023-10-08 02:41:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 140345344. Throughput: 0: 1727.2, 1: 1717.5. Samples: 35096912. Policy #0 lag: (min: 10.0, avg: 10.0, max: 11.0) [2023-10-08 02:41:01,211][50642] Avg episode reward: [(0, '19.470'), (1, '23.360')] [2023-10-08 02:41:02,079][52060] Updated weights for policy 0, policy_version 68100 (0.0008) [2023-10-08 02:41:02,452][52060] Updated weights for policy 0, policy_version 68110 (0.0008) [2023-10-08 02:41:02,823][52060] Updated weights for policy 0, policy_version 68120 (0.0008) [2023-10-08 02:41:03,783][52059] Updated weights for policy 1, policy_version 68962 (0.0010) [2023-10-08 02:41:04,154][52059] Updated weights for policy 1, policy_version 68972 (0.0007) [2023-10-08 02:41:04,516][52059] Updated weights for policy 1, policy_version 68982 (0.0010) [2023-10-08 02:41:04,884][52059] Updated weights for policy 1, policy_version 68992 (0.0008) [2023-10-08 02:41:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 140410880. Throughput: 0: 1694.1, 1: 1745.0. Samples: 35107436. Policy #0 lag: (min: 10.0, avg: 10.0, max: 11.0) [2023-10-08 02:41:06,211][50642] Avg episode reward: [(0, '20.830'), (1, '19.110')] [2023-10-08 02:41:06,801][52060] Updated weights for policy 0, policy_version 68130 (0.0008) [2023-10-08 02:41:07,163][52060] Updated weights for policy 0, policy_version 68140 (0.0009) [2023-10-08 02:41:07,536][52060] Updated weights for policy 0, policy_version 68150 (0.0009) [2023-10-08 02:41:07,903][52060] Updated weights for policy 0, policy_version 68160 (0.0009) [2023-10-08 02:41:08,638][52059] Updated weights for policy 1, policy_version 69002 (0.0008) [2023-10-08 02:41:09,008][52059] Updated weights for policy 1, policy_version 69012 (0.0009) [2023-10-08 02:41:09,378][52059] Updated weights for policy 1, policy_version 69022 (0.0008) [2023-10-08 02:41:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 140476416. Throughput: 0: 1725.6, 1: 1722.8. Samples: 35127922. Policy #0 lag: (min: 10.0, avg: 10.0, max: 11.0) [2023-10-08 02:41:11,211][50642] Avg episode reward: [(0, '19.360'), (1, '19.910')] [2023-10-08 02:41:11,613][52060] Updated weights for policy 0, policy_version 68170 (0.0009) [2023-10-08 02:41:11,983][52060] Updated weights for policy 0, policy_version 68180 (0.0009) [2023-10-08 02:41:12,346][52060] Updated weights for policy 0, policy_version 68190 (0.0009) [2023-10-08 02:41:13,286][52059] Updated weights for policy 1, policy_version 69032 (0.0007) [2023-10-08 02:41:13,668][52059] Updated weights for policy 1, policy_version 69042 (0.0007) [2023-10-08 02:41:14,041][52059] Updated weights for policy 1, policy_version 69052 (0.0010) [2023-10-08 02:41:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 140541952. Throughput: 0: 1722.2, 1: 1726.4. Samples: 35149162. Policy #0 lag: (min: 19.0, avg: 19.1, max: 26.0) [2023-10-08 02:41:16,211][50642] Avg episode reward: [(0, '19.410'), (1, '21.480')] [2023-10-08 02:41:16,489][52060] Updated weights for policy 0, policy_version 68200 (0.0008) [2023-10-08 02:41:16,852][52060] Updated weights for policy 0, policy_version 68210 (0.0010) [2023-10-08 02:41:17,218][52060] Updated weights for policy 0, policy_version 68220 (0.0011) [2023-10-08 02:41:17,883][52059] Updated weights for policy 1, policy_version 69062 (0.0007) [2023-10-08 02:41:18,246][52059] Updated weights for policy 1, policy_version 69072 (0.0008) [2023-10-08 02:41:18,608][52059] Updated weights for policy 1, policy_version 69082 (0.0007) [2023-10-08 02:41:21,155][52060] Updated weights for policy 0, policy_version 68230 (0.0009) [2023-10-08 02:41:21,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 140607488. Throughput: 0: 1710.9, 1: 1724.8. Samples: 35158706. Policy #0 lag: (min: 19.0, avg: 19.1, max: 26.0) [2023-10-08 02:41:21,211][50642] Avg episode reward: [(0, '20.270'), (1, '22.090')] [2023-10-08 02:41:21,516][52060] Updated weights for policy 0, policy_version 68240 (0.0007) [2023-10-08 02:41:21,884][52060] Updated weights for policy 0, policy_version 68250 (0.0007) [2023-10-08 02:41:22,620][52059] Updated weights for policy 1, policy_version 69092 (0.0008) [2023-10-08 02:41:22,986][52059] Updated weights for policy 1, policy_version 69102 (0.0007) [2023-10-08 02:41:23,357][52059] Updated weights for policy 1, policy_version 69112 (0.0007) [2023-10-08 02:41:25,793][52060] Updated weights for policy 0, policy_version 68260 (0.0007) [2023-10-08 02:41:26,162][52060] Updated weights for policy 0, policy_version 68270 (0.0008) [2023-10-08 02:41:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 140673024. Throughput: 0: 1730.8, 1: 1722.6. Samples: 35180282. Policy #0 lag: (min: 19.0, avg: 19.1, max: 26.0) [2023-10-08 02:41:26,211][50642] Avg episode reward: [(0, '20.810'), (1, '18.690')] [2023-10-08 02:41:26,524][52060] Updated weights for policy 0, policy_version 68280 (0.0008) [2023-10-08 02:41:27,182][52059] Updated weights for policy 1, policy_version 69122 (0.0008) [2023-10-08 02:41:27,550][52059] Updated weights for policy 1, policy_version 69132 (0.0009) [2023-10-08 02:41:27,915][52059] Updated weights for policy 1, policy_version 69142 (0.0008) [2023-10-08 02:41:28,271][52059] Updated weights for policy 1, policy_version 69152 (0.0009) [2023-10-08 02:41:30,405][52060] Updated weights for policy 0, policy_version 68290 (0.0008) [2023-10-08 02:41:30,772][52060] Updated weights for policy 0, policy_version 68300 (0.0009) [2023-10-08 02:41:31,138][52060] Updated weights for policy 0, policy_version 68310 (0.0009) [2023-10-08 02:41:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 140738560. Throughput: 0: 1721.5, 1: 1751.8. Samples: 35201094. Policy #0 lag: (min: 19.0, avg: 19.1, max: 26.0) [2023-10-08 02:41:31,211][50642] Avg episode reward: [(0, '18.510'), (1, '22.540')] [2023-10-08 02:41:31,504][52060] Updated weights for policy 0, policy_version 68320 (0.0008) [2023-10-08 02:41:32,274][52059] Updated weights for policy 1, policy_version 69162 (0.0009) [2023-10-08 02:41:32,643][52059] Updated weights for policy 1, policy_version 69172 (0.0008) [2023-10-08 02:41:33,001][52059] Updated weights for policy 1, policy_version 69182 (0.0008) [2023-10-08 02:41:35,677][52060] Updated weights for policy 0, policy_version 68330 (0.0011) [2023-10-08 02:41:36,051][52060] Updated weights for policy 0, policy_version 68340 (0.0008) [2023-10-08 02:41:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 140804096. Throughput: 0: 1733.4, 1: 1726.9. Samples: 35211026. Policy #0 lag: (min: 19.0, avg: 19.1, max: 26.0) [2023-10-08 02:41:36,211][50642] Avg episode reward: [(0, '20.380'), (1, '16.870')] [2023-10-08 02:41:36,426][52060] Updated weights for policy 0, policy_version 68350 (0.0009) [2023-10-08 02:41:36,968][52059] Updated weights for policy 1, policy_version 69192 (0.0008) [2023-10-08 02:41:37,338][52059] Updated weights for policy 1, policy_version 69202 (0.0008) [2023-10-08 02:41:37,706][52059] Updated weights for policy 1, policy_version 69212 (0.0008) [2023-10-08 02:41:40,391][52060] Updated weights for policy 0, policy_version 68360 (0.0010) [2023-10-08 02:41:40,756][52060] Updated weights for policy 0, policy_version 68370 (0.0008) [2023-10-08 02:41:41,128][52060] Updated weights for policy 0, policy_version 68380 (0.0008) [2023-10-08 02:41:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 140869632. Throughput: 0: 1730.9, 1: 1744.9. Samples: 35232434. Policy #0 lag: (min: 19.0, avg: 19.1, max: 26.0) [2023-10-08 02:41:41,211][50642] Avg episode reward: [(0, '20.250'), (1, '18.430')] [2023-10-08 02:41:41,720][52059] Updated weights for policy 1, policy_version 69222 (0.0009) [2023-10-08 02:41:42,086][52059] Updated weights for policy 1, policy_version 69232 (0.0007) [2023-10-08 02:41:42,446][52059] Updated weights for policy 1, policy_version 69242 (0.0008) [2023-10-08 02:41:44,989][52060] Updated weights for policy 0, policy_version 68390 (0.0008) [2023-10-08 02:41:45,369][52060] Updated weights for policy 0, policy_version 68400 (0.0009) [2023-10-08 02:41:45,737][52060] Updated weights for policy 0, policy_version 68410 (0.0008) [2023-10-08 02:41:46,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 140967936. Throughput: 0: 1707.9, 1: 1757.0. Samples: 35252830. Policy #0 lag: (min: 19.0, avg: 19.1, max: 26.0) [2023-10-08 02:41:46,211][50642] Avg episode reward: [(0, '18.910'), (1, '18.880')] [2023-10-08 02:41:46,213][52059] Updated weights for policy 1, policy_version 69252 (0.0008) [2023-10-08 02:41:46,572][52059] Updated weights for policy 1, policy_version 69262 (0.0007) [2023-10-08 02:41:46,937][52059] Updated weights for policy 1, policy_version 69272 (0.0008) [2023-10-08 02:41:49,813][52060] Updated weights for policy 0, policy_version 68420 (0.0008) [2023-10-08 02:41:50,180][52060] Updated weights for policy 0, policy_version 68430 (0.0008) [2023-10-08 02:41:50,553][52060] Updated weights for policy 0, policy_version 68440 (0.0009) [2023-10-08 02:41:50,849][52059] Updated weights for policy 1, policy_version 69282 (0.0011) [2023-10-08 02:41:51,210][50642] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 141033472. Throughput: 0: 1731.1, 1: 1732.0. Samples: 35263276. Policy #0 lag: (min: 19.0, avg: 19.1, max: 26.0) [2023-10-08 02:41:51,211][50642] Avg episode reward: [(0, '19.510'), (1, '20.500')] [2023-10-08 02:41:51,221][52059] Updated weights for policy 1, policy_version 69292 (0.0009) [2023-10-08 02:41:51,574][52059] Updated weights for policy 1, policy_version 69302 (0.0009) [2023-10-08 02:41:51,932][52059] Updated weights for policy 1, policy_version 69312 (0.0011) [2023-10-08 02:41:54,459][52060] Updated weights for policy 0, policy_version 68450 (0.0009) [2023-10-08 02:41:54,822][52060] Updated weights for policy 0, policy_version 68460 (0.0009) [2023-10-08 02:41:55,188][52060] Updated weights for policy 0, policy_version 68470 (0.0009) [2023-10-08 02:41:55,562][52060] Updated weights for policy 0, policy_version 68480 (0.0008) [2023-10-08 02:41:55,836][52059] Updated weights for policy 1, policy_version 69322 (0.0009) [2023-10-08 02:41:56,207][52059] Updated weights for policy 1, policy_version 69332 (0.0008) [2023-10-08 02:41:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 141099008. Throughput: 0: 1721.8, 1: 1750.1. Samples: 35284156. Policy #0 lag: (min: 31.0, avg: 33.4, max: 63.0) [2023-10-08 02:41:56,211][50642] Avg episode reward: [(0, '19.660'), (1, '20.060')] [2023-10-08 02:41:56,562][52059] Updated weights for policy 1, policy_version 69342 (0.0007) [2023-10-08 02:41:59,405][52060] Updated weights for policy 0, policy_version 68490 (0.0007) [2023-10-08 02:41:59,773][52060] Updated weights for policy 0, policy_version 68500 (0.0009) [2023-10-08 02:42:00,142][52060] Updated weights for policy 0, policy_version 68510 (0.0009) [2023-10-08 02:42:00,670][52059] Updated weights for policy 1, policy_version 69352 (0.0007) [2023-10-08 02:42:01,040][52059] Updated weights for policy 1, policy_version 69362 (0.0007) [2023-10-08 02:42:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 141164544. Throughput: 0: 1704.5, 1: 1736.2. Samples: 35303992. Policy #0 lag: (min: 31.0, avg: 33.4, max: 63.0) [2023-10-08 02:42:01,211][50642] Avg episode reward: [(0, '20.180'), (1, '20.780')] [2023-10-08 02:42:01,405][52059] Updated weights for policy 1, policy_version 69372 (0.0007) [2023-10-08 02:42:04,125][52060] Updated weights for policy 0, policy_version 68520 (0.0010) [2023-10-08 02:42:04,498][52060] Updated weights for policy 0, policy_version 68530 (0.0012) [2023-10-08 02:42:04,864][52060] Updated weights for policy 0, policy_version 68540 (0.0011) [2023-10-08 02:42:05,192][52059] Updated weights for policy 1, policy_version 69382 (0.0008) [2023-10-08 02:42:05,558][52059] Updated weights for policy 1, policy_version 69392 (0.0009) [2023-10-08 02:42:05,927][52059] Updated weights for policy 1, policy_version 69402 (0.0008) [2023-10-08 02:42:06,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 141262848. Throughput: 0: 1734.3, 1: 1746.9. Samples: 35315362. Policy #0 lag: (min: 31.0, avg: 33.4, max: 63.0) [2023-10-08 02:42:06,211][50642] Avg episode reward: [(0, '18.820'), (1, '20.760')] [2023-10-08 02:42:08,752][52060] Updated weights for policy 0, policy_version 68550 (0.0008) [2023-10-08 02:42:09,127][52060] Updated weights for policy 0, policy_version 68560 (0.0008) [2023-10-08 02:42:09,509][52060] Updated weights for policy 0, policy_version 68570 (0.0009) [2023-10-08 02:42:09,778][52059] Updated weights for policy 1, policy_version 69412 (0.0008) [2023-10-08 02:42:10,143][52059] Updated weights for policy 1, policy_version 69422 (0.0009) [2023-10-08 02:42:10,499][52059] Updated weights for policy 1, policy_version 69432 (0.0009) [2023-10-08 02:42:11,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 141328384. Throughput: 0: 1703.6, 1: 1745.5. Samples: 35335490. Policy #0 lag: (min: 31.0, avg: 33.4, max: 63.0) [2023-10-08 02:42:11,211][50642] Avg episode reward: [(0, '20.150'), (1, '18.830')] [2023-10-08 02:42:13,386][52060] Updated weights for policy 0, policy_version 68580 (0.0008) [2023-10-08 02:42:13,755][52060] Updated weights for policy 0, policy_version 68590 (0.0007) [2023-10-08 02:42:14,123][52060] Updated weights for policy 0, policy_version 68600 (0.0010) [2023-10-08 02:42:14,299][52059] Updated weights for policy 1, policy_version 69442 (0.0010) [2023-10-08 02:42:14,655][52059] Updated weights for policy 1, policy_version 69452 (0.0007) [2023-10-08 02:42:15,019][52059] Updated weights for policy 1, policy_version 69462 (0.0009) [2023-10-08 02:42:15,374][52059] Updated weights for policy 1, policy_version 69472 (0.0008) [2023-10-08 02:42:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 141393920. Throughput: 0: 1723.7, 1: 1725.2. Samples: 35356296. Policy #0 lag: (min: 31.0, avg: 33.4, max: 63.0) [2023-10-08 02:42:16,211][50642] Avg episode reward: [(0, '21.040'), (1, '19.640')] [2023-10-08 02:42:18,032][52060] Updated weights for policy 0, policy_version 68610 (0.0009) [2023-10-08 02:42:18,402][52060] Updated weights for policy 0, policy_version 68620 (0.0010) [2023-10-08 02:42:18,774][52060] Updated weights for policy 0, policy_version 68630 (0.0008) [2023-10-08 02:42:19,140][52060] Updated weights for policy 0, policy_version 68640 (0.0007) [2023-10-08 02:42:19,345][52059] Updated weights for policy 1, policy_version 69482 (0.0007) [2023-10-08 02:42:19,712][52059] Updated weights for policy 1, policy_version 69492 (0.0007) [2023-10-08 02:42:20,076][52059] Updated weights for policy 1, policy_version 69502 (0.0007) [2023-10-08 02:42:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 141459456. Throughput: 0: 1719.0, 1: 1757.0. Samples: 35367444. Policy #0 lag: (min: 31.0, avg: 33.4, max: 63.0) [2023-10-08 02:42:21,211][50642] Avg episode reward: [(0, '19.350'), (1, '21.410')] [2023-10-08 02:42:23,180][52060] Updated weights for policy 0, policy_version 68650 (0.0009) [2023-10-08 02:42:23,547][52060] Updated weights for policy 0, policy_version 68660 (0.0011) [2023-10-08 02:42:23,921][52060] Updated weights for policy 0, policy_version 68670 (0.0009) [2023-10-08 02:42:24,125][52059] Updated weights for policy 1, policy_version 69512 (0.0010) [2023-10-08 02:42:24,498][52059] Updated weights for policy 1, policy_version 69522 (0.0010) [2023-10-08 02:42:24,858][52059] Updated weights for policy 1, policy_version 69532 (0.0010) [2023-10-08 02:42:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 141524992. Throughput: 0: 1705.0, 1: 1727.3. Samples: 35386890. Policy #0 lag: (min: 31.0, avg: 33.4, max: 63.0) [2023-10-08 02:42:26,211][50642] Avg episode reward: [(0, '17.940'), (1, '19.600')] [2023-10-08 02:42:28,040][52060] Updated weights for policy 0, policy_version 68680 (0.0009) [2023-10-08 02:42:28,407][52060] Updated weights for policy 0, policy_version 68690 (0.0008) [2023-10-08 02:42:28,779][52060] Updated weights for policy 0, policy_version 68700 (0.0008) [2023-10-08 02:42:28,850][52059] Updated weights for policy 1, policy_version 69542 (0.0007) [2023-10-08 02:42:29,212][52059] Updated weights for policy 1, policy_version 69552 (0.0008) [2023-10-08 02:42:29,586][52059] Updated weights for policy 1, policy_version 69562 (0.0008) [2023-10-08 02:42:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 141590528. Throughput: 0: 1721.3, 1: 1722.0. Samples: 35407780. Policy #0 lag: (min: 31.0, avg: 33.4, max: 63.0) [2023-10-08 02:42:31,211][50642] Avg episode reward: [(0, '21.110'), (1, '20.600')] [2023-10-08 02:42:32,674][52060] Updated weights for policy 0, policy_version 68710 (0.0008) [2023-10-08 02:42:33,041][52060] Updated weights for policy 0, policy_version 68720 (0.0009) [2023-10-08 02:42:33,397][52059] Updated weights for policy 1, policy_version 69572 (0.0007) [2023-10-08 02:42:33,405][52060] Updated weights for policy 0, policy_version 68730 (0.0008) [2023-10-08 02:42:33,760][52059] Updated weights for policy 1, policy_version 69582 (0.0009) [2023-10-08 02:42:34,128][52059] Updated weights for policy 1, policy_version 69592 (0.0010) [2023-10-08 02:42:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 141656064. Throughput: 0: 1696.4, 1: 1737.7. Samples: 35417808. Policy #0 lag: (min: 31.0, avg: 33.4, max: 63.0) [2023-10-08 02:42:36,211][50642] Avg episode reward: [(0, '19.800'), (1, '22.030')] [2023-10-08 02:42:37,239][52060] Updated weights for policy 0, policy_version 68740 (0.0009) [2023-10-08 02:42:37,619][52060] Updated weights for policy 0, policy_version 68750 (0.0009) [2023-10-08 02:42:37,975][52059] Updated weights for policy 1, policy_version 69602 (0.0009) [2023-10-08 02:42:37,988][52060] Updated weights for policy 0, policy_version 68760 (0.0009) [2023-10-08 02:42:38,339][52059] Updated weights for policy 1, policy_version 69612 (0.0008) [2023-10-08 02:42:38,720][52059] Updated weights for policy 1, policy_version 69622 (0.0008) [2023-10-08 02:42:39,082][52059] Updated weights for policy 1, policy_version 69632 (0.0009) [2023-10-08 02:42:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 141721600. Throughput: 0: 1706.1, 1: 1728.9. Samples: 35438732. Policy #0 lag: (min: 31.0, avg: 33.4, max: 63.0) [2023-10-08 02:42:41,211][50642] Avg episode reward: [(0, '18.270'), (1, '20.850')] [2023-10-08 02:42:42,047][52060] Updated weights for policy 0, policy_version 68770 (0.0008) [2023-10-08 02:42:42,423][52060] Updated weights for policy 0, policy_version 68780 (0.0007) [2023-10-08 02:42:42,789][52060] Updated weights for policy 0, policy_version 68790 (0.0007) [2023-10-08 02:42:42,955][52059] Updated weights for policy 1, policy_version 69642 (0.0007) [2023-10-08 02:42:43,169][52060] Updated weights for policy 0, policy_version 68800 (0.0007) [2023-10-08 02:42:43,315][52059] Updated weights for policy 1, policy_version 69652 (0.0009) [2023-10-08 02:42:43,686][52059] Updated weights for policy 1, policy_version 69662 (0.0008) [2023-10-08 02:42:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 141787136. Throughput: 0: 1724.4, 1: 1750.1. Samples: 35460344. Policy #0 lag: (min: 24.0, avg: 42.0, max: 56.0) [2023-10-08 02:42:46,211][50642] Avg episode reward: [(0, '18.210'), (1, '19.900')] [2023-10-08 02:42:46,218][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000069664_71335936.pth... [2023-10-08 02:42:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000068800_70451200.pth... [2023-10-08 02:42:46,248][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000068032_69664768.pth [2023-10-08 02:42:46,257][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000067200_68812800.pth [2023-10-08 02:42:47,179][52060] Updated weights for policy 0, policy_version 68810 (0.0009) [2023-10-08 02:42:47,493][52059] Updated weights for policy 1, policy_version 69672 (0.0007) [2023-10-08 02:42:47,555][52060] Updated weights for policy 0, policy_version 68820 (0.0010) [2023-10-08 02:42:47,878][52059] Updated weights for policy 1, policy_version 69682 (0.0008) [2023-10-08 02:42:47,919][52060] Updated weights for policy 0, policy_version 68830 (0.0010) [2023-10-08 02:42:48,238][52059] Updated weights for policy 1, policy_version 69692 (0.0009) [2023-10-08 02:42:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 141852672. Throughput: 0: 1692.0, 1: 1731.6. Samples: 35469424. Policy #0 lag: (min: 24.0, avg: 42.0, max: 56.0) [2023-10-08 02:42:51,211][50642] Avg episode reward: [(0, '21.220'), (1, '19.910')] [2023-10-08 02:42:52,034][52060] Updated weights for policy 0, policy_version 68840 (0.0009) [2023-10-08 02:42:52,252][52059] Updated weights for policy 1, policy_version 69702 (0.0007) [2023-10-08 02:42:52,401][52060] Updated weights for policy 0, policy_version 68850 (0.0007) [2023-10-08 02:42:52,620][52059] Updated weights for policy 1, policy_version 69712 (0.0007) [2023-10-08 02:42:52,772][52060] Updated weights for policy 0, policy_version 68860 (0.0007) [2023-10-08 02:42:52,978][52059] Updated weights for policy 1, policy_version 69722 (0.0009) [2023-10-08 02:42:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 141918208. Throughput: 0: 1713.9, 1: 1729.5. Samples: 35490444. Policy #0 lag: (min: 24.0, avg: 42.0, max: 56.0) [2023-10-08 02:42:56,211][50642] Avg episode reward: [(0, '19.040'), (1, '23.350')] [2023-10-08 02:42:56,763][52060] Updated weights for policy 0, policy_version 68870 (0.0010) [2023-10-08 02:42:56,917][52059] Updated weights for policy 1, policy_version 69732 (0.0009) [2023-10-08 02:42:57,134][52060] Updated weights for policy 0, policy_version 68880 (0.0007) [2023-10-08 02:42:57,279][52059] Updated weights for policy 1, policy_version 69742 (0.0008) [2023-10-08 02:42:57,500][52060] Updated weights for policy 0, policy_version 68890 (0.0010) [2023-10-08 02:42:57,652][52059] Updated weights for policy 1, policy_version 69752 (0.0008) [2023-10-08 02:43:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 141983744. Throughput: 0: 1711.5, 1: 1743.4. Samples: 35511766. Policy #0 lag: (min: 24.0, avg: 42.0, max: 56.0) [2023-10-08 02:43:01,211][50642] Avg episode reward: [(0, '18.970'), (1, '22.570')] [2023-10-08 02:43:01,411][52060] Updated weights for policy 0, policy_version 68900 (0.0007) [2023-10-08 02:43:01,628][52059] Updated weights for policy 1, policy_version 69762 (0.0009) [2023-10-08 02:43:01,776][52060] Updated weights for policy 0, policy_version 68910 (0.0007) [2023-10-08 02:43:01,995][52059] Updated weights for policy 1, policy_version 69772 (0.0007) [2023-10-08 02:43:02,142][52060] Updated weights for policy 0, policy_version 68920 (0.0008) [2023-10-08 02:43:02,359][52059] Updated weights for policy 1, policy_version 69782 (0.0007) [2023-10-08 02:43:02,719][52059] Updated weights for policy 1, policy_version 69792 (0.0007) [2023-10-08 02:43:06,066][52060] Updated weights for policy 0, policy_version 68930 (0.0008) [2023-10-08 02:43:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 142049280. Throughput: 0: 1705.2, 1: 1715.2. Samples: 35521362. Policy #0 lag: (min: 24.0, avg: 42.0, max: 56.0) [2023-10-08 02:43:06,211][50642] Avg episode reward: [(0, '21.450'), (1, '22.010')] [2023-10-08 02:43:06,427][52060] Updated weights for policy 0, policy_version 68940 (0.0009) [2023-10-08 02:43:06,620][52059] Updated weights for policy 1, policy_version 69802 (0.0009) [2023-10-08 02:43:06,789][52060] Updated weights for policy 0, policy_version 68950 (0.0007) [2023-10-08 02:43:06,989][52059] Updated weights for policy 1, policy_version 69812 (0.0007) [2023-10-08 02:43:07,166][52060] Updated weights for policy 0, policy_version 68960 (0.0008) [2023-10-08 02:43:07,349][52059] Updated weights for policy 1, policy_version 69822 (0.0011) [2023-10-08 02:43:11,036][52060] Updated weights for policy 0, policy_version 68970 (0.0007) [2023-10-08 02:43:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 142114816. Throughput: 0: 1725.0, 1: 1735.8. Samples: 35542626. Policy #0 lag: (min: 24.0, avg: 42.0, max: 56.0) [2023-10-08 02:43:11,211][50642] Avg episode reward: [(0, '20.400'), (1, '22.040')] [2023-10-08 02:43:11,407][52060] Updated weights for policy 0, policy_version 68980 (0.0008) [2023-10-08 02:43:11,442][52059] Updated weights for policy 1, policy_version 69832 (0.0011) [2023-10-08 02:43:11,776][52060] Updated weights for policy 0, policy_version 68990 (0.0008) [2023-10-08 02:43:11,794][52059] Updated weights for policy 1, policy_version 69842 (0.0007) [2023-10-08 02:43:12,168][52059] Updated weights for policy 1, policy_version 69852 (0.0010) [2023-10-08 02:43:15,803][52060] Updated weights for policy 0, policy_version 69000 (0.0008) [2023-10-08 02:43:16,018][52059] Updated weights for policy 1, policy_version 69862 (0.0009) [2023-10-08 02:43:16,176][52060] Updated weights for policy 0, policy_version 69010 (0.0007) [2023-10-08 02:43:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 142180352. Throughput: 0: 1717.1, 1: 1739.3. Samples: 35563318. Policy #0 lag: (min: 24.0, avg: 42.0, max: 56.0) [2023-10-08 02:43:16,211][50642] Avg episode reward: [(0, '18.240'), (1, '21.910')] [2023-10-08 02:43:16,384][52059] Updated weights for policy 1, policy_version 69872 (0.0007) [2023-10-08 02:43:16,542][52060] Updated weights for policy 0, policy_version 69020 (0.0008) [2023-10-08 02:43:16,754][52059] Updated weights for policy 1, policy_version 69882 (0.0007) [2023-10-08 02:43:20,520][52060] Updated weights for policy 0, policy_version 69030 (0.0008) [2023-10-08 02:43:20,777][52059] Updated weights for policy 1, policy_version 69892 (0.0009) [2023-10-08 02:43:20,892][52060] Updated weights for policy 0, policy_version 69040 (0.0009) [2023-10-08 02:43:21,147][52059] Updated weights for policy 1, policy_version 69902 (0.0009) [2023-10-08 02:43:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 142245888. Throughput: 0: 1725.5, 1: 1723.2. Samples: 35572996. Policy #0 lag: (min: 24.0, avg: 42.0, max: 56.0) [2023-10-08 02:43:21,211][50642] Avg episode reward: [(0, '18.630'), (1, '19.440')] [2023-10-08 02:43:21,250][52060] Updated weights for policy 0, policy_version 69050 (0.0007) [2023-10-08 02:43:21,517][52059] Updated weights for policy 1, policy_version 69912 (0.0007) [2023-10-08 02:43:25,255][52060] Updated weights for policy 0, policy_version 69060 (0.0009) [2023-10-08 02:43:25,366][52059] Updated weights for policy 1, policy_version 69922 (0.0010) [2023-10-08 02:43:25,616][52060] Updated weights for policy 0, policy_version 69070 (0.0009) [2023-10-08 02:43:25,736][52059] Updated weights for policy 1, policy_version 69932 (0.0008) [2023-10-08 02:43:25,985][52060] Updated weights for policy 0, policy_version 69080 (0.0007) [2023-10-08 02:43:26,092][52059] Updated weights for policy 1, policy_version 69942 (0.0007) [2023-10-08 02:43:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 142311424. Throughput: 0: 1722.0, 1: 1735.0. Samples: 35594298. Policy #0 lag: (min: 24.0, avg: 42.0, max: 56.0) [2023-10-08 02:43:26,211][50642] Avg episode reward: [(0, '21.710'), (1, '21.110')] [2023-10-08 02:43:26,460][52059] Updated weights for policy 1, policy_version 69952 (0.0010) [2023-10-08 02:43:30,164][52060] Updated weights for policy 0, policy_version 69090 (0.0007) [2023-10-08 02:43:30,501][52059] Updated weights for policy 1, policy_version 69962 (0.0007) [2023-10-08 02:43:30,521][52060] Updated weights for policy 0, policy_version 69100 (0.0007) [2023-10-08 02:43:30,872][52059] Updated weights for policy 1, policy_version 69972 (0.0007) [2023-10-08 02:43:30,878][52060] Updated weights for policy 0, policy_version 69110 (0.0009) [2023-10-08 02:43:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 142376960. Throughput: 0: 1696.9, 1: 1710.8. Samples: 35613688. Policy #0 lag: (min: 31.0, avg: 31.2, max: 40.0) [2023-10-08 02:43:31,211][50642] Avg episode reward: [(0, '16.590'), (1, '21.860')] [2023-10-08 02:43:31,228][52059] Updated weights for policy 1, policy_version 69982 (0.0008) [2023-10-08 02:43:31,254][52060] Updated weights for policy 0, policy_version 69120 (0.0009) [2023-10-08 02:43:35,142][52059] Updated weights for policy 1, policy_version 69992 (0.0007) [2023-10-08 02:43:35,231][52060] Updated weights for policy 0, policy_version 69130 (0.0009) [2023-10-08 02:43:35,505][52059] Updated weights for policy 1, policy_version 70002 (0.0008) [2023-10-08 02:43:35,594][52060] Updated weights for policy 0, policy_version 69140 (0.0008) [2023-10-08 02:43:35,863][52059] Updated weights for policy 1, policy_version 70012 (0.0008) [2023-10-08 02:43:35,950][52060] Updated weights for policy 0, policy_version 69150 (0.0009) [2023-10-08 02:43:36,210][50642] Fps is (10 sec: 19660.9, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 142508032. Throughput: 0: 1718.9, 1: 1730.9. Samples: 35624664. Policy #0 lag: (min: 31.0, avg: 31.2, max: 40.0) [2023-10-08 02:43:36,211][50642] Avg episode reward: [(0, '17.670'), (1, '22.670')] [2023-10-08 02:43:39,790][52059] Updated weights for policy 1, policy_version 70022 (0.0009) [2023-10-08 02:43:39,912][52060] Updated weights for policy 0, policy_version 69160 (0.0007) [2023-10-08 02:43:40,155][52059] Updated weights for policy 1, policy_version 70032 (0.0007) [2023-10-08 02:43:40,273][52060] Updated weights for policy 0, policy_version 69170 (0.0008) [2023-10-08 02:43:40,527][52059] Updated weights for policy 1, policy_version 70042 (0.0008) [2023-10-08 02:43:40,638][52060] Updated weights for policy 0, policy_version 69180 (0.0009) [2023-10-08 02:43:41,210][50642] Fps is (10 sec: 19660.7, 60 sec: 14199.4, 300 sec: 13884.8). Total num frames: 142573568. Throughput: 0: 1717.2, 1: 1723.6. Samples: 35645284. Policy #0 lag: (min: 31.0, avg: 31.2, max: 40.0) [2023-10-08 02:43:41,211][50642] Avg episode reward: [(0, '23.500'), (1, '19.380')] [2023-10-08 02:43:44,449][52059] Updated weights for policy 1, policy_version 70052 (0.0008) [2023-10-08 02:43:44,649][52060] Updated weights for policy 0, policy_version 69190 (0.0009) [2023-10-08 02:43:44,798][52059] Updated weights for policy 1, policy_version 70062 (0.0009) [2023-10-08 02:43:45,004][52060] Updated weights for policy 0, policy_version 69200 (0.0007) [2023-10-08 02:43:45,164][52059] Updated weights for policy 1, policy_version 70072 (0.0010) [2023-10-08 02:43:45,370][52060] Updated weights for policy 0, policy_version 69210 (0.0008) [2023-10-08 02:43:46,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 142639104. Throughput: 0: 1689.9, 1: 1706.8. Samples: 35664618. Policy #0 lag: (min: 31.0, avg: 31.2, max: 40.0) [2023-10-08 02:43:46,211][50642] Avg episode reward: [(0, '17.060'), (1, '20.360')] [2023-10-08 02:43:49,229][52059] Updated weights for policy 1, policy_version 70082 (0.0007) [2023-10-08 02:43:49,538][52060] Updated weights for policy 0, policy_version 69220 (0.0010) [2023-10-08 02:43:49,601][52059] Updated weights for policy 1, policy_version 70092 (0.0008) [2023-10-08 02:43:49,907][52060] Updated weights for policy 0, policy_version 69230 (0.0008) [2023-10-08 02:43:49,964][52059] Updated weights for policy 1, policy_version 70102 (0.0007) [2023-10-08 02:43:50,273][52060] Updated weights for policy 0, policy_version 69240 (0.0007) [2023-10-08 02:43:50,330][52059] Updated weights for policy 1, policy_version 70112 (0.0007) [2023-10-08 02:43:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 142704640. Throughput: 0: 1717.6, 1: 1731.3. Samples: 35676564. Policy #0 lag: (min: 31.0, avg: 31.2, max: 40.0) [2023-10-08 02:43:51,211][50642] Avg episode reward: [(0, '16.810'), (1, '22.030')] [2023-10-08 02:43:54,305][52060] Updated weights for policy 0, policy_version 69250 (0.0008) [2023-10-08 02:43:54,305][52059] Updated weights for policy 1, policy_version 70122 (0.0008) [2023-10-08 02:43:54,662][52059] Updated weights for policy 1, policy_version 70132 (0.0008) [2023-10-08 02:43:54,678][52060] Updated weights for policy 0, policy_version 69260 (0.0009) [2023-10-08 02:43:55,029][52059] Updated weights for policy 1, policy_version 70142 (0.0007) [2023-10-08 02:43:55,052][52060] Updated weights for policy 0, policy_version 69270 (0.0007) [2023-10-08 02:43:55,421][52060] Updated weights for policy 0, policy_version 69280 (0.0008) [2023-10-08 02:43:56,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 142770176. Throughput: 0: 1696.3, 1: 1712.1. Samples: 35696002. Policy #0 lag: (min: 31.0, avg: 31.2, max: 40.0) [2023-10-08 02:43:56,211][50642] Avg episode reward: [(0, '22.090'), (1, '22.900')] [2023-10-08 02:43:59,167][52059] Updated weights for policy 1, policy_version 70152 (0.0008) [2023-10-08 02:43:59,267][52060] Updated weights for policy 0, policy_version 69290 (0.0008) [2023-10-08 02:43:59,528][52059] Updated weights for policy 1, policy_version 70162 (0.0007) [2023-10-08 02:43:59,632][52060] Updated weights for policy 0, policy_version 69300 (0.0007) [2023-10-08 02:43:59,889][52059] Updated weights for policy 1, policy_version 70172 (0.0009) [2023-10-08 02:44:00,012][52060] Updated weights for policy 0, policy_version 69310 (0.0009) [2023-10-08 02:44:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 142835712. Throughput: 0: 1691.9, 1: 1702.4. Samples: 35716058. Policy #0 lag: (min: 31.0, avg: 31.2, max: 40.0) [2023-10-08 02:44:01,211][50642] Avg episode reward: [(0, '17.950'), (1, '20.350')] [2023-10-08 02:44:03,840][52059] Updated weights for policy 1, policy_version 70182 (0.0008) [2023-10-08 02:44:04,186][52060] Updated weights for policy 0, policy_version 69320 (0.0009) [2023-10-08 02:44:04,202][52059] Updated weights for policy 1, policy_version 70192 (0.0007) [2023-10-08 02:44:04,560][52060] Updated weights for policy 0, policy_version 69330 (0.0008) [2023-10-08 02:44:04,564][52059] Updated weights for policy 1, policy_version 70202 (0.0010) [2023-10-08 02:44:04,927][52060] Updated weights for policy 0, policy_version 69340 (0.0007) [2023-10-08 02:44:06,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 142901248. Throughput: 0: 1714.9, 1: 1727.2. Samples: 35727890. Policy #0 lag: (min: 31.0, avg: 31.2, max: 40.0) [2023-10-08 02:44:06,211][50642] Avg episode reward: [(0, '16.610'), (1, '20.260')] [2023-10-08 02:44:08,579][52059] Updated weights for policy 1, policy_version 70212 (0.0008) [2023-10-08 02:44:08,920][52060] Updated weights for policy 0, policy_version 69350 (0.0009) [2023-10-08 02:44:08,946][52059] Updated weights for policy 1, policy_version 70222 (0.0007) [2023-10-08 02:44:09,295][52060] Updated weights for policy 0, policy_version 69360 (0.0009) [2023-10-08 02:44:09,311][52059] Updated weights for policy 1, policy_version 70232 (0.0009) [2023-10-08 02:44:09,666][52060] Updated weights for policy 0, policy_version 69370 (0.0009) [2023-10-08 02:44:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 142966784. Throughput: 0: 1685.6, 1: 1700.4. Samples: 35746672. Policy #0 lag: (min: 6.0, avg: 13.5, max: 38.0) [2023-10-08 02:44:11,211][50642] Avg episode reward: [(0, '21.710'), (1, '21.110')] [2023-10-08 02:44:13,219][52059] Updated weights for policy 1, policy_version 70242 (0.0008) [2023-10-08 02:44:13,596][52059] Updated weights for policy 1, policy_version 70252 (0.0008) [2023-10-08 02:44:13,601][52060] Updated weights for policy 0, policy_version 69380 (0.0009) [2023-10-08 02:44:13,963][52059] Updated weights for policy 1, policy_version 70262 (0.0007) [2023-10-08 02:44:13,975][52060] Updated weights for policy 0, policy_version 69390 (0.0009) [2023-10-08 02:44:14,330][52060] Updated weights for policy 0, policy_version 69400 (0.0009) [2023-10-08 02:44:14,331][52059] Updated weights for policy 1, policy_version 70272 (0.0008) [2023-10-08 02:44:16,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 143032320. Throughput: 0: 1706.9, 1: 1718.7. Samples: 35767840. Policy #0 lag: (min: 6.0, avg: 13.5, max: 38.0) [2023-10-08 02:44:16,211][50642] Avg episode reward: [(0, '19.830'), (1, '22.760')] [2023-10-08 02:44:18,064][52060] Updated weights for policy 0, policy_version 69410 (0.0008) [2023-10-08 02:44:18,295][52059] Updated weights for policy 1, policy_version 70282 (0.0009) [2023-10-08 02:44:18,428][52060] Updated weights for policy 0, policy_version 69420 (0.0008) [2023-10-08 02:44:18,650][52059] Updated weights for policy 1, policy_version 70292 (0.0009) [2023-10-08 02:44:18,791][52060] Updated weights for policy 0, policy_version 69430 (0.0010) [2023-10-08 02:44:19,021][52059] Updated weights for policy 1, policy_version 70302 (0.0008) [2023-10-08 02:44:19,163][52060] Updated weights for policy 0, policy_version 69440 (0.0009) [2023-10-08 02:44:21,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 143097856. Throughput: 0: 1699.2, 1: 1706.9. Samples: 35777942. Policy #0 lag: (min: 6.0, avg: 13.5, max: 38.0) [2023-10-08 02:44:21,211][50642] Avg episode reward: [(0, '17.830'), (1, '20.860')] [2023-10-08 02:44:22,975][52059] Updated weights for policy 1, policy_version 70312 (0.0007) [2023-10-08 02:44:23,240][52060] Updated weights for policy 0, policy_version 69450 (0.0010) [2023-10-08 02:44:23,329][52059] Updated weights for policy 1, policy_version 70322 (0.0008) [2023-10-08 02:44:23,599][52060] Updated weights for policy 0, policy_version 69460 (0.0008) [2023-10-08 02:44:23,689][52059] Updated weights for policy 1, policy_version 70332 (0.0008) [2023-10-08 02:44:23,966][52060] Updated weights for policy 0, policy_version 69470 (0.0008) [2023-10-08 02:44:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 143163392. Throughput: 0: 1686.6, 1: 1712.6. Samples: 35798250. Policy #0 lag: (min: 6.0, avg: 13.5, max: 38.0) [2023-10-08 02:44:26,211][50642] Avg episode reward: [(0, '21.050'), (1, '20.120')] [2023-10-08 02:44:27,605][52059] Updated weights for policy 1, policy_version 70342 (0.0009) [2023-10-08 02:44:27,979][52059] Updated weights for policy 1, policy_version 70352 (0.0009) [2023-10-08 02:44:28,056][52060] Updated weights for policy 0, policy_version 69480 (0.0009) [2023-10-08 02:44:28,347][52059] Updated weights for policy 1, policy_version 70362 (0.0007) [2023-10-08 02:44:28,430][52060] Updated weights for policy 0, policy_version 69490 (0.0008) [2023-10-08 02:44:28,790][52060] Updated weights for policy 0, policy_version 69500 (0.0009) [2023-10-08 02:44:31,211][50642] Fps is (10 sec: 13106.7, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 143228928. Throughput: 0: 1707.6, 1: 1729.0. Samples: 35819266. Policy #0 lag: (min: 6.0, avg: 13.5, max: 38.0) [2023-10-08 02:44:31,212][50642] Avg episode reward: [(0, '19.060'), (1, '22.280')] [2023-10-08 02:44:32,367][52059] Updated weights for policy 1, policy_version 70372 (0.0008) [2023-10-08 02:44:32,733][52059] Updated weights for policy 1, policy_version 70382 (0.0008) [2023-10-08 02:44:32,767][52060] Updated weights for policy 0, policy_version 69510 (0.0007) [2023-10-08 02:44:33,100][52059] Updated weights for policy 1, policy_version 70392 (0.0008) [2023-10-08 02:44:33,137][52060] Updated weights for policy 0, policy_version 69520 (0.0009) [2023-10-08 02:44:33,507][52060] Updated weights for policy 0, policy_version 69530 (0.0008) [2023-10-08 02:44:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 143294464. Throughput: 0: 1675.5, 1: 1701.1. Samples: 35828512. Policy #0 lag: (min: 6.0, avg: 13.5, max: 38.0) [2023-10-08 02:44:36,211][50642] Avg episode reward: [(0, '17.850'), (1, '20.490')] [2023-10-08 02:44:36,997][52059] Updated weights for policy 1, policy_version 70402 (0.0008) [2023-10-08 02:44:37,354][52059] Updated weights for policy 1, policy_version 70412 (0.0007) [2023-10-08 02:44:37,639][52060] Updated weights for policy 0, policy_version 69540 (0.0009) [2023-10-08 02:44:37,713][52059] Updated weights for policy 1, policy_version 70422 (0.0008) [2023-10-08 02:44:38,013][52060] Updated weights for policy 0, policy_version 69550 (0.0008) [2023-10-08 02:44:38,081][52059] Updated weights for policy 1, policy_version 70432 (0.0009) [2023-10-08 02:44:38,386][52060] Updated weights for policy 0, policy_version 69560 (0.0008) [2023-10-08 02:44:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 143360000. Throughput: 0: 1689.1, 1: 1729.3. Samples: 35849834. Policy #0 lag: (min: 6.0, avg: 13.5, max: 38.0) [2023-10-08 02:44:41,211][50642] Avg episode reward: [(0, '20.250'), (1, '20.890')] [2023-10-08 02:44:42,151][52059] Updated weights for policy 1, policy_version 70442 (0.0009) [2023-10-08 02:44:42,361][52060] Updated weights for policy 0, policy_version 69570 (0.0010) [2023-10-08 02:44:42,509][52059] Updated weights for policy 1, policy_version 70452 (0.0008) [2023-10-08 02:44:42,731][52060] Updated weights for policy 0, policy_version 69580 (0.0008) [2023-10-08 02:44:42,870][52059] Updated weights for policy 1, policy_version 70462 (0.0008) [2023-10-08 02:44:43,096][52060] Updated weights for policy 0, policy_version 69590 (0.0009) [2023-10-08 02:44:43,465][52060] Updated weights for policy 0, policy_version 69600 (0.0010) [2023-10-08 02:44:46,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 143425536. Throughput: 0: 1709.7, 1: 1736.9. Samples: 35871156. Policy #0 lag: (min: 6.0, avg: 13.5, max: 38.0) [2023-10-08 02:44:46,211][50642] Avg episode reward: [(0, '21.700'), (1, '20.470')] [2023-10-08 02:44:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000070464_72155136.pth... [2023-10-08 02:44:46,222][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000069600_71270400.pth... [2023-10-08 02:44:46,258][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000068000_69632000.pth [2023-10-08 02:44:46,261][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000068864_70516736.pth [2023-10-08 02:44:46,787][52059] Updated weights for policy 1, policy_version 70472 (0.0008) [2023-10-08 02:44:47,153][52059] Updated weights for policy 1, policy_version 70482 (0.0008) [2023-10-08 02:44:47,358][52060] Updated weights for policy 0, policy_version 69610 (0.0008) [2023-10-08 02:44:47,515][52059] Updated weights for policy 1, policy_version 70492 (0.0009) [2023-10-08 02:44:47,725][52060] Updated weights for policy 0, policy_version 69620 (0.0008) [2023-10-08 02:44:48,086][52060] Updated weights for policy 0, policy_version 69630 (0.0011) [2023-10-08 02:44:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 143491072. Throughput: 0: 1680.7, 1: 1713.0. Samples: 35880606. Policy #0 lag: (min: 6.0, avg: 13.5, max: 38.0) [2023-10-08 02:44:51,211][50642] Avg episode reward: [(0, '18.300'), (1, '20.920')] [2023-10-08 02:44:51,535][52059] Updated weights for policy 1, policy_version 70502 (0.0007) [2023-10-08 02:44:51,905][52059] Updated weights for policy 1, policy_version 70512 (0.0009) [2023-10-08 02:44:52,247][52060] Updated weights for policy 0, policy_version 69640 (0.0010) [2023-10-08 02:44:52,275][52059] Updated weights for policy 1, policy_version 70522 (0.0007) [2023-10-08 02:44:52,607][52060] Updated weights for policy 0, policy_version 69650 (0.0007) [2023-10-08 02:44:52,979][52060] Updated weights for policy 0, policy_version 69660 (0.0007) [2023-10-08 02:44:56,055][52059] Updated weights for policy 1, policy_version 70532 (0.0010) [2023-10-08 02:44:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 143556608. Throughput: 0: 1704.4, 1: 1737.3. Samples: 35901548. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:44:56,211][50642] Avg episode reward: [(0, '19.990'), (1, '19.140')] [2023-10-08 02:44:56,421][52059] Updated weights for policy 1, policy_version 70542 (0.0008) [2023-10-08 02:44:56,795][52059] Updated weights for policy 1, policy_version 70552 (0.0007) [2023-10-08 02:44:56,978][52060] Updated weights for policy 0, policy_version 69670 (0.0009) [2023-10-08 02:44:57,360][52060] Updated weights for policy 0, policy_version 69680 (0.0008) [2023-10-08 02:44:57,735][52060] Updated weights for policy 0, policy_version 69690 (0.0009) [2023-10-08 02:45:00,767][52059] Updated weights for policy 1, policy_version 70562 (0.0008) [2023-10-08 02:45:01,135][52059] Updated weights for policy 1, policy_version 70572 (0.0007) [2023-10-08 02:45:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 143622144. Throughput: 0: 1700.9, 1: 1740.9. Samples: 35922720. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:45:01,211][50642] Avg episode reward: [(0, '21.590'), (1, '20.890')] [2023-10-08 02:45:01,502][52059] Updated weights for policy 1, policy_version 70582 (0.0009) [2023-10-08 02:45:01,775][52060] Updated weights for policy 0, policy_version 69700 (0.0010) [2023-10-08 02:45:01,859][52059] Updated weights for policy 1, policy_version 70592 (0.0008) [2023-10-08 02:45:02,139][52060] Updated weights for policy 0, policy_version 69710 (0.0008) [2023-10-08 02:45:02,510][52060] Updated weights for policy 0, policy_version 69720 (0.0009) [2023-10-08 02:45:05,812][52059] Updated weights for policy 1, policy_version 70602 (0.0009) [2023-10-08 02:45:06,174][52059] Updated weights for policy 1, policy_version 70612 (0.0008) [2023-10-08 02:45:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 143687680. Throughput: 0: 1690.4, 1: 1742.5. Samples: 35932424. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:45:06,211][50642] Avg episode reward: [(0, '19.230'), (1, '21.920')] [2023-10-08 02:45:06,504][52060] Updated weights for policy 0, policy_version 69730 (0.0007) [2023-10-08 02:45:06,540][52059] Updated weights for policy 1, policy_version 70622 (0.0008) [2023-10-08 02:45:06,881][52060] Updated weights for policy 0, policy_version 69740 (0.0007) [2023-10-08 02:45:07,243][52060] Updated weights for policy 0, policy_version 69750 (0.0007) [2023-10-08 02:45:07,613][52060] Updated weights for policy 0, policy_version 69760 (0.0009) [2023-10-08 02:45:10,362][52059] Updated weights for policy 1, policy_version 70632 (0.0007) [2023-10-08 02:45:10,717][52059] Updated weights for policy 1, policy_version 70642 (0.0007) [2023-10-08 02:45:11,086][52059] Updated weights for policy 1, policy_version 70652 (0.0008) [2023-10-08 02:45:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 143753216. Throughput: 0: 1710.3, 1: 1748.7. Samples: 35953904. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:45:11,211][50642] Avg episode reward: [(0, '21.580'), (1, '19.020')] [2023-10-08 02:45:11,474][52060] Updated weights for policy 0, policy_version 69770 (0.0009) [2023-10-08 02:45:11,833][52060] Updated weights for policy 0, policy_version 69780 (0.0011) [2023-10-08 02:45:12,196][52060] Updated weights for policy 0, policy_version 69790 (0.0011) [2023-10-08 02:45:15,143][52059] Updated weights for policy 1, policy_version 70662 (0.0007) [2023-10-08 02:45:15,518][52059] Updated weights for policy 1, policy_version 70672 (0.0007) [2023-10-08 02:45:15,877][52059] Updated weights for policy 1, policy_version 70682 (0.0008) [2023-10-08 02:45:16,131][52060] Updated weights for policy 0, policy_version 69800 (0.0008) [2023-10-08 02:45:16,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 143851520. Throughput: 0: 1719.7, 1: 1727.6. Samples: 35974392. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:45:16,211][50642] Avg episode reward: [(0, '21.570'), (1, '20.270')] [2023-10-08 02:45:16,494][52060] Updated weights for policy 0, policy_version 69810 (0.0008) [2023-10-08 02:45:16,858][52060] Updated weights for policy 0, policy_version 69820 (0.0010) [2023-10-08 02:45:19,582][52059] Updated weights for policy 1, policy_version 70692 (0.0008) [2023-10-08 02:45:19,957][52059] Updated weights for policy 1, policy_version 70702 (0.0011) [2023-10-08 02:45:20,314][52059] Updated weights for policy 1, policy_version 70712 (0.0009) [2023-10-08 02:45:20,684][52060] Updated weights for policy 0, policy_version 69830 (0.0008) [2023-10-08 02:45:21,061][52060] Updated weights for policy 0, policy_version 69840 (0.0009) [2023-10-08 02:45:21,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 143917056. Throughput: 0: 1727.5, 1: 1755.6. Samples: 35985254. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:45:21,211][50642] Avg episode reward: [(0, '18.700'), (1, '20.940')] [2023-10-08 02:45:21,423][52060] Updated weights for policy 0, policy_version 69850 (0.0009) [2023-10-08 02:45:24,328][52059] Updated weights for policy 1, policy_version 70722 (0.0008) [2023-10-08 02:45:24,700][52059] Updated weights for policy 1, policy_version 70732 (0.0007) [2023-10-08 02:45:25,061][52059] Updated weights for policy 1, policy_version 70742 (0.0009) [2023-10-08 02:45:25,428][52059] Updated weights for policy 1, policy_version 70752 (0.0009) [2023-10-08 02:45:25,451][52060] Updated weights for policy 0, policy_version 69860 (0.0008) [2023-10-08 02:45:25,816][52060] Updated weights for policy 0, policy_version 69870 (0.0007) [2023-10-08 02:45:26,191][52060] Updated weights for policy 0, policy_version 69880 (0.0008) [2023-10-08 02:45:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 143982592. Throughput: 0: 1732.3, 1: 1737.2. Samples: 36005958. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:45:26,211][50642] Avg episode reward: [(0, '22.400'), (1, '22.430')] [2023-10-08 02:45:29,296][52059] Updated weights for policy 1, policy_version 70762 (0.0008) [2023-10-08 02:45:29,669][52059] Updated weights for policy 1, policy_version 70772 (0.0007) [2023-10-08 02:45:30,028][52059] Updated weights for policy 1, policy_version 70782 (0.0008) [2023-10-08 02:45:30,162][52060] Updated weights for policy 0, policy_version 69890 (0.0008) [2023-10-08 02:45:30,523][52060] Updated weights for policy 0, policy_version 69900 (0.0010) [2023-10-08 02:45:30,890][52060] Updated weights for policy 0, policy_version 69910 (0.0009) [2023-10-08 02:45:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 144048128. Throughput: 0: 1708.8, 1: 1729.5. Samples: 36025878. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:45:31,211][50642] Avg episode reward: [(0, '21.650'), (1, '23.070')] [2023-10-08 02:45:31,251][52060] Updated weights for policy 0, policy_version 69920 (0.0009) [2023-10-08 02:45:33,769][52059] Updated weights for policy 1, policy_version 70792 (0.0010) [2023-10-08 02:45:34,128][52059] Updated weights for policy 1, policy_version 70802 (0.0008) [2023-10-08 02:45:34,503][52059] Updated weights for policy 1, policy_version 70812 (0.0009) [2023-10-08 02:45:35,180][52060] Updated weights for policy 0, policy_version 69930 (0.0010) [2023-10-08 02:45:35,548][52060] Updated weights for policy 0, policy_version 69940 (0.0010) [2023-10-08 02:45:35,910][52060] Updated weights for policy 0, policy_version 69950 (0.0010) [2023-10-08 02:45:36,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 144146432. Throughput: 0: 1726.1, 1: 1750.3. Samples: 36037046. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:45:36,211][50642] Avg episode reward: [(0, '20.920'), (1, '20.280')] [2023-10-08 02:45:38,486][52059] Updated weights for policy 1, policy_version 70822 (0.0010) [2023-10-08 02:45:38,859][52059] Updated weights for policy 1, policy_version 70832 (0.0009) [2023-10-08 02:45:39,220][52059] Updated weights for policy 1, policy_version 70842 (0.0007) [2023-10-08 02:45:39,658][52060] Updated weights for policy 0, policy_version 69960 (0.0010) [2023-10-08 02:45:40,026][52060] Updated weights for policy 0, policy_version 69970 (0.0007) [2023-10-08 02:45:40,397][52060] Updated weights for policy 0, policy_version 69980 (0.0008) [2023-10-08 02:45:41,210][50642] Fps is (10 sec: 16384.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 144211968. Throughput: 0: 1727.2, 1: 1733.0. Samples: 36057258. Policy #0 lag: (min: 28.0, avg: 30.8, max: 58.0) [2023-10-08 02:45:41,211][50642] Avg episode reward: [(0, '21.490'), (1, '21.910')] [2023-10-08 02:45:43,090][52059] Updated weights for policy 1, policy_version 70852 (0.0007) [2023-10-08 02:45:43,464][52059] Updated weights for policy 1, policy_version 70862 (0.0009) [2023-10-08 02:45:43,834][52059] Updated weights for policy 1, policy_version 70872 (0.0008) [2023-10-08 02:45:44,515][52060] Updated weights for policy 0, policy_version 69990 (0.0008) [2023-10-08 02:45:44,898][52060] Updated weights for policy 0, policy_version 70000 (0.0007) [2023-10-08 02:45:45,267][52060] Updated weights for policy 0, policy_version 70010 (0.0007) [2023-10-08 02:45:46,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 144277504. Throughput: 0: 1709.7, 1: 1735.1. Samples: 36077734. Policy #0 lag: (min: 28.0, avg: 30.8, max: 58.0) [2023-10-08 02:45:46,211][50642] Avg episode reward: [(0, '21.630'), (1, '22.210')] [2023-10-08 02:45:47,592][52059] Updated weights for policy 1, policy_version 70882 (0.0008) [2023-10-08 02:45:47,959][52059] Updated weights for policy 1, policy_version 70892 (0.0011) [2023-10-08 02:45:48,321][52059] Updated weights for policy 1, policy_version 70902 (0.0011) [2023-10-08 02:45:48,680][52059] Updated weights for policy 1, policy_version 70912 (0.0011) [2023-10-08 02:45:49,233][52060] Updated weights for policy 0, policy_version 70020 (0.0009) [2023-10-08 02:45:49,601][52060] Updated weights for policy 0, policy_version 70030 (0.0010) [2023-10-08 02:45:49,967][52060] Updated weights for policy 0, policy_version 70040 (0.0008) [2023-10-08 02:45:51,211][50642] Fps is (10 sec: 13106.7, 60 sec: 14199.4, 300 sec: 13773.6). Total num frames: 144343040. Throughput: 0: 1744.1, 1: 1725.7. Samples: 36088566. Policy #0 lag: (min: 28.0, avg: 30.8, max: 58.0) [2023-10-08 02:45:51,212][50642] Avg episode reward: [(0, '21.030'), (1, '20.600')] [2023-10-08 02:45:52,561][52059] Updated weights for policy 1, policy_version 70922 (0.0007) [2023-10-08 02:45:52,927][52059] Updated weights for policy 1, policy_version 70932 (0.0007) [2023-10-08 02:45:53,291][52059] Updated weights for policy 1, policy_version 70942 (0.0010) [2023-10-08 02:45:54,015][52060] Updated weights for policy 0, policy_version 70050 (0.0009) [2023-10-08 02:45:54,385][52060] Updated weights for policy 0, policy_version 70060 (0.0008) [2023-10-08 02:45:54,764][52060] Updated weights for policy 0, policy_version 70070 (0.0008) [2023-10-08 02:45:55,130][52060] Updated weights for policy 0, policy_version 70080 (0.0008) [2023-10-08 02:45:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 144408576. Throughput: 0: 1716.8, 1: 1725.7. Samples: 36108820. Policy #0 lag: (min: 28.0, avg: 30.8, max: 58.0) [2023-10-08 02:45:56,211][50642] Avg episode reward: [(0, '20.830'), (1, '20.700')] [2023-10-08 02:45:57,239][52059] Updated weights for policy 1, policy_version 70952 (0.0008) [2023-10-08 02:45:57,609][52059] Updated weights for policy 1, policy_version 70962 (0.0007) [2023-10-08 02:45:57,971][52059] Updated weights for policy 1, policy_version 70972 (0.0008) [2023-10-08 02:45:58,984][52060] Updated weights for policy 0, policy_version 70090 (0.0010) [2023-10-08 02:45:59,359][52060] Updated weights for policy 0, policy_version 70100 (0.0008) [2023-10-08 02:45:59,723][52060] Updated weights for policy 0, policy_version 70110 (0.0009) [2023-10-08 02:46:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 144474112. Throughput: 0: 1700.7, 1: 1756.5. Samples: 36129966. Policy #0 lag: (min: 28.0, avg: 30.8, max: 58.0) [2023-10-08 02:46:01,211][50642] Avg episode reward: [(0, '20.660'), (1, '21.360')] [2023-10-08 02:46:01,862][52059] Updated weights for policy 1, policy_version 70982 (0.0007) [2023-10-08 02:46:02,235][52059] Updated weights for policy 1, policy_version 70992 (0.0009) [2023-10-08 02:46:02,605][52059] Updated weights for policy 1, policy_version 71002 (0.0009) [2023-10-08 02:46:03,774][52060] Updated weights for policy 0, policy_version 70120 (0.0010) [2023-10-08 02:46:04,137][52060] Updated weights for policy 0, policy_version 70130 (0.0009) [2023-10-08 02:46:04,507][52060] Updated weights for policy 0, policy_version 70140 (0.0007) [2023-10-08 02:46:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 144539648. Throughput: 0: 1714.8, 1: 1724.9. Samples: 36140042. Policy #0 lag: (min: 28.0, avg: 30.8, max: 58.0) [2023-10-08 02:46:06,211][50642] Avg episode reward: [(0, '20.760'), (1, '21.550')] [2023-10-08 02:46:06,643][52059] Updated weights for policy 1, policy_version 71012 (0.0007) [2023-10-08 02:46:07,013][52059] Updated weights for policy 1, policy_version 71022 (0.0008) [2023-10-08 02:46:07,383][52059] Updated weights for policy 1, policy_version 71032 (0.0007) [2023-10-08 02:46:08,402][52060] Updated weights for policy 0, policy_version 70150 (0.0010) [2023-10-08 02:46:08,767][52060] Updated weights for policy 0, policy_version 70160 (0.0010) [2023-10-08 02:46:09,138][52060] Updated weights for policy 0, policy_version 70170 (0.0011) [2023-10-08 02:46:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 144605184. Throughput: 0: 1688.0, 1: 1741.8. Samples: 36160298. Policy #0 lag: (min: 28.0, avg: 30.8, max: 58.0) [2023-10-08 02:46:11,211][50642] Avg episode reward: [(0, '19.140'), (1, '20.800')] [2023-10-08 02:46:11,231][52059] Updated weights for policy 1, policy_version 71042 (0.0007) [2023-10-08 02:46:11,591][52059] Updated weights for policy 1, policy_version 71052 (0.0007) [2023-10-08 02:46:11,962][52059] Updated weights for policy 1, policy_version 71062 (0.0008) [2023-10-08 02:46:12,317][52059] Updated weights for policy 1, policy_version 71072 (0.0009) [2023-10-08 02:46:13,144][52060] Updated weights for policy 0, policy_version 70180 (0.0008) [2023-10-08 02:46:13,506][52060] Updated weights for policy 0, policy_version 70190 (0.0008) [2023-10-08 02:46:13,878][52060] Updated weights for policy 0, policy_version 70200 (0.0008) [2023-10-08 02:46:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 144670720. Throughput: 0: 1707.6, 1: 1750.5. Samples: 36181490. Policy #0 lag: (min: 28.0, avg: 30.8, max: 58.0) [2023-10-08 02:46:16,211][50642] Avg episode reward: [(0, '20.620'), (1, '20.690')] [2023-10-08 02:46:16,236][52059] Updated weights for policy 1, policy_version 71082 (0.0008) [2023-10-08 02:46:16,590][52059] Updated weights for policy 1, policy_version 71092 (0.0009) [2023-10-08 02:46:16,963][52059] Updated weights for policy 1, policy_version 71102 (0.0008) [2023-10-08 02:46:17,958][52060] Updated weights for policy 0, policy_version 70210 (0.0007) [2023-10-08 02:46:18,338][52060] Updated weights for policy 0, policy_version 70220 (0.0008) [2023-10-08 02:46:18,695][52060] Updated weights for policy 0, policy_version 70230 (0.0007) [2023-10-08 02:46:19,069][52060] Updated weights for policy 0, policy_version 70240 (0.0008) [2023-10-08 02:46:20,902][52059] Updated weights for policy 1, policy_version 71112 (0.0009) [2023-10-08 02:46:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 144736256. Throughput: 0: 1696.0, 1: 1731.4. Samples: 36191278. Policy #0 lag: (min: 28.0, avg: 30.8, max: 58.0) [2023-10-08 02:46:21,211][50642] Avg episode reward: [(0, '21.380'), (1, '20.600')] [2023-10-08 02:46:21,270][52059] Updated weights for policy 1, policy_version 71122 (0.0008) [2023-10-08 02:46:21,630][52059] Updated weights for policy 1, policy_version 71132 (0.0009) [2023-10-08 02:46:22,941][52060] Updated weights for policy 0, policy_version 70250 (0.0008) [2023-10-08 02:46:23,309][52060] Updated weights for policy 0, policy_version 70260 (0.0009) [2023-10-08 02:46:23,678][52060] Updated weights for policy 0, policy_version 70270 (0.0008) [2023-10-08 02:46:25,619][52059] Updated weights for policy 1, policy_version 71142 (0.0010) [2023-10-08 02:46:25,988][52059] Updated weights for policy 1, policy_version 71152 (0.0010) [2023-10-08 02:46:26,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 144801792. Throughput: 0: 1696.7, 1: 1754.9. Samples: 36212582. Policy #0 lag: (min: 28.0, avg: 30.8, max: 58.0) [2023-10-08 02:46:26,211][50642] Avg episode reward: [(0, '20.070'), (1, '19.240')] [2023-10-08 02:46:26,352][52059] Updated weights for policy 1, policy_version 71162 (0.0007) [2023-10-08 02:46:27,607][52060] Updated weights for policy 0, policy_version 70280 (0.0007) [2023-10-08 02:46:27,976][52060] Updated weights for policy 0, policy_version 70290 (0.0009) [2023-10-08 02:46:28,353][52060] Updated weights for policy 0, policy_version 70300 (0.0009) [2023-10-08 02:46:30,214][52059] Updated weights for policy 1, policy_version 71172 (0.0007) [2023-10-08 02:46:30,571][52059] Updated weights for policy 1, policy_version 71182 (0.0010) [2023-10-08 02:46:30,939][52059] Updated weights for policy 1, policy_version 71192 (0.0010) [2023-10-08 02:46:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 144867328. Throughput: 0: 1720.5, 1: 1730.9. Samples: 36233044. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:46:31,211][50642] Avg episode reward: [(0, '22.310'), (1, '21.080')] [2023-10-08 02:46:32,531][52060] Updated weights for policy 0, policy_version 70310 (0.0008) [2023-10-08 02:46:32,908][52060] Updated weights for policy 0, policy_version 70320 (0.0008) [2023-10-08 02:46:33,282][52060] Updated weights for policy 0, policy_version 70330 (0.0008) [2023-10-08 02:46:34,916][52059] Updated weights for policy 1, policy_version 71202 (0.0008) [2023-10-08 02:46:35,271][52059] Updated weights for policy 1, policy_version 71212 (0.0009) [2023-10-08 02:46:35,646][52059] Updated weights for policy 1, policy_version 71222 (0.0010) [2023-10-08 02:46:36,002][52059] Updated weights for policy 1, policy_version 71232 (0.0008) [2023-10-08 02:46:36,210][50642] Fps is (10 sec: 16384.4, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 144965632. Throughput: 0: 1681.7, 1: 1757.3. Samples: 36243320. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:46:36,211][50642] Avg episode reward: [(0, '21.630'), (1, '19.940')] [2023-10-08 02:46:37,077][52060] Updated weights for policy 0, policy_version 70340 (0.0008) [2023-10-08 02:46:37,442][52060] Updated weights for policy 0, policy_version 70350 (0.0007) [2023-10-08 02:46:37,818][52060] Updated weights for policy 0, policy_version 70360 (0.0009) [2023-10-08 02:46:39,901][52059] Updated weights for policy 1, policy_version 71242 (0.0009) [2023-10-08 02:46:40,262][52059] Updated weights for policy 1, policy_version 71252 (0.0009) [2023-10-08 02:46:40,628][52059] Updated weights for policy 1, policy_version 71262 (0.0007) [2023-10-08 02:46:41,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 145031168. Throughput: 0: 1710.7, 1: 1748.2. Samples: 36264468. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:46:41,211][50642] Avg episode reward: [(0, '19.400'), (1, '20.540')] [2023-10-08 02:46:41,743][52060] Updated weights for policy 0, policy_version 70370 (0.0010) [2023-10-08 02:46:42,119][52060] Updated weights for policy 0, policy_version 70380 (0.0009) [2023-10-08 02:46:42,487][52060] Updated weights for policy 0, policy_version 70390 (0.0007) [2023-10-08 02:46:42,853][52060] Updated weights for policy 0, policy_version 70400 (0.0007) [2023-10-08 02:46:44,635][52059] Updated weights for policy 1, policy_version 71272 (0.0009) [2023-10-08 02:46:45,000][52059] Updated weights for policy 1, policy_version 71282 (0.0010) [2023-10-08 02:46:45,365][52059] Updated weights for policy 1, policy_version 71292 (0.0007) [2023-10-08 02:46:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 145096704. Throughput: 0: 1724.0, 1: 1721.6. Samples: 36285014. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:46:46,211][50642] Avg episode reward: [(0, '21.620'), (1, '21.700')] [2023-10-08 02:46:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000071296_73007104.pth... [2023-10-08 02:46:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000070400_72089600.pth... [2023-10-08 02:46:46,249][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000069664_71335936.pth [2023-10-08 02:46:46,257][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000068800_70451200.pth [2023-10-08 02:46:46,847][52060] Updated weights for policy 0, policy_version 70410 (0.0008) [2023-10-08 02:46:47,228][52060] Updated weights for policy 0, policy_version 70420 (0.0007) [2023-10-08 02:46:47,599][52060] Updated weights for policy 0, policy_version 70430 (0.0009) [2023-10-08 02:46:49,298][52059] Updated weights for policy 1, policy_version 71302 (0.0007) [2023-10-08 02:46:49,696][52059] Updated weights for policy 1, policy_version 71312 (0.0008) [2023-10-08 02:46:50,055][52059] Updated weights for policy 1, policy_version 71322 (0.0008) [2023-10-08 02:46:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 145162240. Throughput: 0: 1699.0, 1: 1757.5. Samples: 36295584. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:46:51,211][50642] Avg episode reward: [(0, '20.880'), (1, '21.420')] [2023-10-08 02:46:51,783][52060] Updated weights for policy 0, policy_version 70440 (0.0012) [2023-10-08 02:46:52,161][52060] Updated weights for policy 0, policy_version 70450 (0.0008) [2023-10-08 02:46:52,529][52060] Updated weights for policy 0, policy_version 70460 (0.0011) [2023-10-08 02:46:54,051][52059] Updated weights for policy 1, policy_version 71332 (0.0008) [2023-10-08 02:46:54,402][52059] Updated weights for policy 1, policy_version 71342 (0.0007) [2023-10-08 02:46:54,773][52059] Updated weights for policy 1, policy_version 71352 (0.0009) [2023-10-08 02:46:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 145227776. Throughput: 0: 1721.8, 1: 1724.2. Samples: 36315370. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:46:56,211][50642] Avg episode reward: [(0, '19.360'), (1, '21.200')] [2023-10-08 02:46:56,585][52060] Updated weights for policy 0, policy_version 70470 (0.0009) [2023-10-08 02:46:56,957][52060] Updated weights for policy 0, policy_version 70480 (0.0008) [2023-10-08 02:46:57,330][52060] Updated weights for policy 0, policy_version 70490 (0.0008) [2023-10-08 02:46:58,762][52059] Updated weights for policy 1, policy_version 71362 (0.0011) [2023-10-08 02:46:59,134][52059] Updated weights for policy 1, policy_version 71372 (0.0007) [2023-10-08 02:46:59,503][52059] Updated weights for policy 1, policy_version 71382 (0.0008) [2023-10-08 02:46:59,868][52059] Updated weights for policy 1, policy_version 71392 (0.0009) [2023-10-08 02:47:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 145293312. Throughput: 0: 1724.9, 1: 1716.4. Samples: 36336350. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:47:01,211][50642] Avg episode reward: [(0, '20.130'), (1, '22.240')] [2023-10-08 02:47:01,330][52060] Updated weights for policy 0, policy_version 70500 (0.0008) [2023-10-08 02:47:01,699][52060] Updated weights for policy 0, policy_version 70510 (0.0009) [2023-10-08 02:47:02,078][52060] Updated weights for policy 0, policy_version 70520 (0.0009) [2023-10-08 02:47:03,769][52059] Updated weights for policy 1, policy_version 71402 (0.0010) [2023-10-08 02:47:04,132][52059] Updated weights for policy 1, policy_version 71412 (0.0010) [2023-10-08 02:47:04,507][52059] Updated weights for policy 1, policy_version 71422 (0.0008) [2023-10-08 02:47:05,880][52060] Updated weights for policy 0, policy_version 70530 (0.0009) [2023-10-08 02:47:06,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 145358848. Throughput: 0: 1720.6, 1: 1730.4. Samples: 36346572. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:47:06,211][50642] Avg episode reward: [(0, '21.630'), (1, '21.720')] [2023-10-08 02:47:06,261][52060] Updated weights for policy 0, policy_version 70540 (0.0010) [2023-10-08 02:47:06,642][52060] Updated weights for policy 0, policy_version 70550 (0.0011) [2023-10-08 02:47:07,013][52060] Updated weights for policy 0, policy_version 70560 (0.0007) [2023-10-08 02:47:08,490][52059] Updated weights for policy 1, policy_version 71432 (0.0009) [2023-10-08 02:47:08,863][52059] Updated weights for policy 1, policy_version 71442 (0.0009) [2023-10-08 02:47:09,222][52059] Updated weights for policy 1, policy_version 71452 (0.0009) [2023-10-08 02:47:10,862][52060] Updated weights for policy 0, policy_version 70570 (0.0007) [2023-10-08 02:47:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 145424384. Throughput: 0: 1731.2, 1: 1706.2. Samples: 36367264. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:47:11,211][50642] Avg episode reward: [(0, '19.500'), (1, '22.240')] [2023-10-08 02:47:11,232][52060] Updated weights for policy 0, policy_version 70580 (0.0008) [2023-10-08 02:47:11,611][52060] Updated weights for policy 0, policy_version 70590 (0.0011) [2023-10-08 02:47:13,091][52059] Updated weights for policy 1, policy_version 71462 (0.0008) [2023-10-08 02:47:13,454][52059] Updated weights for policy 1, policy_version 71472 (0.0008) [2023-10-08 02:47:13,818][52059] Updated weights for policy 1, policy_version 71482 (0.0008) [2023-10-08 02:47:15,595][52060] Updated weights for policy 0, policy_version 70600 (0.0009) [2023-10-08 02:47:15,954][52060] Updated weights for policy 0, policy_version 70610 (0.0008) [2023-10-08 02:47:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 145489920. Throughput: 0: 1717.4, 1: 1731.8. Samples: 36388258. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:47:16,211][50642] Avg episode reward: [(0, '19.700'), (1, '21.530')] [2023-10-08 02:47:16,334][52060] Updated weights for policy 0, policy_version 70620 (0.0007) [2023-10-08 02:47:17,640][52059] Updated weights for policy 1, policy_version 71492 (0.0009) [2023-10-08 02:47:18,005][52059] Updated weights for policy 1, policy_version 71502 (0.0008) [2023-10-08 02:47:18,370][52059] Updated weights for policy 1, policy_version 71512 (0.0009) [2023-10-08 02:47:20,289][52060] Updated weights for policy 0, policy_version 70630 (0.0008) [2023-10-08 02:47:20,665][52060] Updated weights for policy 0, policy_version 70640 (0.0007) [2023-10-08 02:47:21,031][52060] Updated weights for policy 0, policy_version 70650 (0.0009) [2023-10-08 02:47:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 145555456. Throughput: 0: 1735.6, 1: 1708.4. Samples: 36398298. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:47:21,211][50642] Avg episode reward: [(0, '20.650'), (1, '23.460')] [2023-10-08 02:47:22,269][52059] Updated weights for policy 1, policy_version 71522 (0.0007) [2023-10-08 02:47:22,630][52059] Updated weights for policy 1, policy_version 71532 (0.0007) [2023-10-08 02:47:22,998][52059] Updated weights for policy 1, policy_version 71542 (0.0009) [2023-10-08 02:47:23,364][52059] Updated weights for policy 1, policy_version 71552 (0.0009) [2023-10-08 02:47:25,129][52060] Updated weights for policy 0, policy_version 70660 (0.0009) [2023-10-08 02:47:25,503][52060] Updated weights for policy 0, policy_version 70670 (0.0011) [2023-10-08 02:47:25,866][52060] Updated weights for policy 0, policy_version 70680 (0.0008) [2023-10-08 02:47:26,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 145653760. Throughput: 0: 1731.8, 1: 1717.7. Samples: 36419696. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:47:26,211][50642] Avg episode reward: [(0, '19.980'), (1, '21.920')] [2023-10-08 02:47:27,152][52059] Updated weights for policy 1, policy_version 71562 (0.0007) [2023-10-08 02:47:27,528][52059] Updated weights for policy 1, policy_version 71572 (0.0007) [2023-10-08 02:47:27,888][52059] Updated weights for policy 1, policy_version 71582 (0.0008) [2023-10-08 02:47:29,760][52060] Updated weights for policy 0, policy_version 70690 (0.0008) [2023-10-08 02:47:30,140][52060] Updated weights for policy 0, policy_version 70700 (0.0009) [2023-10-08 02:47:30,512][52060] Updated weights for policy 0, policy_version 70710 (0.0009) [2023-10-08 02:47:30,886][52060] Updated weights for policy 0, policy_version 70720 (0.0008) [2023-10-08 02:47:31,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 145719296. Throughput: 0: 1697.5, 1: 1743.8. Samples: 36439870. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:47:31,211][50642] Avg episode reward: [(0, '19.140'), (1, '22.070')] [2023-10-08 02:47:31,794][52059] Updated weights for policy 1, policy_version 71592 (0.0010) [2023-10-08 02:47:32,171][52059] Updated weights for policy 1, policy_version 71602 (0.0009) [2023-10-08 02:47:32,544][52059] Updated weights for policy 1, policy_version 71612 (0.0008) [2023-10-08 02:47:34,793][52060] Updated weights for policy 0, policy_version 70730 (0.0007) [2023-10-08 02:47:35,165][52060] Updated weights for policy 0, policy_version 70740 (0.0007) [2023-10-08 02:47:35,531][52060] Updated weights for policy 0, policy_version 70750 (0.0008) [2023-10-08 02:47:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 145784832. Throughput: 0: 1727.8, 1: 1712.1. Samples: 36450382. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:47:36,211][50642] Avg episode reward: [(0, '20.770'), (1, '21.170')] [2023-10-08 02:47:36,516][52059] Updated weights for policy 1, policy_version 71622 (0.0009) [2023-10-08 02:47:36,918][52059] Updated weights for policy 1, policy_version 71632 (0.0009) [2023-10-08 02:47:37,293][52059] Updated weights for policy 1, policy_version 71642 (0.0009) [2023-10-08 02:47:39,543][52060] Updated weights for policy 0, policy_version 70760 (0.0010) [2023-10-08 02:47:39,907][52060] Updated weights for policy 0, policy_version 70770 (0.0011) [2023-10-08 02:47:40,272][52060] Updated weights for policy 0, policy_version 70780 (0.0010) [2023-10-08 02:47:41,049][52059] Updated weights for policy 1, policy_version 71652 (0.0008) [2023-10-08 02:47:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 145850368. Throughput: 0: 1712.1, 1: 1745.8. Samples: 36470978. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:47:41,211][50642] Avg episode reward: [(0, '21.750'), (1, '21.580')] [2023-10-08 02:47:41,417][52059] Updated weights for policy 1, policy_version 71662 (0.0007) [2023-10-08 02:47:41,787][52059] Updated weights for policy 1, policy_version 71672 (0.0008) [2023-10-08 02:47:44,259][52060] Updated weights for policy 0, policy_version 70790 (0.0010) [2023-10-08 02:47:44,625][52060] Updated weights for policy 0, policy_version 70800 (0.0011) [2023-10-08 02:47:44,999][52060] Updated weights for policy 0, policy_version 70810 (0.0011) [2023-10-08 02:47:45,660][52059] Updated weights for policy 1, policy_version 71682 (0.0008) [2023-10-08 02:47:46,035][52059] Updated weights for policy 1, policy_version 71692 (0.0011) [2023-10-08 02:47:46,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 145915904. Throughput: 0: 1694.1, 1: 1752.2. Samples: 36491432. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:47:46,211][50642] Avg episode reward: [(0, '19.440'), (1, '23.930')] [2023-10-08 02:47:46,397][52059] Updated weights for policy 1, policy_version 71702 (0.0011) [2023-10-08 02:47:46,766][52059] Updated weights for policy 1, policy_version 71712 (0.0010) [2023-10-08 02:47:49,043][52060] Updated weights for policy 0, policy_version 70820 (0.0009) [2023-10-08 02:47:49,408][52060] Updated weights for policy 0, policy_version 70830 (0.0008) [2023-10-08 02:47:49,784][52060] Updated weights for policy 0, policy_version 70840 (0.0009) [2023-10-08 02:47:50,720][52059] Updated weights for policy 1, policy_version 71722 (0.0010) [2023-10-08 02:47:51,086][52059] Updated weights for policy 1, policy_version 71732 (0.0010) [2023-10-08 02:47:51,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 145981440. Throughput: 0: 1723.3, 1: 1740.4. Samples: 36502442. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:47:51,211][50642] Avg episode reward: [(0, '22.380'), (1, '22.030')] [2023-10-08 02:47:51,452][52059] Updated weights for policy 1, policy_version 71742 (0.0007) [2023-10-08 02:47:53,746][52060] Updated weights for policy 0, policy_version 70850 (0.0009) [2023-10-08 02:47:54,129][52060] Updated weights for policy 0, policy_version 70860 (0.0009) [2023-10-08 02:47:54,498][52060] Updated weights for policy 0, policy_version 70870 (0.0008) [2023-10-08 02:47:54,867][52060] Updated weights for policy 0, policy_version 70880 (0.0009) [2023-10-08 02:47:55,300][52059] Updated weights for policy 1, policy_version 71752 (0.0011) [2023-10-08 02:47:55,662][52059] Updated weights for policy 1, policy_version 71762 (0.0010) [2023-10-08 02:47:56,028][52059] Updated weights for policy 1, policy_version 71772 (0.0008) [2023-10-08 02:47:56,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 146079744. Throughput: 0: 1689.9, 1: 1762.2. Samples: 36522608. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:47:56,211][50642] Avg episode reward: [(0, '21.070'), (1, '21.210')] [2023-10-08 02:47:58,612][52060] Updated weights for policy 0, policy_version 70890 (0.0008) [2023-10-08 02:47:58,983][52060] Updated weights for policy 0, policy_version 70900 (0.0008) [2023-10-08 02:47:59,349][52060] Updated weights for policy 0, policy_version 70910 (0.0008) [2023-10-08 02:47:59,835][52059] Updated weights for policy 1, policy_version 71782 (0.0010) [2023-10-08 02:48:00,205][52059] Updated weights for policy 1, policy_version 71792 (0.0007) [2023-10-08 02:48:00,556][52059] Updated weights for policy 1, policy_version 71802 (0.0010) [2023-10-08 02:48:01,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 146145280. Throughput: 0: 1705.2, 1: 1731.1. Samples: 36542896. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:48:01,211][50642] Avg episode reward: [(0, '19.420'), (1, '20.690')] [2023-10-08 02:48:03,573][52060] Updated weights for policy 0, policy_version 70920 (0.0008) [2023-10-08 02:48:03,944][52060] Updated weights for policy 0, policy_version 70930 (0.0010) [2023-10-08 02:48:04,322][52060] Updated weights for policy 0, policy_version 70940 (0.0010) [2023-10-08 02:48:04,633][52059] Updated weights for policy 1, policy_version 71812 (0.0008) [2023-10-08 02:48:04,986][52059] Updated weights for policy 1, policy_version 71822 (0.0009) [2023-10-08 02:48:05,352][52059] Updated weights for policy 1, policy_version 71832 (0.0010) [2023-10-08 02:48:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 146210816. Throughput: 0: 1700.4, 1: 1762.0. Samples: 36554110. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:48:06,211][50642] Avg episode reward: [(0, '21.040'), (1, '25.700')] [2023-10-08 02:48:08,505][52060] Updated weights for policy 0, policy_version 70950 (0.0007) [2023-10-08 02:48:08,887][52060] Updated weights for policy 0, policy_version 70960 (0.0008) [2023-10-08 02:48:09,168][52059] Updated weights for policy 1, policy_version 71842 (0.0008) [2023-10-08 02:48:09,246][52060] Updated weights for policy 0, policy_version 70970 (0.0008) [2023-10-08 02:48:09,534][52059] Updated weights for policy 1, policy_version 71852 (0.0009) [2023-10-08 02:48:09,904][52059] Updated weights for policy 1, policy_version 71862 (0.0010) [2023-10-08 02:48:10,271][52059] Updated weights for policy 1, policy_version 71872 (0.0009) [2023-10-08 02:48:11,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 146276352. Throughput: 0: 1684.3, 1: 1744.2. Samples: 36573978. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:48:11,211][50642] Avg episode reward: [(0, '21.540'), (1, '22.670')] [2023-10-08 02:48:13,003][52060] Updated weights for policy 0, policy_version 70980 (0.0007) [2023-10-08 02:48:13,376][52060] Updated weights for policy 0, policy_version 70990 (0.0007) [2023-10-08 02:48:13,739][52060] Updated weights for policy 0, policy_version 71000 (0.0007) [2023-10-08 02:48:14,239][52059] Updated weights for policy 1, policy_version 71882 (0.0008) [2023-10-08 02:48:14,612][52059] Updated weights for policy 1, policy_version 71892 (0.0010) [2023-10-08 02:48:14,976][52059] Updated weights for policy 1, policy_version 71902 (0.0008) [2023-10-08 02:48:16,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 146341888. Throughput: 0: 1714.9, 1: 1728.0. Samples: 36594800. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:48:16,211][50642] Avg episode reward: [(0, '20.410'), (1, '21.100')] [2023-10-08 02:48:17,853][52060] Updated weights for policy 0, policy_version 71010 (0.0009) [2023-10-08 02:48:18,229][52060] Updated weights for policy 0, policy_version 71020 (0.0010) [2023-10-08 02:48:18,600][52060] Updated weights for policy 0, policy_version 71030 (0.0009) [2023-10-08 02:48:18,872][52059] Updated weights for policy 1, policy_version 71912 (0.0008) [2023-10-08 02:48:18,966][52060] Updated weights for policy 0, policy_version 71040 (0.0009) [2023-10-08 02:48:19,229][52059] Updated weights for policy 1, policy_version 71922 (0.0009) [2023-10-08 02:48:19,596][52059] Updated weights for policy 1, policy_version 71932 (0.0007) [2023-10-08 02:48:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 146407424. Throughput: 0: 1695.1, 1: 1756.2. Samples: 36605690. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:48:21,211][50642] Avg episode reward: [(0, '20.870'), (1, '22.590')] [2023-10-08 02:48:22,936][52060] Updated weights for policy 0, policy_version 71050 (0.0010) [2023-10-08 02:48:23,309][52060] Updated weights for policy 0, policy_version 71060 (0.0008) [2023-10-08 02:48:23,504][52059] Updated weights for policy 1, policy_version 71942 (0.0009) [2023-10-08 02:48:23,675][52060] Updated weights for policy 0, policy_version 71070 (0.0009) [2023-10-08 02:48:23,868][52059] Updated weights for policy 1, policy_version 71952 (0.0008) [2023-10-08 02:48:24,225][52059] Updated weights for policy 1, policy_version 71962 (0.0009) [2023-10-08 02:48:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 146472960. Throughput: 0: 1704.6, 1: 1732.7. Samples: 36625654. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:48:26,211][50642] Avg episode reward: [(0, '21.790'), (1, '20.820')] [2023-10-08 02:48:27,759][52060] Updated weights for policy 0, policy_version 71080 (0.0007) [2023-10-08 02:48:28,125][52060] Updated weights for policy 0, policy_version 71090 (0.0009) [2023-10-08 02:48:28,165][52059] Updated weights for policy 1, policy_version 71972 (0.0007) [2023-10-08 02:48:28,502][52060] Updated weights for policy 0, policy_version 71100 (0.0008) [2023-10-08 02:48:28,579][52059] Updated weights for policy 1, policy_version 71982 (0.0009) [2023-10-08 02:48:28,955][52059] Updated weights for policy 1, policy_version 71992 (0.0008) [2023-10-08 02:48:31,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 146538496. Throughput: 0: 1720.3, 1: 1734.8. Samples: 36646912. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:48:31,211][50642] Avg episode reward: [(0, '19.740'), (1, '20.590')] [2023-10-08 02:48:32,428][52060] Updated weights for policy 0, policy_version 71110 (0.0009) [2023-10-08 02:48:32,754][52059] Updated weights for policy 1, policy_version 72002 (0.0009) [2023-10-08 02:48:32,791][52060] Updated weights for policy 0, policy_version 71120 (0.0008) [2023-10-08 02:48:33,119][52059] Updated weights for policy 1, policy_version 72012 (0.0010) [2023-10-08 02:48:33,162][52060] Updated weights for policy 0, policy_version 71130 (0.0008) [2023-10-08 02:48:33,486][52059] Updated weights for policy 1, policy_version 72022 (0.0010) [2023-10-08 02:48:33,843][52059] Updated weights for policy 1, policy_version 72032 (0.0007) [2023-10-08 02:48:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 146604032. Throughput: 0: 1687.4, 1: 1732.5. Samples: 36656338. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:48:36,211][50642] Avg episode reward: [(0, '19.230'), (1, '20.360')] [2023-10-08 02:48:37,088][52060] Updated weights for policy 0, policy_version 71140 (0.0008) [2023-10-08 02:48:37,455][52060] Updated weights for policy 0, policy_version 71150 (0.0009) [2023-10-08 02:48:37,759][52059] Updated weights for policy 1, policy_version 72042 (0.0007) [2023-10-08 02:48:37,825][52060] Updated weights for policy 0, policy_version 71160 (0.0008) [2023-10-08 02:48:38,115][52059] Updated weights for policy 1, policy_version 72052 (0.0008) [2023-10-08 02:48:38,490][52059] Updated weights for policy 1, policy_version 72062 (0.0007) [2023-10-08 02:48:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 146669568. Throughput: 0: 1719.2, 1: 1730.9. Samples: 36677866. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:48:41,211][50642] Avg episode reward: [(0, '20.130'), (1, '20.900')] [2023-10-08 02:48:41,743][52060] Updated weights for policy 0, policy_version 71170 (0.0009) [2023-10-08 02:48:42,112][52060] Updated weights for policy 0, policy_version 71180 (0.0008) [2023-10-08 02:48:42,302][52059] Updated weights for policy 1, policy_version 72072 (0.0008) [2023-10-08 02:48:42,492][52060] Updated weights for policy 0, policy_version 71190 (0.0008) [2023-10-08 02:48:42,668][52059] Updated weights for policy 1, policy_version 72082 (0.0008) [2023-10-08 02:48:42,856][52060] Updated weights for policy 0, policy_version 71200 (0.0009) [2023-10-08 02:48:43,031][52059] Updated weights for policy 1, policy_version 72092 (0.0008) [2023-10-08 02:48:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 146735104. Throughput: 0: 1709.5, 1: 1758.4. Samples: 36698948. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:48:46,211][50642] Avg episode reward: [(0, '19.860'), (1, '19.350')] [2023-10-08 02:48:46,222][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000072096_73826304.pth... [2023-10-08 02:48:46,222][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000071200_72908800.pth... [2023-10-08 02:48:46,251][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000070464_72155136.pth [2023-10-08 02:48:46,259][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000069600_71270400.pth [2023-10-08 02:48:46,916][52059] Updated weights for policy 1, policy_version 72102 (0.0008) [2023-10-08 02:48:46,950][52060] Updated weights for policy 0, policy_version 71210 (0.0008) [2023-10-08 02:48:47,275][52059] Updated weights for policy 1, policy_version 72112 (0.0007) [2023-10-08 02:48:47,327][52060] Updated weights for policy 0, policy_version 71220 (0.0008) [2023-10-08 02:48:47,640][52059] Updated weights for policy 1, policy_version 72122 (0.0008) [2023-10-08 02:48:47,685][52060] Updated weights for policy 0, policy_version 71230 (0.0007) [2023-10-08 02:48:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 146800640. Throughput: 0: 1697.6, 1: 1727.6. Samples: 36708242. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:48:51,211][50642] Avg episode reward: [(0, '17.350'), (1, '18.950')] [2023-10-08 02:48:51,689][52059] Updated weights for policy 1, policy_version 72132 (0.0008) [2023-10-08 02:48:51,730][52060] Updated weights for policy 0, policy_version 71240 (0.0008) [2023-10-08 02:48:52,047][52059] Updated weights for policy 1, policy_version 72142 (0.0008) [2023-10-08 02:48:52,098][52060] Updated weights for policy 0, policy_version 71250 (0.0009) [2023-10-08 02:48:52,413][52059] Updated weights for policy 1, policy_version 72152 (0.0009) [2023-10-08 02:48:52,467][52060] Updated weights for policy 0, policy_version 71260 (0.0009) [2023-10-08 02:48:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 146866176. Throughput: 0: 1713.2, 1: 1743.9. Samples: 36729546. Policy #0 lag: (min: 6.0, avg: 13.0, max: 38.0) [2023-10-08 02:48:56,211][50642] Avg episode reward: [(0, '19.090'), (1, '20.060')] [2023-10-08 02:48:56,374][52059] Updated weights for policy 1, policy_version 72162 (0.0010) [2023-10-08 02:48:56,591][52060] Updated weights for policy 0, policy_version 71270 (0.0008) [2023-10-08 02:48:56,738][52059] Updated weights for policy 1, policy_version 72172 (0.0008) [2023-10-08 02:48:56,959][52060] Updated weights for policy 0, policy_version 71280 (0.0007) [2023-10-08 02:48:57,106][52059] Updated weights for policy 1, policy_version 72182 (0.0008) [2023-10-08 02:48:57,330][52060] Updated weights for policy 0, policy_version 71290 (0.0007) [2023-10-08 02:48:57,467][52059] Updated weights for policy 1, policy_version 72192 (0.0007) [2023-10-08 02:49:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 146931712. Throughput: 0: 1708.5, 1: 1754.5. Samples: 36750630. Policy #0 lag: (min: 6.0, avg: 13.0, max: 38.0) [2023-10-08 02:49:01,211][50642] Avg episode reward: [(0, '20.250'), (1, '18.980')] [2023-10-08 02:49:01,241][52060] Updated weights for policy 0, policy_version 71300 (0.0008) [2023-10-08 02:49:01,425][52059] Updated weights for policy 1, policy_version 72202 (0.0008) [2023-10-08 02:49:01,615][52060] Updated weights for policy 0, policy_version 71310 (0.0008) [2023-10-08 02:49:01,791][52059] Updated weights for policy 1, policy_version 72212 (0.0009) [2023-10-08 02:49:01,978][52060] Updated weights for policy 0, policy_version 71320 (0.0007) [2023-10-08 02:49:02,147][52059] Updated weights for policy 1, policy_version 72222 (0.0007) [2023-10-08 02:49:05,666][52060] Updated weights for policy 0, policy_version 71330 (0.0007) [2023-10-08 02:49:06,031][52060] Updated weights for policy 0, policy_version 71340 (0.0007) [2023-10-08 02:49:06,054][52059] Updated weights for policy 1, policy_version 72232 (0.0007) [2023-10-08 02:49:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 146997248. Throughput: 0: 1707.4, 1: 1723.3. Samples: 36760072. Policy #0 lag: (min: 6.0, avg: 13.0, max: 38.0) [2023-10-08 02:49:06,211][50642] Avg episode reward: [(0, '16.600'), (1, '21.160')] [2023-10-08 02:49:06,398][52060] Updated weights for policy 0, policy_version 71350 (0.0007) [2023-10-08 02:49:06,418][52059] Updated weights for policy 1, policy_version 72242 (0.0008) [2023-10-08 02:49:06,772][52060] Updated weights for policy 0, policy_version 71360 (0.0007) [2023-10-08 02:49:06,789][52059] Updated weights for policy 1, policy_version 72252 (0.0008) [2023-10-08 02:49:10,641][52059] Updated weights for policy 1, policy_version 72262 (0.0008) [2023-10-08 02:49:10,778][52060] Updated weights for policy 0, policy_version 71370 (0.0009) [2023-10-08 02:49:11,008][52059] Updated weights for policy 1, policy_version 72272 (0.0011) [2023-10-08 02:49:11,152][52060] Updated weights for policy 0, policy_version 71380 (0.0008) [2023-10-08 02:49:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 147062784. Throughput: 0: 1716.6, 1: 1745.6. Samples: 36781454. Policy #0 lag: (min: 6.0, avg: 13.0, max: 38.0) [2023-10-08 02:49:11,211][50642] Avg episode reward: [(0, '16.000'), (1, '19.210')] [2023-10-08 02:49:11,382][52059] Updated weights for policy 1, policy_version 72282 (0.0009) [2023-10-08 02:49:11,525][52060] Updated weights for policy 0, policy_version 71390 (0.0008) [2023-10-08 02:49:15,369][52059] Updated weights for policy 1, policy_version 72292 (0.0008) [2023-10-08 02:49:15,518][52060] Updated weights for policy 0, policy_version 71400 (0.0008) [2023-10-08 02:49:15,766][52059] Updated weights for policy 1, policy_version 72302 (0.0009) [2023-10-08 02:49:15,883][52060] Updated weights for policy 0, policy_version 71410 (0.0008) [2023-10-08 02:49:16,138][52059] Updated weights for policy 1, policy_version 72312 (0.0007) [2023-10-08 02:49:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 147128320. Throughput: 0: 1705.3, 1: 1736.2. Samples: 36801778. Policy #0 lag: (min: 6.0, avg: 13.0, max: 38.0) [2023-10-08 02:49:16,211][50642] Avg episode reward: [(0, '15.340'), (1, '20.600')] [2023-10-08 02:49:16,258][52060] Updated weights for policy 0, policy_version 71420 (0.0008) [2023-10-08 02:49:19,939][52059] Updated weights for policy 1, policy_version 72322 (0.0009) [2023-10-08 02:49:20,230][52060] Updated weights for policy 0, policy_version 71430 (0.0008) [2023-10-08 02:49:20,307][52059] Updated weights for policy 1, policy_version 72332 (0.0010) [2023-10-08 02:49:20,595][52060] Updated weights for policy 0, policy_version 71440 (0.0009) [2023-10-08 02:49:20,673][52059] Updated weights for policy 1, policy_version 72342 (0.0009) [2023-10-08 02:49:20,964][52060] Updated weights for policy 0, policy_version 71450 (0.0008) [2023-10-08 02:49:21,038][52059] Updated weights for policy 1, policy_version 72352 (0.0010) [2023-10-08 02:49:21,210][50642] Fps is (10 sec: 19660.4, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 147259392. Throughput: 0: 1720.4, 1: 1753.6. Samples: 36812666. Policy #0 lag: (min: 6.0, avg: 13.0, max: 38.0) [2023-10-08 02:49:21,211][50642] Avg episode reward: [(0, '15.130'), (1, '21.310')] [2023-10-08 02:49:24,999][52059] Updated weights for policy 1, policy_version 72362 (0.0009) [2023-10-08 02:49:25,007][52060] Updated weights for policy 0, policy_version 71460 (0.0008) [2023-10-08 02:49:25,360][52059] Updated weights for policy 1, policy_version 72372 (0.0008) [2023-10-08 02:49:25,365][52060] Updated weights for policy 0, policy_version 71470 (0.0008) [2023-10-08 02:49:25,725][52059] Updated weights for policy 1, policy_version 72382 (0.0009) [2023-10-08 02:49:25,729][52060] Updated weights for policy 0, policy_version 71480 (0.0008) [2023-10-08 02:49:26,210][50642] Fps is (10 sec: 19661.0, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 147324928. Throughput: 0: 1720.0, 1: 1741.1. Samples: 36833616. Policy #0 lag: (min: 6.0, avg: 13.0, max: 38.0) [2023-10-08 02:49:26,211][50642] Avg episode reward: [(0, '15.670'), (1, '21.690')] [2023-10-08 02:49:29,659][52060] Updated weights for policy 0, policy_version 71490 (0.0008) [2023-10-08 02:49:29,698][52059] Updated weights for policy 1, policy_version 72392 (0.0007) [2023-10-08 02:49:30,033][52060] Updated weights for policy 0, policy_version 71500 (0.0007) [2023-10-08 02:49:30,051][52059] Updated weights for policy 1, policy_version 72402 (0.0007) [2023-10-08 02:49:30,391][52060] Updated weights for policy 0, policy_version 71510 (0.0009) [2023-10-08 02:49:30,418][52059] Updated weights for policy 1, policy_version 72412 (0.0008) [2023-10-08 02:49:30,765][52060] Updated weights for policy 0, policy_version 71520 (0.0011) [2023-10-08 02:49:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 147390464. Throughput: 0: 1700.0, 1: 1713.0. Samples: 36852536. Policy #0 lag: (min: 6.0, avg: 13.0, max: 38.0) [2023-10-08 02:49:31,211][50642] Avg episode reward: [(0, '20.550'), (1, '18.650')] [2023-10-08 02:49:34,292][52059] Updated weights for policy 1, policy_version 72422 (0.0008) [2023-10-08 02:49:34,653][52059] Updated weights for policy 1, policy_version 72432 (0.0007) [2023-10-08 02:49:34,861][52060] Updated weights for policy 0, policy_version 71530 (0.0008) [2023-10-08 02:49:35,018][52059] Updated weights for policy 1, policy_version 72442 (0.0007) [2023-10-08 02:49:35,232][52060] Updated weights for policy 0, policy_version 71540 (0.0008) [2023-10-08 02:49:35,608][52060] Updated weights for policy 0, policy_version 71550 (0.0011) [2023-10-08 02:49:36,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 147456000. Throughput: 0: 1728.0, 1: 1744.7. Samples: 36864512. Policy #0 lag: (min: 6.0, avg: 13.0, max: 38.0) [2023-10-08 02:49:36,211][50642] Avg episode reward: [(0, '21.120'), (1, '21.070')] [2023-10-08 02:49:38,940][52059] Updated weights for policy 1, policy_version 72452 (0.0010) [2023-10-08 02:49:39,310][52059] Updated weights for policy 1, policy_version 72462 (0.0009) [2023-10-08 02:49:39,602][52060] Updated weights for policy 0, policy_version 71560 (0.0010) [2023-10-08 02:49:39,667][52059] Updated weights for policy 1, policy_version 72472 (0.0008) [2023-10-08 02:49:39,972][52060] Updated weights for policy 0, policy_version 71570 (0.0007) [2023-10-08 02:49:40,343][52060] Updated weights for policy 0, policy_version 71580 (0.0009) [2023-10-08 02:49:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 147521536. Throughput: 0: 1713.8, 1: 1716.9. Samples: 36883930. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) [2023-10-08 02:49:41,211][50642] Avg episode reward: [(0, '19.550'), (1, '23.380')] [2023-10-08 02:49:43,531][52059] Updated weights for policy 1, policy_version 72482 (0.0007) [2023-10-08 02:49:43,880][52059] Updated weights for policy 1, policy_version 72492 (0.0008) [2023-10-08 02:49:44,248][52059] Updated weights for policy 1, policy_version 72502 (0.0010) [2023-10-08 02:49:44,415][52060] Updated weights for policy 0, policy_version 71590 (0.0010) [2023-10-08 02:49:44,616][52059] Updated weights for policy 1, policy_version 72512 (0.0008) [2023-10-08 02:49:44,806][52060] Updated weights for policy 0, policy_version 71600 (0.0009) [2023-10-08 02:49:45,179][52060] Updated weights for policy 0, policy_version 71610 (0.0010) [2023-10-08 02:49:46,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 147587072. Throughput: 0: 1690.1, 1: 1720.8. Samples: 36904120. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) [2023-10-08 02:49:46,211][50642] Avg episode reward: [(0, '19.280'), (1, '20.370')] [2023-10-08 02:49:48,544][52059] Updated weights for policy 1, policy_version 72522 (0.0007) [2023-10-08 02:49:48,901][52059] Updated weights for policy 1, policy_version 72532 (0.0008) [2023-10-08 02:49:49,157][52060] Updated weights for policy 0, policy_version 71620 (0.0009) [2023-10-08 02:49:49,260][52059] Updated weights for policy 1, policy_version 72542 (0.0008) [2023-10-08 02:49:49,532][52060] Updated weights for policy 0, policy_version 71630 (0.0008) [2023-10-08 02:49:49,895][52060] Updated weights for policy 0, policy_version 71640 (0.0007) [2023-10-08 02:49:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 147652608. Throughput: 0: 1712.0, 1: 1737.6. Samples: 36915300. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) [2023-10-08 02:49:51,211][50642] Avg episode reward: [(0, '20.940'), (1, '19.120')] [2023-10-08 02:49:53,019][52059] Updated weights for policy 1, policy_version 72552 (0.0008) [2023-10-08 02:49:53,381][52059] Updated weights for policy 1, policy_version 72562 (0.0008) [2023-10-08 02:49:53,751][52059] Updated weights for policy 1, policy_version 72572 (0.0008) [2023-10-08 02:49:53,906][52060] Updated weights for policy 0, policy_version 71650 (0.0009) [2023-10-08 02:49:54,272][52060] Updated weights for policy 0, policy_version 71660 (0.0009) [2023-10-08 02:49:54,639][52060] Updated weights for policy 0, policy_version 71670 (0.0009) [2023-10-08 02:49:54,999][52060] Updated weights for policy 0, policy_version 71680 (0.0008) [2023-10-08 02:49:56,211][50642] Fps is (10 sec: 13106.7, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 147718144. Throughput: 0: 1688.7, 1: 1728.9. Samples: 36935244. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) [2023-10-08 02:49:56,212][50642] Avg episode reward: [(0, '18.830'), (1, '21.640')] [2023-10-08 02:49:57,777][52059] Updated weights for policy 1, policy_version 72582 (0.0008) [2023-10-08 02:49:58,145][52059] Updated weights for policy 1, policy_version 72592 (0.0007) [2023-10-08 02:49:58,503][52059] Updated weights for policy 1, policy_version 72602 (0.0008) [2023-10-08 02:49:58,857][52060] Updated weights for policy 0, policy_version 71690 (0.0007) [2023-10-08 02:49:59,218][52060] Updated weights for policy 0, policy_version 71700 (0.0009) [2023-10-08 02:49:59,591][52060] Updated weights for policy 0, policy_version 71710 (0.0008) [2023-10-08 02:50:01,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 147783680. Throughput: 0: 1695.7, 1: 1737.7. Samples: 36956280. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) [2023-10-08 02:50:01,211][50642] Avg episode reward: [(0, '19.790'), (1, '24.830')] [2023-10-08 02:50:02,468][52059] Updated weights for policy 1, policy_version 72612 (0.0008) [2023-10-08 02:50:02,860][52059] Updated weights for policy 1, policy_version 72622 (0.0007) [2023-10-08 02:50:03,226][52059] Updated weights for policy 1, policy_version 72632 (0.0007) [2023-10-08 02:50:03,704][52060] Updated weights for policy 0, policy_version 71720 (0.0009) [2023-10-08 02:50:04,069][52060] Updated weights for policy 0, policy_version 71730 (0.0008) [2023-10-08 02:50:04,437][52060] Updated weights for policy 0, policy_version 71740 (0.0008) [2023-10-08 02:50:06,210][50642] Fps is (10 sec: 13107.7, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 147849216. Throughput: 0: 1699.8, 1: 1714.1. Samples: 36966292. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) [2023-10-08 02:50:06,211][50642] Avg episode reward: [(0, '20.490'), (1, '21.500')] [2023-10-08 02:50:07,118][52059] Updated weights for policy 1, policy_version 72642 (0.0009) [2023-10-08 02:50:07,488][52059] Updated weights for policy 1, policy_version 72652 (0.0011) [2023-10-08 02:50:07,854][52059] Updated weights for policy 1, policy_version 72662 (0.0008) [2023-10-08 02:50:08,210][52059] Updated weights for policy 1, policy_version 72672 (0.0008) [2023-10-08 02:50:08,399][52060] Updated weights for policy 0, policy_version 71750 (0.0008) [2023-10-08 02:50:08,776][52060] Updated weights for policy 0, policy_version 71760 (0.0008) [2023-10-08 02:50:09,138][52060] Updated weights for policy 0, policy_version 71770 (0.0009) [2023-10-08 02:50:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 147914752. Throughput: 0: 1680.8, 1: 1727.9. Samples: 36987008. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) [2023-10-08 02:50:11,211][50642] Avg episode reward: [(0, '18.060'), (1, '19.760')] [2023-10-08 02:50:12,009][52059] Updated weights for policy 1, policy_version 72682 (0.0009) [2023-10-08 02:50:12,367][52059] Updated weights for policy 1, policy_version 72692 (0.0008) [2023-10-08 02:50:12,734][52059] Updated weights for policy 1, policy_version 72702 (0.0008) [2023-10-08 02:50:13,054][52060] Updated weights for policy 0, policy_version 71780 (0.0009) [2023-10-08 02:50:13,420][52060] Updated weights for policy 0, policy_version 71790 (0.0009) [2023-10-08 02:50:13,793][52060] Updated weights for policy 0, policy_version 71800 (0.0012) [2023-10-08 02:50:16,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 147980288. Throughput: 0: 1705.4, 1: 1755.6. Samples: 37008282. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) [2023-10-08 02:50:16,211][50642] Avg episode reward: [(0, '19.380'), (1, '20.400')] [2023-10-08 02:50:16,744][52059] Updated weights for policy 1, policy_version 72712 (0.0010) [2023-10-08 02:50:17,102][52059] Updated weights for policy 1, policy_version 72722 (0.0007) [2023-10-08 02:50:17,476][52059] Updated weights for policy 1, policy_version 72732 (0.0010) [2023-10-08 02:50:17,921][52060] Updated weights for policy 0, policy_version 71810 (0.0010) [2023-10-08 02:50:18,288][52060] Updated weights for policy 0, policy_version 71820 (0.0011) [2023-10-08 02:50:18,658][52060] Updated weights for policy 0, policy_version 71830 (0.0008) [2023-10-08 02:50:19,024][52060] Updated weights for policy 0, policy_version 71840 (0.0010) [2023-10-08 02:50:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 148045824. Throughput: 0: 1683.5, 1: 1722.9. Samples: 37017802. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) [2023-10-08 02:50:21,211][50642] Avg episode reward: [(0, '18.410'), (1, '22.850')] [2023-10-08 02:50:21,538][52059] Updated weights for policy 1, policy_version 72742 (0.0011) [2023-10-08 02:50:21,907][52059] Updated weights for policy 1, policy_version 72752 (0.0011) [2023-10-08 02:50:22,266][52059] Updated weights for policy 1, policy_version 72762 (0.0008) [2023-10-08 02:50:22,887][52060] Updated weights for policy 0, policy_version 71850 (0.0009) [2023-10-08 02:50:23,254][52060] Updated weights for policy 0, policy_version 71860 (0.0008) [2023-10-08 02:50:23,625][52060] Updated weights for policy 0, policy_version 71870 (0.0010) [2023-10-08 02:50:26,167][52059] Updated weights for policy 1, policy_version 72772 (0.0008) [2023-10-08 02:50:26,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 148111360. Throughput: 0: 1695.3, 1: 1749.0. Samples: 37038926. Policy #0 lag: (min: 11.0, avg: 11.0, max: 11.0) [2023-10-08 02:50:26,211][50642] Avg episode reward: [(0, '19.970'), (1, '22.280')] [2023-10-08 02:50:26,537][52059] Updated weights for policy 1, policy_version 72782 (0.0007) [2023-10-08 02:50:26,900][52059] Updated weights for policy 1, policy_version 72792 (0.0009) [2023-10-08 02:50:27,664][52060] Updated weights for policy 0, policy_version 71880 (0.0009) [2023-10-08 02:50:28,030][52060] Updated weights for policy 0, policy_version 71890 (0.0008) [2023-10-08 02:50:28,407][52060] Updated weights for policy 0, policy_version 71900 (0.0009) [2023-10-08 02:50:30,845][52059] Updated weights for policy 1, policy_version 72802 (0.0010) [2023-10-08 02:50:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 148176896. Throughput: 0: 1724.6, 1: 1740.4. Samples: 37060046. Policy #0 lag: (min: 6.0, avg: 14.0, max: 38.0) [2023-10-08 02:50:31,211][50642] Avg episode reward: [(0, '18.280'), (1, '19.700')] [2023-10-08 02:50:31,214][52059] Updated weights for policy 1, policy_version 72812 (0.0010) [2023-10-08 02:50:31,580][52059] Updated weights for policy 1, policy_version 72822 (0.0007) [2023-10-08 02:50:31,942][52059] Updated weights for policy 1, policy_version 72832 (0.0008) [2023-10-08 02:50:32,296][52060] Updated weights for policy 0, policy_version 71910 (0.0008) [2023-10-08 02:50:32,670][52060] Updated weights for policy 0, policy_version 71920 (0.0007) [2023-10-08 02:50:33,053][52060] Updated weights for policy 0, policy_version 71930 (0.0007) [2023-10-08 02:50:35,931][52059] Updated weights for policy 1, policy_version 72842 (0.0007) [2023-10-08 02:50:36,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 148242432. Throughput: 0: 1698.1, 1: 1727.0. Samples: 37069430. Policy #0 lag: (min: 6.0, avg: 14.0, max: 38.0) [2023-10-08 02:50:36,211][50642] Avg episode reward: [(0, '18.160'), (1, '20.200')] [2023-10-08 02:50:36,284][52059] Updated weights for policy 1, policy_version 72852 (0.0007) [2023-10-08 02:50:36,645][52059] Updated weights for policy 1, policy_version 72862 (0.0009) [2023-10-08 02:50:37,022][52060] Updated weights for policy 0, policy_version 71940 (0.0009) [2023-10-08 02:50:37,399][52060] Updated weights for policy 0, policy_version 71950 (0.0009) [2023-10-08 02:50:37,769][52060] Updated weights for policy 0, policy_version 71960 (0.0008) [2023-10-08 02:50:40,529][52059] Updated weights for policy 1, policy_version 72872 (0.0008) [2023-10-08 02:50:40,893][52059] Updated weights for policy 1, policy_version 72882 (0.0008) [2023-10-08 02:50:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 148307968. Throughput: 0: 1721.7, 1: 1741.7. Samples: 37091096. Policy #0 lag: (min: 6.0, avg: 14.0, max: 38.0) [2023-10-08 02:50:41,211][50642] Avg episode reward: [(0, '17.590'), (1, '22.730')] [2023-10-08 02:50:41,249][52059] Updated weights for policy 1, policy_version 72892 (0.0008) [2023-10-08 02:50:41,617][52060] Updated weights for policy 0, policy_version 71970 (0.0008) [2023-10-08 02:50:41,985][52060] Updated weights for policy 0, policy_version 71980 (0.0011) [2023-10-08 02:50:42,358][52060] Updated weights for policy 0, policy_version 71990 (0.0009) [2023-10-08 02:50:42,732][52060] Updated weights for policy 0, policy_version 72000 (0.0007) [2023-10-08 02:50:45,141][52059] Updated weights for policy 1, policy_version 72902 (0.0007) [2023-10-08 02:50:45,499][52059] Updated weights for policy 1, policy_version 72912 (0.0011) [2023-10-08 02:50:45,858][52059] Updated weights for policy 1, policy_version 72922 (0.0011) [2023-10-08 02:50:46,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 148406272. Throughput: 0: 1726.3, 1: 1718.3. Samples: 37111288. Policy #0 lag: (min: 6.0, avg: 14.0, max: 38.0) [2023-10-08 02:50:46,211][50642] Avg episode reward: [(0, '20.460'), (1, '22.010')] [2023-10-08 02:50:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000072928_74678272.pth... [2023-10-08 02:50:46,221][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000072000_73728000.pth... [2023-10-08 02:50:46,251][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000071296_73007104.pth [2023-10-08 02:50:46,255][51710] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p1/milestones/checkpoint_000072928_74678272.pth [2023-10-08 02:50:46,265][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000070400_72089600.pth [2023-10-08 02:50:46,271][51605] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p0/milestones/checkpoint_000072000_73728000.pth [2023-10-08 02:50:46,806][52060] Updated weights for policy 0, policy_version 72010 (0.0007) [2023-10-08 02:50:47,174][52060] Updated weights for policy 0, policy_version 72020 (0.0007) [2023-10-08 02:50:47,531][52060] Updated weights for policy 0, policy_version 72030 (0.0008) [2023-10-08 02:50:49,848][52059] Updated weights for policy 1, policy_version 72932 (0.0011) [2023-10-08 02:50:50,237][52059] Updated weights for policy 1, policy_version 72942 (0.0008) [2023-10-08 02:50:50,608][52059] Updated weights for policy 1, policy_version 72952 (0.0008) [2023-10-08 02:50:51,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 148471808. Throughput: 0: 1708.0, 1: 1745.7. Samples: 37121708. Policy #0 lag: (min: 6.0, avg: 14.0, max: 38.0) [2023-10-08 02:50:51,211][50642] Avg episode reward: [(0, '19.180'), (1, '21.860')] [2023-10-08 02:50:51,556][52060] Updated weights for policy 0, policy_version 72040 (0.0009) [2023-10-08 02:50:51,924][52060] Updated weights for policy 0, policy_version 72050 (0.0010) [2023-10-08 02:50:52,300][52060] Updated weights for policy 0, policy_version 72060 (0.0007) [2023-10-08 02:50:54,562][52059] Updated weights for policy 1, policy_version 72962 (0.0009) [2023-10-08 02:50:54,929][52059] Updated weights for policy 1, policy_version 72972 (0.0009) [2023-10-08 02:50:55,296][52059] Updated weights for policy 1, policy_version 72982 (0.0009) [2023-10-08 02:50:55,661][52059] Updated weights for policy 1, policy_version 72992 (0.0010) [2023-10-08 02:50:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 148537344. Throughput: 0: 1724.5, 1: 1732.9. Samples: 37142590. Policy #0 lag: (min: 6.0, avg: 14.0, max: 38.0) [2023-10-08 02:50:56,211][50642] Avg episode reward: [(0, '19.810'), (1, '20.050')] [2023-10-08 02:50:56,224][52060] Updated weights for policy 0, policy_version 72070 (0.0009) [2023-10-08 02:50:56,591][52060] Updated weights for policy 0, policy_version 72080 (0.0011) [2023-10-08 02:50:56,972][52060] Updated weights for policy 0, policy_version 72090 (0.0009) [2023-10-08 02:50:59,659][52059] Updated weights for policy 1, policy_version 73002 (0.0007) [2023-10-08 02:51:00,019][52059] Updated weights for policy 1, policy_version 73012 (0.0010) [2023-10-08 02:51:00,386][52059] Updated weights for policy 1, policy_version 73022 (0.0010) [2023-10-08 02:51:00,953][52060] Updated weights for policy 0, policy_version 72100 (0.0010) [2023-10-08 02:51:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 148602880. Throughput: 0: 1726.6, 1: 1707.8. Samples: 37162832. Policy #0 lag: (min: 6.0, avg: 14.0, max: 38.0) [2023-10-08 02:51:01,211][50642] Avg episode reward: [(0, '21.840'), (1, '22.570')] [2023-10-08 02:51:01,319][52060] Updated weights for policy 0, policy_version 72110 (0.0010) [2023-10-08 02:51:01,683][52060] Updated weights for policy 0, policy_version 72120 (0.0010) [2023-10-08 02:51:04,322][52059] Updated weights for policy 1, policy_version 73032 (0.0009) [2023-10-08 02:51:04,691][52059] Updated weights for policy 1, policy_version 73042 (0.0008) [2023-10-08 02:51:05,059][52059] Updated weights for policy 1, policy_version 73052 (0.0008) [2023-10-08 02:51:05,708][52060] Updated weights for policy 0, policy_version 72130 (0.0010) [2023-10-08 02:51:06,068][52060] Updated weights for policy 0, policy_version 72140 (0.0008) [2023-10-08 02:51:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 148668416. Throughput: 0: 1722.4, 1: 1738.5. Samples: 37173546. Policy #0 lag: (min: 6.0, avg: 14.0, max: 38.0) [2023-10-08 02:51:06,211][50642] Avg episode reward: [(0, '18.130'), (1, '22.260')] [2023-10-08 02:51:06,435][52060] Updated weights for policy 0, policy_version 72150 (0.0009) [2023-10-08 02:51:06,811][52060] Updated weights for policy 0, policy_version 72160 (0.0008) [2023-10-08 02:51:09,056][52059] Updated weights for policy 1, policy_version 73062 (0.0007) [2023-10-08 02:51:09,421][52059] Updated weights for policy 1, policy_version 73072 (0.0008) [2023-10-08 02:51:09,791][52059] Updated weights for policy 1, policy_version 73082 (0.0010) [2023-10-08 02:51:10,774][52060] Updated weights for policy 0, policy_version 72170 (0.0009) [2023-10-08 02:51:11,148][52060] Updated weights for policy 0, policy_version 72180 (0.0007) [2023-10-08 02:51:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 148733952. Throughput: 0: 1726.5, 1: 1711.9. Samples: 37193654. Policy #0 lag: (min: 6.0, avg: 14.0, max: 38.0) [2023-10-08 02:51:11,211][50642] Avg episode reward: [(0, '18.670'), (1, '22.400')] [2023-10-08 02:51:11,523][52060] Updated weights for policy 0, policy_version 72190 (0.0008) [2023-10-08 02:51:13,645][52059] Updated weights for policy 1, policy_version 73092 (0.0010) [2023-10-08 02:51:14,018][52059] Updated weights for policy 1, policy_version 73102 (0.0009) [2023-10-08 02:51:14,378][52059] Updated weights for policy 1, policy_version 73112 (0.0008) [2023-10-08 02:51:15,529][52060] Updated weights for policy 0, policy_version 72200 (0.0009) [2023-10-08 02:51:15,902][52060] Updated weights for policy 0, policy_version 72210 (0.0007) [2023-10-08 02:51:16,211][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 148799488. Throughput: 0: 1711.2, 1: 1717.4. Samples: 37214334. Policy #0 lag: (min: 6.0, avg: 14.0, max: 38.0) [2023-10-08 02:51:16,212][50642] Avg episode reward: [(0, '21.700'), (1, '21.190')] [2023-10-08 02:51:16,270][52060] Updated weights for policy 0, policy_version 72220 (0.0009) [2023-10-08 02:51:18,330][52059] Updated weights for policy 1, policy_version 73122 (0.0008) [2023-10-08 02:51:18,694][52059] Updated weights for policy 1, policy_version 73132 (0.0011) [2023-10-08 02:51:19,060][52059] Updated weights for policy 1, policy_version 73142 (0.0009) [2023-10-08 02:51:19,435][52059] Updated weights for policy 1, policy_version 73152 (0.0009) [2023-10-08 02:51:20,390][52060] Updated weights for policy 0, policy_version 72230 (0.0009) [2023-10-08 02:51:20,768][52060] Updated weights for policy 0, policy_version 72240 (0.0009) [2023-10-08 02:51:21,136][52060] Updated weights for policy 0, policy_version 72250 (0.0008) [2023-10-08 02:51:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 148865024. Throughput: 0: 1723.6, 1: 1732.7. Samples: 37224962. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:51:21,211][50642] Avg episode reward: [(0, '20.710'), (1, '21.810')] [2023-10-08 02:51:23,448][52059] Updated weights for policy 1, policy_version 73162 (0.0010) [2023-10-08 02:51:23,812][52059] Updated weights for policy 1, policy_version 73172 (0.0008) [2023-10-08 02:51:24,185][52059] Updated weights for policy 1, policy_version 73182 (0.0007) [2023-10-08 02:51:24,934][52060] Updated weights for policy 0, policy_version 72260 (0.0008) [2023-10-08 02:51:25,301][52060] Updated weights for policy 0, policy_version 72270 (0.0008) [2023-10-08 02:51:25,675][52060] Updated weights for policy 0, policy_version 72280 (0.0010) [2023-10-08 02:51:26,210][50642] Fps is (10 sec: 16384.7, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 148963328. Throughput: 0: 1720.4, 1: 1709.7. Samples: 37245454. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:51:26,211][50642] Avg episode reward: [(0, '18.550'), (1, '22.560')] [2023-10-08 02:51:27,977][52059] Updated weights for policy 1, policy_version 73192 (0.0008) [2023-10-08 02:51:28,340][52059] Updated weights for policy 1, policy_version 73202 (0.0011) [2023-10-08 02:51:28,708][52059] Updated weights for policy 1, policy_version 73212 (0.0011) [2023-10-08 02:51:29,394][52060] Updated weights for policy 0, policy_version 72290 (0.0009) [2023-10-08 02:51:29,763][52060] Updated weights for policy 0, policy_version 72300 (0.0009) [2023-10-08 02:51:30,128][52060] Updated weights for policy 0, policy_version 72310 (0.0011) [2023-10-08 02:51:30,502][52060] Updated weights for policy 0, policy_version 72320 (0.0010) [2023-10-08 02:51:31,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 149028864. Throughput: 0: 1695.0, 1: 1735.6. Samples: 37265666. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:51:31,211][50642] Avg episode reward: [(0, '21.400'), (1, '22.970')] [2023-10-08 02:51:32,667][52059] Updated weights for policy 1, policy_version 73222 (0.0007) [2023-10-08 02:51:33,030][52059] Updated weights for policy 1, policy_version 73232 (0.0007) [2023-10-08 02:51:33,384][52059] Updated weights for policy 1, policy_version 73242 (0.0008) [2023-10-08 02:51:34,550][52060] Updated weights for policy 0, policy_version 72330 (0.0009) [2023-10-08 02:51:34,921][52060] Updated weights for policy 0, policy_version 72340 (0.0008) [2023-10-08 02:51:35,280][52060] Updated weights for policy 0, policy_version 72350 (0.0009) [2023-10-08 02:51:36,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 149094400. Throughput: 0: 1725.0, 1: 1711.8. Samples: 37276362. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:51:36,211][50642] Avg episode reward: [(0, '21.380'), (1, '20.470')] [2023-10-08 02:51:37,259][52059] Updated weights for policy 1, policy_version 73252 (0.0008) [2023-10-08 02:51:37,654][52059] Updated weights for policy 1, policy_version 73262 (0.0008) [2023-10-08 02:51:38,019][52059] Updated weights for policy 1, policy_version 73272 (0.0008) [2023-10-08 02:51:39,268][52060] Updated weights for policy 0, policy_version 72360 (0.0010) [2023-10-08 02:51:39,635][52060] Updated weights for policy 0, policy_version 72370 (0.0009) [2023-10-08 02:51:40,004][52060] Updated weights for policy 0, policy_version 72380 (0.0009) [2023-10-08 02:51:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 149159936. Throughput: 0: 1702.0, 1: 1729.2. Samples: 37296994. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:51:41,211][50642] Avg episode reward: [(0, '17.980'), (1, '22.020')] [2023-10-08 02:51:41,878][52059] Updated weights for policy 1, policy_version 73282 (0.0008) [2023-10-08 02:51:42,243][52059] Updated weights for policy 1, policy_version 73292 (0.0009) [2023-10-08 02:51:42,603][52059] Updated weights for policy 1, policy_version 73302 (0.0008) [2023-10-08 02:51:42,965][52059] Updated weights for policy 1, policy_version 73312 (0.0008) [2023-10-08 02:51:43,998][52060] Updated weights for policy 0, policy_version 72390 (0.0009) [2023-10-08 02:51:44,363][52060] Updated weights for policy 0, policy_version 72400 (0.0008) [2023-10-08 02:51:44,729][52060] Updated weights for policy 0, policy_version 72410 (0.0009) [2023-10-08 02:51:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 149225472. Throughput: 0: 1693.3, 1: 1753.5. Samples: 37317940. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:51:46,212][50642] Avg episode reward: [(0, '20.100'), (1, '22.300')] [2023-10-08 02:51:46,732][52059] Updated weights for policy 1, policy_version 73322 (0.0007) [2023-10-08 02:51:47,085][52059] Updated weights for policy 1, policy_version 73332 (0.0008) [2023-10-08 02:51:47,457][52059] Updated weights for policy 1, policy_version 73342 (0.0007) [2023-10-08 02:51:48,896][52060] Updated weights for policy 0, policy_version 72420 (0.0009) [2023-10-08 02:51:49,263][52060] Updated weights for policy 0, policy_version 72430 (0.0007) [2023-10-08 02:51:49,634][52060] Updated weights for policy 0, policy_version 72440 (0.0007) [2023-10-08 02:51:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 149291008. Throughput: 0: 1718.4, 1: 1724.3. Samples: 37328464. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:51:51,211][50642] Avg episode reward: [(0, '20.650'), (1, '22.410')] [2023-10-08 02:51:51,399][52059] Updated weights for policy 1, policy_version 73352 (0.0008) [2023-10-08 02:51:51,762][52059] Updated weights for policy 1, policy_version 73362 (0.0008) [2023-10-08 02:51:52,124][52059] Updated weights for policy 1, policy_version 73372 (0.0008) [2023-10-08 02:51:53,588][52060] Updated weights for policy 0, policy_version 72450 (0.0008) [2023-10-08 02:51:53,949][52060] Updated weights for policy 0, policy_version 72460 (0.0010) [2023-10-08 02:51:54,320][52060] Updated weights for policy 0, policy_version 72470 (0.0011) [2023-10-08 02:51:54,682][52060] Updated weights for policy 0, policy_version 72480 (0.0011) [2023-10-08 02:51:56,043][52059] Updated weights for policy 1, policy_version 73382 (0.0008) [2023-10-08 02:51:56,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 149356544. Throughput: 0: 1692.6, 1: 1757.4. Samples: 37348904. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:51:56,211][50642] Avg episode reward: [(0, '20.240'), (1, '21.080')] [2023-10-08 02:51:56,402][52059] Updated weights for policy 1, policy_version 73392 (0.0010) [2023-10-08 02:51:56,777][52059] Updated weights for policy 1, policy_version 73402 (0.0007) [2023-10-08 02:51:58,711][52060] Updated weights for policy 0, policy_version 72490 (0.0007) [2023-10-08 02:51:59,076][52060] Updated weights for policy 0, policy_version 72500 (0.0008) [2023-10-08 02:51:59,438][52060] Updated weights for policy 0, policy_version 72510 (0.0009) [2023-10-08 02:52:00,895][52059] Updated weights for policy 1, policy_version 73412 (0.0010) [2023-10-08 02:52:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 149422080. Throughput: 0: 1706.7, 1: 1749.2. Samples: 37369846. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:52:01,211][50642] Avg episode reward: [(0, '19.610'), (1, '22.710')] [2023-10-08 02:52:01,256][52059] Updated weights for policy 1, policy_version 73422 (0.0009) [2023-10-08 02:52:01,612][52059] Updated weights for policy 1, policy_version 73432 (0.0008) [2023-10-08 02:52:03,380][52060] Updated weights for policy 0, policy_version 72520 (0.0009) [2023-10-08 02:52:03,751][52060] Updated weights for policy 0, policy_version 72530 (0.0009) [2023-10-08 02:52:04,116][52060] Updated weights for policy 0, policy_version 72540 (0.0009) [2023-10-08 02:52:05,644][52059] Updated weights for policy 1, policy_version 73442 (0.0008) [2023-10-08 02:52:06,011][52059] Updated weights for policy 1, policy_version 73452 (0.0008) [2023-10-08 02:52:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 149487616. Throughput: 0: 1708.8, 1: 1734.4. Samples: 37379906. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:52:06,211][50642] Avg episode reward: [(0, '21.630'), (1, '21.990')] [2023-10-08 02:52:06,377][52059] Updated weights for policy 1, policy_version 73462 (0.0010) [2023-10-08 02:52:06,746][52059] Updated weights for policy 1, policy_version 73472 (0.0009) [2023-10-08 02:52:08,034][52060] Updated weights for policy 0, policy_version 72550 (0.0010) [2023-10-08 02:52:08,418][52060] Updated weights for policy 0, policy_version 72560 (0.0009) [2023-10-08 02:52:08,790][52060] Updated weights for policy 0, policy_version 72570 (0.0010) [2023-10-08 02:52:10,619][52059] Updated weights for policy 1, policy_version 73482 (0.0009) [2023-10-08 02:52:10,973][52059] Updated weights for policy 1, policy_version 73492 (0.0009) [2023-10-08 02:52:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 149553152. Throughput: 0: 1699.9, 1: 1751.9. Samples: 37400784. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:52:11,211][50642] Avg episode reward: [(0, '19.180'), (1, '21.690')] [2023-10-08 02:52:11,337][52059] Updated weights for policy 1, policy_version 73502 (0.0007) [2023-10-08 02:52:12,739][52060] Updated weights for policy 0, policy_version 72580 (0.0009) [2023-10-08 02:52:13,105][52060] Updated weights for policy 0, policy_version 72590 (0.0010) [2023-10-08 02:52:13,472][52060] Updated weights for policy 0, policy_version 72600 (0.0010) [2023-10-08 02:52:15,119][52059] Updated weights for policy 1, policy_version 73512 (0.0008) [2023-10-08 02:52:15,481][52059] Updated weights for policy 1, policy_version 73522 (0.0010) [2023-10-08 02:52:15,850][52059] Updated weights for policy 1, policy_version 73532 (0.0008) [2023-10-08 02:52:16,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.6, 300 sec: 13884.7). Total num frames: 149651456. Throughput: 0: 1726.0, 1: 1726.9. Samples: 37421046. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:52:16,211][50642] Avg episode reward: [(0, '19.820'), (1, '20.520')] [2023-10-08 02:52:17,241][52060] Updated weights for policy 0, policy_version 72610 (0.0009) [2023-10-08 02:52:17,616][52060] Updated weights for policy 0, policy_version 72620 (0.0007) [2023-10-08 02:52:17,983][52060] Updated weights for policy 0, policy_version 72630 (0.0008) [2023-10-08 02:52:18,365][52060] Updated weights for policy 0, policy_version 72640 (0.0011) [2023-10-08 02:52:19,828][52059] Updated weights for policy 1, policy_version 73542 (0.0008) [2023-10-08 02:52:20,188][52059] Updated weights for policy 1, policy_version 73552 (0.0009) [2023-10-08 02:52:20,549][52059] Updated weights for policy 1, policy_version 73562 (0.0009) [2023-10-08 02:52:21,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 149716992. Throughput: 0: 1697.9, 1: 1751.9. Samples: 37431600. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:52:21,211][50642] Avg episode reward: [(0, '19.870'), (1, '20.770')] [2023-10-08 02:52:22,369][52060] Updated weights for policy 0, policy_version 72650 (0.0008) [2023-10-08 02:52:22,747][52060] Updated weights for policy 0, policy_version 72660 (0.0008) [2023-10-08 02:52:23,117][52060] Updated weights for policy 0, policy_version 72670 (0.0008) [2023-10-08 02:52:24,534][52059] Updated weights for policy 1, policy_version 73572 (0.0009) [2023-10-08 02:52:24,940][52059] Updated weights for policy 1, policy_version 73582 (0.0008) [2023-10-08 02:52:25,305][52059] Updated weights for policy 1, policy_version 73592 (0.0010) [2023-10-08 02:52:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 149782528. Throughput: 0: 1719.2, 1: 1733.7. Samples: 37452370. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:52:26,210][50642] Avg episode reward: [(0, '21.990'), (1, '22.650')] [2023-10-08 02:52:27,173][52060] Updated weights for policy 0, policy_version 72680 (0.0007) [2023-10-08 02:52:27,542][52060] Updated weights for policy 0, policy_version 72690 (0.0008) [2023-10-08 02:52:27,912][52060] Updated weights for policy 0, policy_version 72700 (0.0007) [2023-10-08 02:52:29,105][52059] Updated weights for policy 1, policy_version 73602 (0.0010) [2023-10-08 02:52:29,468][52059] Updated weights for policy 1, policy_version 73612 (0.0007) [2023-10-08 02:52:29,827][52059] Updated weights for policy 1, policy_version 73622 (0.0007) [2023-10-08 02:52:30,189][52059] Updated weights for policy 1, policy_version 73632 (0.0008) [2023-10-08 02:52:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 149848064. Throughput: 0: 1728.4, 1: 1717.4. Samples: 37473002. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:52:31,211][50642] Avg episode reward: [(0, '18.840'), (1, '21.730')] [2023-10-08 02:52:31,949][52060] Updated weights for policy 0, policy_version 72710 (0.0008) [2023-10-08 02:52:32,320][52060] Updated weights for policy 0, policy_version 72720 (0.0009) [2023-10-08 02:52:32,694][52060] Updated weights for policy 0, policy_version 72730 (0.0011) [2023-10-08 02:52:33,968][52059] Updated weights for policy 1, policy_version 73642 (0.0008) [2023-10-08 02:52:34,341][52059] Updated weights for policy 1, policy_version 73652 (0.0010) [2023-10-08 02:52:34,708][52059] Updated weights for policy 1, policy_version 73662 (0.0007) [2023-10-08 02:52:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 149913600. Throughput: 0: 1701.1, 1: 1744.4. Samples: 37483512. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:52:36,211][50642] Avg episode reward: [(0, '20.510'), (1, '21.090')] [2023-10-08 02:52:36,566][52060] Updated weights for policy 0, policy_version 72740 (0.0010) [2023-10-08 02:52:36,936][52060] Updated weights for policy 0, policy_version 72750 (0.0007) [2023-10-08 02:52:37,308][52060] Updated weights for policy 0, policy_version 72760 (0.0007) [2023-10-08 02:52:38,616][52059] Updated weights for policy 1, policy_version 73672 (0.0008) [2023-10-08 02:52:38,983][52059] Updated weights for policy 1, policy_version 73682 (0.0010) [2023-10-08 02:52:39,345][52059] Updated weights for policy 1, policy_version 73692 (0.0008) [2023-10-08 02:52:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 149979136. Throughput: 0: 1723.3, 1: 1715.5. Samples: 37503650. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:52:41,211][50642] Avg episode reward: [(0, '21.310'), (1, '22.140')] [2023-10-08 02:52:41,273][52060] Updated weights for policy 0, policy_version 72770 (0.0007) [2023-10-08 02:52:41,644][52060] Updated weights for policy 0, policy_version 72780 (0.0010) [2023-10-08 02:52:42,014][52060] Updated weights for policy 0, policy_version 72790 (0.0008) [2023-10-08 02:52:42,382][52060] Updated weights for policy 0, policy_version 72800 (0.0010) [2023-10-08 02:52:43,354][52059] Updated weights for policy 1, policy_version 73702 (0.0009) [2023-10-08 02:52:43,721][52059] Updated weights for policy 1, policy_version 73712 (0.0008) [2023-10-08 02:52:44,087][52059] Updated weights for policy 1, policy_version 73722 (0.0010) [2023-10-08 02:52:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 150044672. Throughput: 0: 1723.8, 1: 1723.6. Samples: 37524978. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:52:46,211][50642] Avg episode reward: [(0, '20.890'), (1, '23.390')] [2023-10-08 02:52:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000073728_75497472.pth... [2023-10-08 02:52:46,223][52060] Updated weights for policy 0, policy_version 72810 (0.0008) [2023-10-08 02:52:46,254][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000072096_73826304.pth [2023-10-08 02:52:46,594][52060] Updated weights for policy 0, policy_version 72820 (0.0008) [2023-10-08 02:52:46,967][52060] Updated weights for policy 0, policy_version 72830 (0.0009) [2023-10-08 02:52:47,032][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000072832_74579968.pth... [2023-10-08 02:52:47,062][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000071200_72908800.pth [2023-10-08 02:52:47,975][52059] Updated weights for policy 1, policy_version 73732 (0.0009) [2023-10-08 02:52:48,335][52059] Updated weights for policy 1, policy_version 73742 (0.0007) [2023-10-08 02:52:48,696][52059] Updated weights for policy 1, policy_version 73752 (0.0010) [2023-10-08 02:52:50,976][52060] Updated weights for policy 0, policy_version 72840 (0.0008) [2023-10-08 02:52:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 150110208. Throughput: 0: 1712.7, 1: 1728.4. Samples: 37534754. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:52:51,211][50642] Avg episode reward: [(0, '18.760'), (1, '23.600')] [2023-10-08 02:52:51,347][52060] Updated weights for policy 0, policy_version 72850 (0.0008) [2023-10-08 02:52:51,718][52060] Updated weights for policy 0, policy_version 72860 (0.0008) [2023-10-08 02:52:52,738][52059] Updated weights for policy 1, policy_version 73762 (0.0009) [2023-10-08 02:52:53,103][52059] Updated weights for policy 1, policy_version 73772 (0.0009) [2023-10-08 02:52:53,464][52059] Updated weights for policy 1, policy_version 73782 (0.0007) [2023-10-08 02:52:53,823][52059] Updated weights for policy 1, policy_version 73792 (0.0009) [2023-10-08 02:52:55,751][52060] Updated weights for policy 0, policy_version 72870 (0.0008) [2023-10-08 02:52:56,139][52060] Updated weights for policy 0, policy_version 72880 (0.0007) [2023-10-08 02:52:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 150175744. Throughput: 0: 1723.0, 1: 1717.9. Samples: 37555624. Policy #0 lag: (min: 9.0, avg: 14.7, max: 41.0) [2023-10-08 02:52:56,211][50642] Avg episode reward: [(0, '20.750'), (1, '22.850')] [2023-10-08 02:52:56,524][52060] Updated weights for policy 0, policy_version 72890 (0.0010) [2023-10-08 02:52:57,836][52059] Updated weights for policy 1, policy_version 73802 (0.0008) [2023-10-08 02:52:58,199][52059] Updated weights for policy 1, policy_version 73812 (0.0011) [2023-10-08 02:52:58,565][52059] Updated weights for policy 1, policy_version 73822 (0.0008) [2023-10-08 02:53:00,394][52060] Updated weights for policy 0, policy_version 72900 (0.0009) [2023-10-08 02:53:00,765][52060] Updated weights for policy 0, policy_version 72910 (0.0009) [2023-10-08 02:53:01,131][52060] Updated weights for policy 0, policy_version 72920 (0.0009) [2023-10-08 02:53:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 150241280. Throughput: 0: 1708.9, 1: 1743.9. Samples: 37576422. Policy #0 lag: (min: 9.0, avg: 14.7, max: 41.0) [2023-10-08 02:53:01,211][50642] Avg episode reward: [(0, '22.470'), (1, '21.850')] [2023-10-08 02:53:02,529][52059] Updated weights for policy 1, policy_version 73832 (0.0010) [2023-10-08 02:53:02,891][52059] Updated weights for policy 1, policy_version 73842 (0.0008) [2023-10-08 02:53:03,260][52059] Updated weights for policy 1, policy_version 73852 (0.0007) [2023-10-08 02:53:05,144][52060] Updated weights for policy 0, policy_version 72930 (0.0009) [2023-10-08 02:53:05,513][52060] Updated weights for policy 0, policy_version 72940 (0.0008) [2023-10-08 02:53:05,887][52060] Updated weights for policy 0, policy_version 72950 (0.0007) [2023-10-08 02:53:06,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 150306816. Throughput: 0: 1719.8, 1: 1718.3. Samples: 37586316. Policy #0 lag: (min: 9.0, avg: 14.7, max: 41.0) [2023-10-08 02:53:06,211][50642] Avg episode reward: [(0, '21.020'), (1, '22.810')] [2023-10-08 02:53:06,251][52060] Updated weights for policy 0, policy_version 72960 (0.0008) [2023-10-08 02:53:06,985][52059] Updated weights for policy 1, policy_version 73862 (0.0009) [2023-10-08 02:53:07,354][52059] Updated weights for policy 1, policy_version 73872 (0.0007) [2023-10-08 02:53:07,727][52059] Updated weights for policy 1, policy_version 73882 (0.0007) [2023-10-08 02:53:10,362][52060] Updated weights for policy 0, policy_version 72970 (0.0009) [2023-10-08 02:53:10,719][52060] Updated weights for policy 0, policy_version 72980 (0.0008) [2023-10-08 02:53:11,101][52060] Updated weights for policy 0, policy_version 72990 (0.0009) [2023-10-08 02:53:11,210][50642] Fps is (10 sec: 16384.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 150405120. Throughput: 0: 1724.2, 1: 1734.4. Samples: 37608008. Policy #0 lag: (min: 9.0, avg: 14.7, max: 41.0) [2023-10-08 02:53:11,210][50642] Avg episode reward: [(0, '19.220'), (1, '23.420')] [2023-10-08 02:53:11,726][52059] Updated weights for policy 1, policy_version 73892 (0.0008) [2023-10-08 02:53:12,128][52059] Updated weights for policy 1, policy_version 73902 (0.0009) [2023-10-08 02:53:12,500][52059] Updated weights for policy 1, policy_version 73912 (0.0010) [2023-10-08 02:53:14,912][52060] Updated weights for policy 0, policy_version 73000 (0.0008) [2023-10-08 02:53:15,279][52060] Updated weights for policy 0, policy_version 73010 (0.0008) [2023-10-08 02:53:15,650][52060] Updated weights for policy 0, policy_version 73020 (0.0009) [2023-10-08 02:53:16,210][50642] Fps is (10 sec: 16384.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 150470656. Throughput: 0: 1698.1, 1: 1746.5. Samples: 37628008. Policy #0 lag: (min: 9.0, avg: 14.7, max: 41.0) [2023-10-08 02:53:16,211][50642] Avg episode reward: [(0, '20.150'), (1, '22.020')] [2023-10-08 02:53:16,500][52059] Updated weights for policy 1, policy_version 73922 (0.0008) [2023-10-08 02:53:16,863][52059] Updated weights for policy 1, policy_version 73932 (0.0008) [2023-10-08 02:53:17,225][52059] Updated weights for policy 1, policy_version 73942 (0.0008) [2023-10-08 02:53:17,593][52059] Updated weights for policy 1, policy_version 73952 (0.0007) [2023-10-08 02:53:19,524][52060] Updated weights for policy 0, policy_version 73030 (0.0011) [2023-10-08 02:53:19,887][52060] Updated weights for policy 0, policy_version 73040 (0.0010) [2023-10-08 02:53:20,261][52060] Updated weights for policy 0, policy_version 73050 (0.0008) [2023-10-08 02:53:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 150536192. Throughput: 0: 1732.8, 1: 1718.9. Samples: 37638840. Policy #0 lag: (min: 9.0, avg: 14.7, max: 41.0) [2023-10-08 02:53:21,211][50642] Avg episode reward: [(0, '21.850'), (1, '22.650')] [2023-10-08 02:53:21,510][52059] Updated weights for policy 1, policy_version 73962 (0.0007) [2023-10-08 02:53:21,871][52059] Updated weights for policy 1, policy_version 73972 (0.0007) [2023-10-08 02:53:22,239][52059] Updated weights for policy 1, policy_version 73982 (0.0008) [2023-10-08 02:53:24,219][52060] Updated weights for policy 0, policy_version 73060 (0.0008) [2023-10-08 02:53:24,584][52060] Updated weights for policy 0, policy_version 73070 (0.0010) [2023-10-08 02:53:24,948][52060] Updated weights for policy 0, policy_version 73080 (0.0008) [2023-10-08 02:53:25,968][52059] Updated weights for policy 1, policy_version 73992 (0.0008) [2023-10-08 02:53:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 150601728. Throughput: 0: 1717.9, 1: 1748.6. Samples: 37659642. Policy #0 lag: (min: 9.0, avg: 14.7, max: 41.0) [2023-10-08 02:53:26,211][50642] Avg episode reward: [(0, '20.210'), (1, '24.460')] [2023-10-08 02:53:26,330][52059] Updated weights for policy 1, policy_version 74002 (0.0007) [2023-10-08 02:53:26,697][52059] Updated weights for policy 1, policy_version 74012 (0.0007) [2023-10-08 02:53:28,774][52060] Updated weights for policy 0, policy_version 73090 (0.0008) [2023-10-08 02:53:29,147][52060] Updated weights for policy 0, policy_version 73100 (0.0011) [2023-10-08 02:53:29,517][52060] Updated weights for policy 0, policy_version 73110 (0.0009) [2023-10-08 02:53:29,886][52060] Updated weights for policy 0, policy_version 73120 (0.0008) [2023-10-08 02:53:30,641][52059] Updated weights for policy 1, policy_version 74022 (0.0009) [2023-10-08 02:53:31,007][52059] Updated weights for policy 1, policy_version 74032 (0.0009) [2023-10-08 02:53:31,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 150667264. Throughput: 0: 1710.0, 1: 1739.0. Samples: 37680182. Policy #0 lag: (min: 9.0, avg: 14.7, max: 41.0) [2023-10-08 02:53:31,211][50642] Avg episode reward: [(0, '20.810'), (1, '25.300')] [2023-10-08 02:53:31,360][52059] Updated weights for policy 1, policy_version 74042 (0.0008) [2023-10-08 02:53:33,746][52060] Updated weights for policy 0, policy_version 73130 (0.0010) [2023-10-08 02:53:34,104][52060] Updated weights for policy 0, policy_version 73140 (0.0007) [2023-10-08 02:53:34,484][52060] Updated weights for policy 0, policy_version 73150 (0.0007) [2023-10-08 02:53:35,309][52059] Updated weights for policy 1, policy_version 74052 (0.0008) [2023-10-08 02:53:35,671][52059] Updated weights for policy 1, policy_version 74062 (0.0010) [2023-10-08 02:53:36,029][52059] Updated weights for policy 1, policy_version 74072 (0.0009) [2023-10-08 02:53:36,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 150732800. Throughput: 0: 1726.8, 1: 1745.3. Samples: 37690998. Policy #0 lag: (min: 9.0, avg: 14.7, max: 41.0) [2023-10-08 02:53:36,211][50642] Avg episode reward: [(0, '20.480'), (1, '25.280')] [2023-10-08 02:53:38,490][52060] Updated weights for policy 0, policy_version 73160 (0.0007) [2023-10-08 02:53:38,857][52060] Updated weights for policy 0, policy_version 73170 (0.0008) [2023-10-08 02:53:39,227][52060] Updated weights for policy 0, policy_version 73180 (0.0008) [2023-10-08 02:53:39,902][52059] Updated weights for policy 1, policy_version 74082 (0.0010) [2023-10-08 02:53:40,260][52059] Updated weights for policy 1, policy_version 74092 (0.0009) [2023-10-08 02:53:40,630][52059] Updated weights for policy 1, policy_version 74102 (0.0009) [2023-10-08 02:53:40,985][52059] Updated weights for policy 1, policy_version 74112 (0.0008) [2023-10-08 02:53:41,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 150831104. Throughput: 0: 1711.8, 1: 1757.3. Samples: 37711734. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:53:41,211][50642] Avg episode reward: [(0, '21.000'), (1, '23.470')] [2023-10-08 02:53:43,320][52060] Updated weights for policy 0, policy_version 73190 (0.0009) [2023-10-08 02:53:43,703][52060] Updated weights for policy 0, policy_version 73200 (0.0008) [2023-10-08 02:53:44,068][52060] Updated weights for policy 0, policy_version 73210 (0.0008) [2023-10-08 02:53:44,880][52059] Updated weights for policy 1, policy_version 74122 (0.0011) [2023-10-08 02:53:45,248][52059] Updated weights for policy 1, policy_version 74132 (0.0010) [2023-10-08 02:53:45,611][52059] Updated weights for policy 1, policy_version 74142 (0.0009) [2023-10-08 02:53:46,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 150896640. Throughput: 0: 1722.2, 1: 1729.8. Samples: 37731762. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:53:46,211][50642] Avg episode reward: [(0, '20.570'), (1, '21.650')] [2023-10-08 02:53:48,022][52060] Updated weights for policy 0, policy_version 73220 (0.0007) [2023-10-08 02:53:48,394][52060] Updated weights for policy 0, policy_version 73230 (0.0007) [2023-10-08 02:53:48,757][52060] Updated weights for policy 0, policy_version 73240 (0.0007) [2023-10-08 02:53:49,668][52059] Updated weights for policy 1, policy_version 74152 (0.0008) [2023-10-08 02:53:50,036][52059] Updated weights for policy 1, policy_version 74162 (0.0007) [2023-10-08 02:53:50,393][52059] Updated weights for policy 1, policy_version 74172 (0.0009) [2023-10-08 02:53:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 150962176. Throughput: 0: 1718.3, 1: 1759.9. Samples: 37742832. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:53:51,211][50642] Avg episode reward: [(0, '20.870'), (1, '25.180')] [2023-10-08 02:53:52,750][52060] Updated weights for policy 0, policy_version 73250 (0.0008) [2023-10-08 02:53:53,116][52060] Updated weights for policy 0, policy_version 73260 (0.0007) [2023-10-08 02:53:53,479][52060] Updated weights for policy 0, policy_version 73270 (0.0007) [2023-10-08 02:53:53,847][52060] Updated weights for policy 0, policy_version 73280 (0.0007) [2023-10-08 02:53:54,186][52059] Updated weights for policy 1, policy_version 74182 (0.0009) [2023-10-08 02:53:54,554][52059] Updated weights for policy 1, policy_version 74192 (0.0010) [2023-10-08 02:53:54,923][52059] Updated weights for policy 1, policy_version 74202 (0.0009) [2023-10-08 02:53:56,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 151027712. Throughput: 0: 1714.1, 1: 1733.7. Samples: 37763160. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:53:56,211][50642] Avg episode reward: [(0, '22.710'), (1, '24.130')] [2023-10-08 02:53:57,697][52060] Updated weights for policy 0, policy_version 73290 (0.0007) [2023-10-08 02:53:58,068][52060] Updated weights for policy 0, policy_version 73300 (0.0009) [2023-10-08 02:53:58,431][52060] Updated weights for policy 0, policy_version 73310 (0.0009) [2023-10-08 02:53:58,919][52059] Updated weights for policy 1, policy_version 74212 (0.0009) [2023-10-08 02:53:59,290][52059] Updated weights for policy 1, policy_version 74222 (0.0007) [2023-10-08 02:53:59,657][52059] Updated weights for policy 1, policy_version 74232 (0.0007) [2023-10-08 02:54:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 151093248. Throughput: 0: 1743.9, 1: 1729.7. Samples: 37784322. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:54:01,211][50642] Avg episode reward: [(0, '20.190'), (1, '22.260')] [2023-10-08 02:54:02,368][52060] Updated weights for policy 0, policy_version 73320 (0.0009) [2023-10-08 02:54:02,739][52060] Updated weights for policy 0, policy_version 73330 (0.0007) [2023-10-08 02:54:03,118][52060] Updated weights for policy 0, policy_version 73340 (0.0007) [2023-10-08 02:54:03,481][52059] Updated weights for policy 1, policy_version 74242 (0.0008) [2023-10-08 02:54:03,844][52059] Updated weights for policy 1, policy_version 74252 (0.0009) [2023-10-08 02:54:04,216][52059] Updated weights for policy 1, policy_version 74262 (0.0009) [2023-10-08 02:54:04,574][52059] Updated weights for policy 1, policy_version 74272 (0.0010) [2023-10-08 02:54:06,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 151158784. Throughput: 0: 1706.7, 1: 1750.4. Samples: 37794412. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:54:06,211][50642] Avg episode reward: [(0, '19.560'), (1, '20.750')] [2023-10-08 02:54:06,877][52060] Updated weights for policy 0, policy_version 73350 (0.0009) [2023-10-08 02:54:07,240][52060] Updated weights for policy 0, policy_version 73360 (0.0008) [2023-10-08 02:54:07,613][52060] Updated weights for policy 0, policy_version 73370 (0.0010) [2023-10-08 02:54:08,483][52059] Updated weights for policy 1, policy_version 74282 (0.0010) [2023-10-08 02:54:08,847][52059] Updated weights for policy 1, policy_version 74292 (0.0009) [2023-10-08 02:54:09,209][52059] Updated weights for policy 1, policy_version 74302 (0.0008) [2023-10-08 02:54:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 151224320. Throughput: 0: 1732.9, 1: 1723.5. Samples: 37815182. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:54:11,211][50642] Avg episode reward: [(0, '21.230'), (1, '24.100')] [2023-10-08 02:54:11,593][52060] Updated weights for policy 0, policy_version 73380 (0.0007) [2023-10-08 02:54:11,961][52060] Updated weights for policy 0, policy_version 73390 (0.0008) [2023-10-08 02:54:12,322][52060] Updated weights for policy 0, policy_version 73400 (0.0007) [2023-10-08 02:54:13,067][52059] Updated weights for policy 1, policy_version 74312 (0.0009) [2023-10-08 02:54:13,438][52059] Updated weights for policy 1, policy_version 74322 (0.0009) [2023-10-08 02:54:13,807][52059] Updated weights for policy 1, policy_version 74332 (0.0007) [2023-10-08 02:54:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 151289856. Throughput: 0: 1738.1, 1: 1737.3. Samples: 37836572. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:54:16,211][50642] Avg episode reward: [(0, '19.620'), (1, '23.610')] [2023-10-08 02:54:16,329][52060] Updated weights for policy 0, policy_version 73410 (0.0007) [2023-10-08 02:54:16,689][52060] Updated weights for policy 0, policy_version 73420 (0.0007) [2023-10-08 02:54:17,069][52060] Updated weights for policy 0, policy_version 73430 (0.0007) [2023-10-08 02:54:17,430][52060] Updated weights for policy 0, policy_version 73440 (0.0007) [2023-10-08 02:54:17,533][52059] Updated weights for policy 1, policy_version 74342 (0.0008) [2023-10-08 02:54:17,901][52059] Updated weights for policy 1, policy_version 74352 (0.0007) [2023-10-08 02:54:18,255][52059] Updated weights for policy 1, policy_version 74362 (0.0007) [2023-10-08 02:54:21,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 151355392. Throughput: 0: 1720.2, 1: 1727.6. Samples: 37846146. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:54:21,211][50642] Avg episode reward: [(0, '19.630'), (1, '22.640')] [2023-10-08 02:54:21,484][52060] Updated weights for policy 0, policy_version 73450 (0.0010) [2023-10-08 02:54:21,839][52060] Updated weights for policy 0, policy_version 73460 (0.0011) [2023-10-08 02:54:22,169][52059] Updated weights for policy 1, policy_version 74372 (0.0008) [2023-10-08 02:54:22,214][52060] Updated weights for policy 0, policy_version 73470 (0.0009) [2023-10-08 02:54:22,540][52059] Updated weights for policy 1, policy_version 74382 (0.0010) [2023-10-08 02:54:22,912][52059] Updated weights for policy 1, policy_version 74392 (0.0009) [2023-10-08 02:54:26,048][52060] Updated weights for policy 0, policy_version 73480 (0.0010) [2023-10-08 02:54:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 151420928. Throughput: 0: 1733.8, 1: 1729.7. Samples: 37867592. Policy #0 lag: (min: 19.0, avg: 19.0, max: 19.0) [2023-10-08 02:54:26,211][50642] Avg episode reward: [(0, '19.530'), (1, '20.910')] [2023-10-08 02:54:26,419][52060] Updated weights for policy 0, policy_version 73490 (0.0011) [2023-10-08 02:54:26,784][52060] Updated weights for policy 0, policy_version 73500 (0.0008) [2023-10-08 02:54:26,798][52059] Updated weights for policy 1, policy_version 74402 (0.0008) [2023-10-08 02:54:27,160][52059] Updated weights for policy 1, policy_version 74412 (0.0007) [2023-10-08 02:54:27,531][52059] Updated weights for policy 1, policy_version 74422 (0.0008) [2023-10-08 02:54:27,890][52059] Updated weights for policy 1, policy_version 74432 (0.0009) [2023-10-08 02:54:30,889][52060] Updated weights for policy 0, policy_version 73510 (0.0007) [2023-10-08 02:54:31,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 151486464. Throughput: 0: 1727.2, 1: 1756.2. Samples: 37888516. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:54:31,211][50642] Avg episode reward: [(0, '22.000'), (1, '23.890')] [2023-10-08 02:54:31,271][52060] Updated weights for policy 0, policy_version 73520 (0.0009) [2023-10-08 02:54:31,639][52060] Updated weights for policy 0, policy_version 73530 (0.0008) [2023-10-08 02:54:31,884][52059] Updated weights for policy 1, policy_version 74442 (0.0009) [2023-10-08 02:54:32,250][52059] Updated weights for policy 1, policy_version 74452 (0.0009) [2023-10-08 02:54:32,612][52059] Updated weights for policy 1, policy_version 74462 (0.0008) [2023-10-08 02:54:35,609][52060] Updated weights for policy 0, policy_version 73540 (0.0010) [2023-10-08 02:54:35,978][52060] Updated weights for policy 0, policy_version 73550 (0.0009) [2023-10-08 02:54:36,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 151552000. Throughput: 0: 1724.1, 1: 1727.5. Samples: 37898152. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:54:36,211][50642] Avg episode reward: [(0, '21.070'), (1, '21.920')] [2023-10-08 02:54:36,347][52060] Updated weights for policy 0, policy_version 73560 (0.0008) [2023-10-08 02:54:36,479][52059] Updated weights for policy 1, policy_version 74472 (0.0008) [2023-10-08 02:54:36,842][52059] Updated weights for policy 1, policy_version 74482 (0.0010) [2023-10-08 02:54:37,205][52059] Updated weights for policy 1, policy_version 74492 (0.0009) [2023-10-08 02:54:40,282][52060] Updated weights for policy 0, policy_version 73570 (0.0008) [2023-10-08 02:54:40,646][52060] Updated weights for policy 0, policy_version 73580 (0.0007) [2023-10-08 02:54:41,009][52060] Updated weights for policy 0, policy_version 73590 (0.0009) [2023-10-08 02:54:41,198][52059] Updated weights for policy 1, policy_version 74502 (0.0009) [2023-10-08 02:54:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 151617536. Throughput: 0: 1726.6, 1: 1748.9. Samples: 37919554. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:54:41,211][50642] Avg episode reward: [(0, '21.920'), (1, '24.400')] [2023-10-08 02:54:41,377][52060] Updated weights for policy 0, policy_version 73600 (0.0007) [2023-10-08 02:54:41,561][52059] Updated weights for policy 1, policy_version 74512 (0.0009) [2023-10-08 02:54:41,938][52059] Updated weights for policy 1, policy_version 74522 (0.0008) [2023-10-08 02:54:45,466][52060] Updated weights for policy 0, policy_version 73610 (0.0009) [2023-10-08 02:54:45,838][52060] Updated weights for policy 0, policy_version 73620 (0.0009) [2023-10-08 02:54:45,868][52059] Updated weights for policy 1, policy_version 74532 (0.0007) [2023-10-08 02:54:46,207][52060] Updated weights for policy 0, policy_version 73630 (0.0008) [2023-10-08 02:54:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 151683072. Throughput: 0: 1704.9, 1: 1755.9. Samples: 37940056. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:54:46,211][50642] Avg episode reward: [(0, '21.850'), (1, '22.440')] [2023-10-08 02:54:46,264][52059] Updated weights for policy 1, policy_version 74542 (0.0008) [2023-10-08 02:54:46,275][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000073632_75399168.pth... [2023-10-08 02:54:46,305][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000072000_73728000.pth [2023-10-08 02:54:46,625][52059] Updated weights for policy 1, policy_version 74552 (0.0008) [2023-10-08 02:54:46,907][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000074560_76349440.pth... [2023-10-08 02:54:46,945][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000072928_74678272.pth [2023-10-08 02:54:50,187][52060] Updated weights for policy 0, policy_version 73640 (0.0008) [2023-10-08 02:54:50,553][52060] Updated weights for policy 0, policy_version 73650 (0.0010) [2023-10-08 02:54:50,618][52059] Updated weights for policy 1, policy_version 74562 (0.0009) [2023-10-08 02:54:50,922][52060] Updated weights for policy 0, policy_version 73660 (0.0009) [2023-10-08 02:54:50,984][52059] Updated weights for policy 1, policy_version 74572 (0.0009) [2023-10-08 02:54:51,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 151781376. Throughput: 0: 1730.5, 1: 1736.8. Samples: 37950440. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:54:51,212][50642] Avg episode reward: [(0, '20.360'), (1, '22.440')] [2023-10-08 02:54:51,353][52059] Updated weights for policy 1, policy_version 74582 (0.0009) [2023-10-08 02:54:51,728][52059] Updated weights for policy 1, policy_version 74592 (0.0008) [2023-10-08 02:54:54,952][52060] Updated weights for policy 0, policy_version 73670 (0.0009) [2023-10-08 02:54:55,317][52060] Updated weights for policy 0, policy_version 73680 (0.0008) [2023-10-08 02:54:55,548][52059] Updated weights for policy 1, policy_version 74602 (0.0009) [2023-10-08 02:54:55,679][52060] Updated weights for policy 0, policy_version 73690 (0.0007) [2023-10-08 02:54:55,905][52059] Updated weights for policy 1, policy_version 74612 (0.0009) [2023-10-08 02:54:56,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 151846912. Throughput: 0: 1714.0, 1: 1764.0. Samples: 37971690. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:54:56,211][50642] Avg episode reward: [(0, '20.070'), (1, '22.880')] [2023-10-08 02:54:56,283][52059] Updated weights for policy 1, policy_version 74622 (0.0007) [2023-10-08 02:54:59,594][52060] Updated weights for policy 0, policy_version 73700 (0.0008) [2023-10-08 02:54:59,961][52060] Updated weights for policy 0, policy_version 73710 (0.0007) [2023-10-08 02:55:00,220][52059] Updated weights for policy 1, policy_version 74632 (0.0008) [2023-10-08 02:55:00,327][52060] Updated weights for policy 0, policy_version 73720 (0.0007) [2023-10-08 02:55:00,587][52059] Updated weights for policy 1, policy_version 74642 (0.0008) [2023-10-08 02:55:00,956][52059] Updated weights for policy 1, policy_version 74652 (0.0011) [2023-10-08 02:55:01,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 151945216. Throughput: 0: 1693.2, 1: 1738.4. Samples: 37990996. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:55:01,211][50642] Avg episode reward: [(0, '21.940'), (1, '24.830')] [2023-10-08 02:55:04,358][52060] Updated weights for policy 0, policy_version 73730 (0.0008) [2023-10-08 02:55:04,725][52060] Updated weights for policy 0, policy_version 73740 (0.0007) [2023-10-08 02:55:04,926][52059] Updated weights for policy 1, policy_version 74662 (0.0009) [2023-10-08 02:55:05,097][52060] Updated weights for policy 0, policy_version 73750 (0.0008) [2023-10-08 02:55:05,277][52059] Updated weights for policy 1, policy_version 74672 (0.0008) [2023-10-08 02:55:05,463][52060] Updated weights for policy 0, policy_version 73760 (0.0011) [2023-10-08 02:55:05,640][52059] Updated weights for policy 1, policy_version 74682 (0.0007) [2023-10-08 02:55:06,210][50642] Fps is (10 sec: 16383.6, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 152010752. Throughput: 0: 1720.7, 1: 1756.8. Samples: 38002636. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:55:06,211][50642] Avg episode reward: [(0, '21.920'), (1, '24.160')] [2023-10-08 02:55:09,458][52060] Updated weights for policy 0, policy_version 73770 (0.0010) [2023-10-08 02:55:09,582][52059] Updated weights for policy 1, policy_version 74692 (0.0008) [2023-10-08 02:55:09,826][52060] Updated weights for policy 0, policy_version 73780 (0.0007) [2023-10-08 02:55:09,946][52059] Updated weights for policy 1, policy_version 74702 (0.0007) [2023-10-08 02:55:10,192][52060] Updated weights for policy 0, policy_version 73790 (0.0008) [2023-10-08 02:55:10,305][52059] Updated weights for policy 1, policy_version 74712 (0.0007) [2023-10-08 02:55:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 152076288. Throughput: 0: 1703.1, 1: 1746.9. Samples: 38022842. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:55:11,211][50642] Avg episode reward: [(0, '20.340'), (1, '21.620')] [2023-10-08 02:55:14,005][52060] Updated weights for policy 0, policy_version 73800 (0.0008) [2023-10-08 02:55:14,183][52059] Updated weights for policy 1, policy_version 74722 (0.0010) [2023-10-08 02:55:14,371][52060] Updated weights for policy 0, policy_version 73810 (0.0008) [2023-10-08 02:55:14,555][52059] Updated weights for policy 1, policy_version 74732 (0.0007) [2023-10-08 02:55:14,727][52060] Updated weights for policy 0, policy_version 73820 (0.0009) [2023-10-08 02:55:14,926][52059] Updated weights for policy 1, policy_version 74742 (0.0007) [2023-10-08 02:55:15,280][52059] Updated weights for policy 1, policy_version 74752 (0.0009) [2023-10-08 02:55:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 152141824. Throughput: 0: 1705.3, 1: 1722.8. Samples: 38042780. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) [2023-10-08 02:55:16,212][50642] Avg episode reward: [(0, '20.330'), (1, '22.560')] [2023-10-08 02:55:18,766][52060] Updated weights for policy 0, policy_version 73830 (0.0008) [2023-10-08 02:55:19,134][52060] Updated weights for policy 0, policy_version 73840 (0.0008) [2023-10-08 02:55:19,318][52059] Updated weights for policy 1, policy_version 74762 (0.0009) [2023-10-08 02:55:19,496][52060] Updated weights for policy 0, policy_version 73850 (0.0008) [2023-10-08 02:55:19,697][52059] Updated weights for policy 1, policy_version 74772 (0.0008) [2023-10-08 02:55:20,054][52059] Updated weights for policy 1, policy_version 74782 (0.0008) [2023-10-08 02:55:21,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 152207360. Throughput: 0: 1723.4, 1: 1748.7. Samples: 38054398. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) [2023-10-08 02:55:21,211][50642] Avg episode reward: [(0, '21.230'), (1, '24.410')] [2023-10-08 02:55:23,527][52060] Updated weights for policy 0, policy_version 73860 (0.0008) [2023-10-08 02:55:23,903][52060] Updated weights for policy 0, policy_version 73870 (0.0010) [2023-10-08 02:55:23,982][52059] Updated weights for policy 1, policy_version 74792 (0.0007) [2023-10-08 02:55:24,265][52060] Updated weights for policy 0, policy_version 73880 (0.0007) [2023-10-08 02:55:24,350][52059] Updated weights for policy 1, policy_version 74802 (0.0008) [2023-10-08 02:55:24,718][52059] Updated weights for policy 1, policy_version 74812 (0.0010) [2023-10-08 02:55:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 152272896. Throughput: 0: 1704.4, 1: 1719.3. Samples: 38073620. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) [2023-10-08 02:55:26,211][50642] Avg episode reward: [(0, '20.420'), (1, '24.790')] [2023-10-08 02:55:28,095][52060] Updated weights for policy 0, policy_version 73890 (0.0008) [2023-10-08 02:55:28,470][52060] Updated weights for policy 0, policy_version 73900 (0.0008) [2023-10-08 02:55:28,601][52059] Updated weights for policy 1, policy_version 74822 (0.0008) [2023-10-08 02:55:28,832][52060] Updated weights for policy 0, policy_version 73910 (0.0008) [2023-10-08 02:55:28,964][52059] Updated weights for policy 1, policy_version 74832 (0.0007) [2023-10-08 02:55:29,197][52060] Updated weights for policy 0, policy_version 73920 (0.0008) [2023-10-08 02:55:29,324][52059] Updated weights for policy 1, policy_version 74842 (0.0010) [2023-10-08 02:55:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 152338432. Throughput: 0: 1722.7, 1: 1717.5. Samples: 38094862. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) [2023-10-08 02:55:31,211][50642] Avg episode reward: [(0, '19.620'), (1, '22.330')] [2023-10-08 02:55:33,107][52060] Updated weights for policy 0, policy_version 73930 (0.0008) [2023-10-08 02:55:33,401][52059] Updated weights for policy 1, policy_version 74852 (0.0009) [2023-10-08 02:55:33,477][52060] Updated weights for policy 0, policy_version 73940 (0.0010) [2023-10-08 02:55:33,801][52059] Updated weights for policy 1, policy_version 74862 (0.0008) [2023-10-08 02:55:33,850][52060] Updated weights for policy 0, policy_version 73950 (0.0009) [2023-10-08 02:55:34,157][52059] Updated weights for policy 1, policy_version 74872 (0.0009) [2023-10-08 02:55:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 152403968. Throughput: 0: 1706.4, 1: 1726.9. Samples: 38104936. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) [2023-10-08 02:55:36,211][50642] Avg episode reward: [(0, '22.850'), (1, '23.260')] [2023-10-08 02:55:37,763][52060] Updated weights for policy 0, policy_version 73960 (0.0008) [2023-10-08 02:55:38,037][52059] Updated weights for policy 1, policy_version 74882 (0.0008) [2023-10-08 02:55:38,133][52060] Updated weights for policy 0, policy_version 73970 (0.0007) [2023-10-08 02:55:38,398][52059] Updated weights for policy 1, policy_version 74892 (0.0008) [2023-10-08 02:55:38,492][52060] Updated weights for policy 0, policy_version 73980 (0.0009) [2023-10-08 02:55:38,766][52059] Updated weights for policy 1, policy_version 74902 (0.0009) [2023-10-08 02:55:39,128][52059] Updated weights for policy 1, policy_version 74912 (0.0007) [2023-10-08 02:55:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 152469504. Throughput: 0: 1710.2, 1: 1706.1. Samples: 38125426. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) [2023-10-08 02:55:41,211][50642] Avg episode reward: [(0, '21.990'), (1, '24.110')] [2023-10-08 02:55:42,642][52060] Updated weights for policy 0, policy_version 73990 (0.0008) [2023-10-08 02:55:42,999][52060] Updated weights for policy 0, policy_version 74000 (0.0007) [2023-10-08 02:55:43,179][52059] Updated weights for policy 1, policy_version 74922 (0.0008) [2023-10-08 02:55:43,372][52060] Updated weights for policy 0, policy_version 74010 (0.0007) [2023-10-08 02:55:43,543][52059] Updated weights for policy 1, policy_version 74932 (0.0010) [2023-10-08 02:55:43,917][52059] Updated weights for policy 1, policy_version 74942 (0.0010) [2023-10-08 02:55:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 152535040. Throughput: 0: 1732.1, 1: 1732.6. Samples: 38146910. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) [2023-10-08 02:55:46,211][50642] Avg episode reward: [(0, '20.310'), (1, '24.170')] [2023-10-08 02:55:47,303][52060] Updated weights for policy 0, policy_version 74020 (0.0008) [2023-10-08 02:55:47,685][52060] Updated weights for policy 0, policy_version 74030 (0.0010) [2023-10-08 02:55:47,784][52059] Updated weights for policy 1, policy_version 74952 (0.0008) [2023-10-08 02:55:48,055][52060] Updated weights for policy 0, policy_version 74040 (0.0007) [2023-10-08 02:55:48,150][52059] Updated weights for policy 1, policy_version 74962 (0.0008) [2023-10-08 02:55:48,517][52059] Updated weights for policy 1, policy_version 74972 (0.0009) [2023-10-08 02:55:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 152600576. Throughput: 0: 1701.6, 1: 1708.3. Samples: 38156080. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) [2023-10-08 02:55:51,211][50642] Avg episode reward: [(0, '19.460'), (1, '23.360')] [2023-10-08 02:55:52,066][52060] Updated weights for policy 0, policy_version 74050 (0.0008) [2023-10-08 02:55:52,404][52059] Updated weights for policy 1, policy_version 74982 (0.0009) [2023-10-08 02:55:52,439][52060] Updated weights for policy 0, policy_version 74060 (0.0008) [2023-10-08 02:55:52,770][52059] Updated weights for policy 1, policy_version 74992 (0.0010) [2023-10-08 02:55:52,806][52060] Updated weights for policy 0, policy_version 74070 (0.0007) [2023-10-08 02:55:53,132][52059] Updated weights for policy 1, policy_version 75002 (0.0007) [2023-10-08 02:55:53,172][52060] Updated weights for policy 0, policy_version 74080 (0.0007) [2023-10-08 02:55:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 152666112. Throughput: 0: 1721.5, 1: 1710.6. Samples: 38177284. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) [2023-10-08 02:55:56,211][50642] Avg episode reward: [(0, '22.730'), (1, '23.670')] [2023-10-08 02:55:57,034][52059] Updated weights for policy 1, policy_version 75012 (0.0009) [2023-10-08 02:55:57,140][52060] Updated weights for policy 0, policy_version 74090 (0.0009) [2023-10-08 02:55:57,388][52059] Updated weights for policy 1, policy_version 75022 (0.0008) [2023-10-08 02:55:57,515][52060] Updated weights for policy 0, policy_version 74100 (0.0008) [2023-10-08 02:55:57,754][52059] Updated weights for policy 1, policy_version 75032 (0.0008) [2023-10-08 02:55:57,879][52060] Updated weights for policy 0, policy_version 74110 (0.0010) [2023-10-08 02:56:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 152731648. Throughput: 0: 1727.8, 1: 1731.6. Samples: 38198452. Policy #0 lag: (min: 30.0, avg: 30.0, max: 30.0) [2023-10-08 02:56:01,211][50642] Avg episode reward: [(0, '20.290'), (1, '22.740')] [2023-10-08 02:56:01,761][52059] Updated weights for policy 1, policy_version 75042 (0.0009) [2023-10-08 02:56:01,982][52060] Updated weights for policy 0, policy_version 74120 (0.0007) [2023-10-08 02:56:02,132][52059] Updated weights for policy 1, policy_version 75052 (0.0009) [2023-10-08 02:56:02,346][52060] Updated weights for policy 0, policy_version 74130 (0.0008) [2023-10-08 02:56:02,500][52059] Updated weights for policy 1, policy_version 75062 (0.0007) [2023-10-08 02:56:02,722][52060] Updated weights for policy 0, policy_version 74140 (0.0008) [2023-10-08 02:56:02,863][52059] Updated weights for policy 1, policy_version 75072 (0.0008) [2023-10-08 02:56:06,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 152797184. Throughput: 0: 1706.8, 1: 1704.4. Samples: 38207898. Policy #0 lag: (min: 19.0, avg: 38.5, max: 40.0) [2023-10-08 02:56:06,211][50642] Avg episode reward: [(0, '19.330'), (1, '23.640')] [2023-10-08 02:56:06,659][52060] Updated weights for policy 0, policy_version 74150 (0.0009) [2023-10-08 02:56:06,745][52059] Updated weights for policy 1, policy_version 75082 (0.0008) [2023-10-08 02:56:07,050][52060] Updated weights for policy 0, policy_version 74160 (0.0008) [2023-10-08 02:56:07,113][52059] Updated weights for policy 1, policy_version 75092 (0.0008) [2023-10-08 02:56:07,416][52060] Updated weights for policy 0, policy_version 74170 (0.0007) [2023-10-08 02:56:07,466][52059] Updated weights for policy 1, policy_version 75102 (0.0007) [2023-10-08 02:56:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 152862720. Throughput: 0: 1721.4, 1: 1735.0. Samples: 38229158. Policy #0 lag: (min: 19.0, avg: 38.5, max: 40.0) [2023-10-08 02:56:11,211][50642] Avg episode reward: [(0, '19.340'), (1, '23.410')] [2023-10-08 02:56:11,302][52060] Updated weights for policy 0, policy_version 74180 (0.0009) [2023-10-08 02:56:11,484][52059] Updated weights for policy 1, policy_version 75112 (0.0007) [2023-10-08 02:56:11,672][52060] Updated weights for policy 0, policy_version 74190 (0.0009) [2023-10-08 02:56:11,839][52059] Updated weights for policy 1, policy_version 75122 (0.0008) [2023-10-08 02:56:12,050][52060] Updated weights for policy 0, policy_version 74200 (0.0008) [2023-10-08 02:56:12,206][52059] Updated weights for policy 1, policy_version 75132 (0.0007) [2023-10-08 02:56:16,122][52059] Updated weights for policy 1, policy_version 75142 (0.0008) [2023-10-08 02:56:16,201][52060] Updated weights for policy 0, policy_version 74210 (0.0008) [2023-10-08 02:56:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 152928256. Throughput: 0: 1712.8, 1: 1736.0. Samples: 38250058. Policy #0 lag: (min: 19.0, avg: 38.5, max: 40.0) [2023-10-08 02:56:16,211][50642] Avg episode reward: [(0, '21.330'), (1, '23.840')] [2023-10-08 02:56:16,480][52059] Updated weights for policy 1, policy_version 75152 (0.0008) [2023-10-08 02:56:16,570][52060] Updated weights for policy 0, policy_version 74220 (0.0007) [2023-10-08 02:56:16,835][52059] Updated weights for policy 1, policy_version 75162 (0.0007) [2023-10-08 02:56:16,946][52060] Updated weights for policy 0, policy_version 74230 (0.0008) [2023-10-08 02:56:17,317][52060] Updated weights for policy 0, policy_version 74240 (0.0009) [2023-10-08 02:56:20,715][52059] Updated weights for policy 1, policy_version 75172 (0.0008) [2023-10-08 02:56:21,104][52059] Updated weights for policy 1, policy_version 75182 (0.0008) [2023-10-08 02:56:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 152993792. Throughput: 0: 1708.8, 1: 1729.8. Samples: 38259674. Policy #0 lag: (min: 19.0, avg: 38.5, max: 40.0) [2023-10-08 02:56:21,211][50642] Avg episode reward: [(0, '19.890'), (1, '22.040')] [2023-10-08 02:56:21,361][52060] Updated weights for policy 0, policy_version 74250 (0.0008) [2023-10-08 02:56:21,459][52059] Updated weights for policy 1, policy_version 75192 (0.0007) [2023-10-08 02:56:21,720][52060] Updated weights for policy 0, policy_version 74260 (0.0008) [2023-10-08 02:56:22,092][52060] Updated weights for policy 0, policy_version 74270 (0.0010) [2023-10-08 02:56:25,286][52059] Updated weights for policy 1, policy_version 75202 (0.0007) [2023-10-08 02:56:25,659][52059] Updated weights for policy 1, policy_version 75212 (0.0008) [2023-10-08 02:56:26,022][52059] Updated weights for policy 1, policy_version 75222 (0.0007) [2023-10-08 02:56:26,060][52060] Updated weights for policy 0, policy_version 74280 (0.0009) [2023-10-08 02:56:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 153059328. Throughput: 0: 1708.1, 1: 1743.1. Samples: 38280730. Policy #0 lag: (min: 19.0, avg: 38.5, max: 40.0) [2023-10-08 02:56:26,211][50642] Avg episode reward: [(0, '19.290'), (1, '23.450')] [2023-10-08 02:56:26,396][52059] Updated weights for policy 1, policy_version 75232 (0.0007) [2023-10-08 02:56:26,418][52060] Updated weights for policy 0, policy_version 74290 (0.0008) [2023-10-08 02:56:26,794][52060] Updated weights for policy 0, policy_version 74300 (0.0007) [2023-10-08 02:56:30,186][52059] Updated weights for policy 1, policy_version 75242 (0.0009) [2023-10-08 02:56:30,543][52059] Updated weights for policy 1, policy_version 75252 (0.0009) [2023-10-08 02:56:30,787][52060] Updated weights for policy 0, policy_version 74310 (0.0009) [2023-10-08 02:56:30,913][52059] Updated weights for policy 1, policy_version 75262 (0.0009) [2023-10-08 02:56:31,152][52060] Updated weights for policy 0, policy_version 74320 (0.0009) [2023-10-08 02:56:31,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 153157632. Throughput: 0: 1699.1, 1: 1718.4. Samples: 38300694. Policy #0 lag: (min: 19.0, avg: 38.5, max: 40.0) [2023-10-08 02:56:31,211][50642] Avg episode reward: [(0, '19.090'), (1, '23.480')] [2023-10-08 02:56:31,523][52060] Updated weights for policy 0, policy_version 74330 (0.0011) [2023-10-08 02:56:34,946][52059] Updated weights for policy 1, policy_version 75272 (0.0011) [2023-10-08 02:56:35,310][52059] Updated weights for policy 1, policy_version 75282 (0.0010) [2023-10-08 02:56:35,544][52060] Updated weights for policy 0, policy_version 74340 (0.0009) [2023-10-08 02:56:35,676][52059] Updated weights for policy 1, policy_version 75292 (0.0008) [2023-10-08 02:56:35,911][52060] Updated weights for policy 0, policy_version 74350 (0.0008) [2023-10-08 02:56:36,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 153223168. Throughput: 0: 1708.4, 1: 1750.3. Samples: 38311722. Policy #0 lag: (min: 19.0, avg: 38.5, max: 40.0) [2023-10-08 02:56:36,211][50642] Avg episode reward: [(0, '20.590'), (1, '26.160')] [2023-10-08 02:56:36,275][52060] Updated weights for policy 0, policy_version 74360 (0.0009) [2023-10-08 02:56:39,496][52059] Updated weights for policy 1, policy_version 75302 (0.0007) [2023-10-08 02:56:39,855][52059] Updated weights for policy 1, policy_version 75312 (0.0007) [2023-10-08 02:56:40,214][52059] Updated weights for policy 1, policy_version 75322 (0.0007) [2023-10-08 02:56:40,239][52060] Updated weights for policy 0, policy_version 74370 (0.0009) [2023-10-08 02:56:40,603][52060] Updated weights for policy 0, policy_version 74380 (0.0008) [2023-10-08 02:56:40,969][52060] Updated weights for policy 0, policy_version 74390 (0.0008) [2023-10-08 02:56:41,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 153288704. Throughput: 0: 1703.0, 1: 1741.4. Samples: 38332282. Policy #0 lag: (min: 19.0, avg: 38.5, max: 40.0) [2023-10-08 02:56:41,211][50642] Avg episode reward: [(0, '20.900'), (1, '23.560')] [2023-10-08 02:56:41,334][52060] Updated weights for policy 0, policy_version 74400 (0.0008) [2023-10-08 02:56:44,002][52059] Updated weights for policy 1, policy_version 75332 (0.0009) [2023-10-08 02:56:44,376][52059] Updated weights for policy 1, policy_version 75342 (0.0007) [2023-10-08 02:56:44,737][52059] Updated weights for policy 1, policy_version 75352 (0.0010) [2023-10-08 02:56:45,194][52060] Updated weights for policy 0, policy_version 74410 (0.0008) [2023-10-08 02:56:45,558][52060] Updated weights for policy 0, policy_version 74420 (0.0010) [2023-10-08 02:56:45,927][52060] Updated weights for policy 0, policy_version 74430 (0.0010) [2023-10-08 02:56:46,210][50642] Fps is (10 sec: 16383.5, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 153387008. Throughput: 0: 1685.1, 1: 1730.7. Samples: 38352166. Policy #0 lag: (min: 19.0, avg: 38.5, max: 40.0) [2023-10-08 02:56:46,211][50642] Avg episode reward: [(0, '20.270'), (1, '21.450')] [2023-10-08 02:56:46,222][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000075360_77168640.pth... [2023-10-08 02:56:46,222][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000074432_76218368.pth... [2023-10-08 02:56:46,252][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000073728_75497472.pth [2023-10-08 02:56:46,258][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000072832_74579968.pth [2023-10-08 02:56:48,656][52059] Updated weights for policy 1, policy_version 75362 (0.0009) [2023-10-08 02:56:49,023][52059] Updated weights for policy 1, policy_version 75372 (0.0007) [2023-10-08 02:56:49,382][52059] Updated weights for policy 1, policy_version 75382 (0.0007) [2023-10-08 02:56:49,747][52059] Updated weights for policy 1, policy_version 75392 (0.0009) [2023-10-08 02:56:49,792][52060] Updated weights for policy 0, policy_version 74440 (0.0009) [2023-10-08 02:56:50,157][52060] Updated weights for policy 0, policy_version 74450 (0.0007) [2023-10-08 02:56:50,528][52060] Updated weights for policy 0, policy_version 74460 (0.0011) [2023-10-08 02:56:51,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 153452544. Throughput: 0: 1704.9, 1: 1752.2. Samples: 38363466. Policy #0 lag: (min: 20.0, avg: 29.5, max: 52.0) [2023-10-08 02:56:51,211][50642] Avg episode reward: [(0, '19.690'), (1, '23.980')] [2023-10-08 02:56:53,797][52059] Updated weights for policy 1, policy_version 75402 (0.0008) [2023-10-08 02:56:54,159][52059] Updated weights for policy 1, policy_version 75412 (0.0008) [2023-10-08 02:56:54,524][52060] Updated weights for policy 0, policy_version 74470 (0.0008) [2023-10-08 02:56:54,536][52059] Updated weights for policy 1, policy_version 75422 (0.0009) [2023-10-08 02:56:54,915][52060] Updated weights for policy 0, policy_version 74480 (0.0008) [2023-10-08 02:56:55,282][52060] Updated weights for policy 0, policy_version 74490 (0.0009) [2023-10-08 02:56:56,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 153518080. Throughput: 0: 1696.1, 1: 1726.4. Samples: 38383170. Policy #0 lag: (min: 20.0, avg: 29.5, max: 52.0) [2023-10-08 02:56:56,211][50642] Avg episode reward: [(0, '22.770'), (1, '24.110')] [2023-10-08 02:56:58,384][52059] Updated weights for policy 1, policy_version 75432 (0.0007) [2023-10-08 02:56:58,758][52059] Updated weights for policy 1, policy_version 75442 (0.0007) [2023-10-08 02:56:59,119][52059] Updated weights for policy 1, policy_version 75452 (0.0008) [2023-10-08 02:56:59,149][52060] Updated weights for policy 0, policy_version 74500 (0.0009) [2023-10-08 02:56:59,532][52060] Updated weights for policy 0, policy_version 74510 (0.0010) [2023-10-08 02:56:59,894][52060] Updated weights for policy 0, policy_version 74520 (0.0009) [2023-10-08 02:57:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 153583616. Throughput: 0: 1684.9, 1: 1725.4. Samples: 38403524. Policy #0 lag: (min: 20.0, avg: 29.5, max: 52.0) [2023-10-08 02:57:01,211][50642] Avg episode reward: [(0, '19.760'), (1, '24.900')] [2023-10-08 02:57:03,212][52059] Updated weights for policy 1, policy_version 75462 (0.0008) [2023-10-08 02:57:03,580][52059] Updated weights for policy 1, policy_version 75472 (0.0010) [2023-10-08 02:57:03,858][52060] Updated weights for policy 0, policy_version 74530 (0.0009) [2023-10-08 02:57:03,949][52059] Updated weights for policy 1, policy_version 75482 (0.0009) [2023-10-08 02:57:04,232][52060] Updated weights for policy 0, policy_version 74540 (0.0008) [2023-10-08 02:57:04,592][52060] Updated weights for policy 0, policy_version 74550 (0.0010) [2023-10-08 02:57:04,958][52060] Updated weights for policy 0, policy_version 74560 (0.0009) [2023-10-08 02:57:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 153649152. Throughput: 0: 1710.2, 1: 1729.2. Samples: 38414450. Policy #0 lag: (min: 20.0, avg: 29.5, max: 52.0) [2023-10-08 02:57:06,211][50642] Avg episode reward: [(0, '19.020'), (1, '21.400')] [2023-10-08 02:57:07,743][52059] Updated weights for policy 1, policy_version 75492 (0.0008) [2023-10-08 02:57:08,112][52059] Updated weights for policy 1, policy_version 75502 (0.0008) [2023-10-08 02:57:08,482][52059] Updated weights for policy 1, policy_version 75512 (0.0007) [2023-10-08 02:57:09,189][52060] Updated weights for policy 0, policy_version 74570 (0.0009) [2023-10-08 02:57:09,553][52060] Updated weights for policy 0, policy_version 74580 (0.0007) [2023-10-08 02:57:09,929][52060] Updated weights for policy 0, policy_version 74590 (0.0008) [2023-10-08 02:57:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 153714688. Throughput: 0: 1687.9, 1: 1728.6. Samples: 38434472. Policy #0 lag: (min: 20.0, avg: 29.5, max: 52.0) [2023-10-08 02:57:11,211][50642] Avg episode reward: [(0, '20.570'), (1, '22.190')] [2023-10-08 02:57:12,379][52059] Updated weights for policy 1, policy_version 75522 (0.0008) [2023-10-08 02:57:12,750][52059] Updated weights for policy 1, policy_version 75532 (0.0009) [2023-10-08 02:57:13,116][52059] Updated weights for policy 1, policy_version 75542 (0.0009) [2023-10-08 02:57:13,481][52059] Updated weights for policy 1, policy_version 75552 (0.0007) [2023-10-08 02:57:13,841][52060] Updated weights for policy 0, policy_version 74600 (0.0008) [2023-10-08 02:57:14,216][52060] Updated weights for policy 0, policy_version 74610 (0.0008) [2023-10-08 02:57:14,588][52060] Updated weights for policy 0, policy_version 74620 (0.0009) [2023-10-08 02:57:16,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 153780224. Throughput: 0: 1694.2, 1: 1753.3. Samples: 38455830. Policy #0 lag: (min: 20.0, avg: 29.5, max: 52.0) [2023-10-08 02:57:16,211][50642] Avg episode reward: [(0, '23.830'), (1, '23.010')] [2023-10-08 02:57:17,183][52059] Updated weights for policy 1, policy_version 75562 (0.0007) [2023-10-08 02:57:17,543][52059] Updated weights for policy 1, policy_version 75572 (0.0009) [2023-10-08 02:57:17,899][52059] Updated weights for policy 1, policy_version 75582 (0.0009) [2023-10-08 02:57:18,664][52060] Updated weights for policy 0, policy_version 74630 (0.0009) [2023-10-08 02:57:19,035][52060] Updated weights for policy 0, policy_version 74640 (0.0010) [2023-10-08 02:57:19,411][52060] Updated weights for policy 0, policy_version 74650 (0.0009) [2023-10-08 02:57:21,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 153845760. Throughput: 0: 1707.6, 1: 1725.2. Samples: 38466198. Policy #0 lag: (min: 20.0, avg: 29.5, max: 52.0) [2023-10-08 02:57:21,211][50642] Avg episode reward: [(0, '18.180'), (1, '25.650')] [2023-10-08 02:57:21,836][52059] Updated weights for policy 1, policy_version 75592 (0.0010) [2023-10-08 02:57:22,203][52059] Updated weights for policy 1, policy_version 75602 (0.0008) [2023-10-08 02:57:22,568][52059] Updated weights for policy 1, policy_version 75612 (0.0010) [2023-10-08 02:57:23,461][52060] Updated weights for policy 0, policy_version 74660 (0.0009) [2023-10-08 02:57:23,821][52060] Updated weights for policy 0, policy_version 74670 (0.0010) [2023-10-08 02:57:24,188][52060] Updated weights for policy 0, policy_version 74680 (0.0010) [2023-10-08 02:57:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 153911296. Throughput: 0: 1695.3, 1: 1739.7. Samples: 38486858. Policy #0 lag: (min: 20.0, avg: 29.5, max: 52.0) [2023-10-08 02:57:26,211][50642] Avg episode reward: [(0, '19.500'), (1, '21.480')] [2023-10-08 02:57:26,662][52059] Updated weights for policy 1, policy_version 75622 (0.0009) [2023-10-08 02:57:27,022][52059] Updated weights for policy 1, policy_version 75632 (0.0007) [2023-10-08 02:57:27,384][52059] Updated weights for policy 1, policy_version 75642 (0.0007) [2023-10-08 02:57:27,915][52060] Updated weights for policy 0, policy_version 74690 (0.0009) [2023-10-08 02:57:28,284][52060] Updated weights for policy 0, policy_version 74700 (0.0009) [2023-10-08 02:57:28,644][52060] Updated weights for policy 0, policy_version 74710 (0.0007) [2023-10-08 02:57:29,013][52060] Updated weights for policy 0, policy_version 74720 (0.0010) [2023-10-08 02:57:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 153976832. Throughput: 0: 1719.3, 1: 1749.5. Samples: 38508260. Policy #0 lag: (min: 20.0, avg: 29.5, max: 52.0) [2023-10-08 02:57:31,211][50642] Avg episode reward: [(0, '20.360'), (1, '24.170')] [2023-10-08 02:57:31,418][52059] Updated weights for policy 1, policy_version 75652 (0.0007) [2023-10-08 02:57:31,783][52059] Updated weights for policy 1, policy_version 75662 (0.0007) [2023-10-08 02:57:32,148][52059] Updated weights for policy 1, policy_version 75672 (0.0007) [2023-10-08 02:57:32,865][52060] Updated weights for policy 0, policy_version 74730 (0.0008) [2023-10-08 02:57:33,235][52060] Updated weights for policy 0, policy_version 74740 (0.0009) [2023-10-08 02:57:33,606][52060] Updated weights for policy 0, policy_version 74750 (0.0008) [2023-10-08 02:57:35,989][52059] Updated weights for policy 1, policy_version 75682 (0.0009) [2023-10-08 02:57:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 154042368. Throughput: 0: 1699.8, 1: 1727.9. Samples: 38517714. Policy #0 lag: (min: 20.0, avg: 29.5, max: 52.0) [2023-10-08 02:57:36,211][50642] Avg episode reward: [(0, '21.320'), (1, '22.690')] [2023-10-08 02:57:36,353][52059] Updated weights for policy 1, policy_version 75692 (0.0011) [2023-10-08 02:57:36,720][52059] Updated weights for policy 1, policy_version 75702 (0.0008) [2023-10-08 02:57:37,089][52059] Updated weights for policy 1, policy_version 75712 (0.0008) [2023-10-08 02:57:37,517][52060] Updated weights for policy 0, policy_version 74760 (0.0009) [2023-10-08 02:57:37,888][52060] Updated weights for policy 0, policy_version 74770 (0.0007) [2023-10-08 02:57:38,264][52060] Updated weights for policy 0, policy_version 74780 (0.0008) [2023-10-08 02:57:40,952][52059] Updated weights for policy 1, policy_version 75722 (0.0009) [2023-10-08 02:57:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 154107904. Throughput: 0: 1710.3, 1: 1750.3. Samples: 38538896. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 02:57:41,211][50642] Avg episode reward: [(0, '19.550'), (1, '23.990')] [2023-10-08 02:57:41,312][52059] Updated weights for policy 1, policy_version 75732 (0.0010) [2023-10-08 02:57:41,676][52059] Updated weights for policy 1, policy_version 75742 (0.0008) [2023-10-08 02:57:42,408][52060] Updated weights for policy 0, policy_version 74790 (0.0007) [2023-10-08 02:57:42,796][52060] Updated weights for policy 0, policy_version 74800 (0.0009) [2023-10-08 02:57:43,168][52060] Updated weights for policy 0, policy_version 74810 (0.0007) [2023-10-08 02:57:45,670][52059] Updated weights for policy 1, policy_version 75752 (0.0009) [2023-10-08 02:57:46,046][52059] Updated weights for policy 1, policy_version 75762 (0.0009) [2023-10-08 02:57:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 154173440. Throughput: 0: 1722.0, 1: 1740.6. Samples: 38559340. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 02:57:46,211][50642] Avg episode reward: [(0, '18.970'), (1, '24.530')] [2023-10-08 02:57:46,401][52059] Updated weights for policy 1, policy_version 75772 (0.0010) [2023-10-08 02:57:47,231][52060] Updated weights for policy 0, policy_version 74820 (0.0007) [2023-10-08 02:57:47,596][52060] Updated weights for policy 0, policy_version 74830 (0.0010) [2023-10-08 02:57:47,976][52060] Updated weights for policy 0, policy_version 74840 (0.0011) [2023-10-08 02:57:50,228][52059] Updated weights for policy 1, policy_version 75782 (0.0011) [2023-10-08 02:57:50,599][52059] Updated weights for policy 1, policy_version 75792 (0.0011) [2023-10-08 02:57:50,958][52059] Updated weights for policy 1, policy_version 75802 (0.0009) [2023-10-08 02:57:51,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 154271744. Throughput: 0: 1694.4, 1: 1744.8. Samples: 38569210. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 02:57:51,211][50642] Avg episode reward: [(0, '22.210'), (1, '22.040')] [2023-10-08 02:57:52,071][52060] Updated weights for policy 0, policy_version 74850 (0.0009) [2023-10-08 02:57:52,458][52060] Updated weights for policy 0, policy_version 74860 (0.0009) [2023-10-08 02:57:52,824][52060] Updated weights for policy 0, policy_version 74870 (0.0010) [2023-10-08 02:57:53,195][52060] Updated weights for policy 0, policy_version 74880 (0.0007) [2023-10-08 02:57:55,049][52059] Updated weights for policy 1, policy_version 75812 (0.0010) [2023-10-08 02:57:55,443][52059] Updated weights for policy 1, policy_version 75822 (0.0011) [2023-10-08 02:57:55,802][52059] Updated weights for policy 1, policy_version 75832 (0.0010) [2023-10-08 02:57:56,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13884.8). Total num frames: 154337280. Throughput: 0: 1717.1, 1: 1745.9. Samples: 38590308. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 02:57:56,211][50642] Avg episode reward: [(0, '20.550'), (1, '22.800')] [2023-10-08 02:57:57,283][52060] Updated weights for policy 0, policy_version 74890 (0.0008) [2023-10-08 02:57:57,659][52060] Updated weights for policy 0, policy_version 74900 (0.0007) [2023-10-08 02:57:58,041][52060] Updated weights for policy 0, policy_version 74910 (0.0010) [2023-10-08 02:57:59,720][52059] Updated weights for policy 1, policy_version 75842 (0.0010) [2023-10-08 02:58:00,077][52059] Updated weights for policy 1, policy_version 75852 (0.0010) [2023-10-08 02:58:00,435][52059] Updated weights for policy 1, policy_version 75862 (0.0009) [2023-10-08 02:58:00,799][52059] Updated weights for policy 1, policy_version 75872 (0.0007) [2023-10-08 02:58:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13884.8). Total num frames: 154402816. Throughput: 0: 1722.4, 1: 1710.2. Samples: 38610298. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 02:58:01,211][50642] Avg episode reward: [(0, '18.110'), (1, '22.730')] [2023-10-08 02:58:02,081][52060] Updated weights for policy 0, policy_version 74920 (0.0008) [2023-10-08 02:58:02,457][52060] Updated weights for policy 0, policy_version 74930 (0.0007) [2023-10-08 02:58:02,820][52060] Updated weights for policy 0, policy_version 74940 (0.0011) [2023-10-08 02:58:04,667][52059] Updated weights for policy 1, policy_version 75882 (0.0010) [2023-10-08 02:58:05,019][52059] Updated weights for policy 1, policy_version 75892 (0.0008) [2023-10-08 02:58:05,382][52059] Updated weights for policy 1, policy_version 75902 (0.0007) [2023-10-08 02:58:06,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 154468352. Throughput: 0: 1701.3, 1: 1744.0. Samples: 38621238. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 02:58:06,211][50642] Avg episode reward: [(0, '19.790'), (1, '23.220')] [2023-10-08 02:58:06,878][52060] Updated weights for policy 0, policy_version 74950 (0.0009) [2023-10-08 02:58:07,246][52060] Updated weights for policy 0, policy_version 74960 (0.0011) [2023-10-08 02:58:07,604][52060] Updated weights for policy 0, policy_version 74970 (0.0008) [2023-10-08 02:58:09,296][52059] Updated weights for policy 1, policy_version 75912 (0.0010) [2023-10-08 02:58:09,657][52059] Updated weights for policy 1, policy_version 75922 (0.0011) [2023-10-08 02:58:10,023][52059] Updated weights for policy 1, policy_version 75932 (0.0010) [2023-10-08 02:58:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 154533888. Throughput: 0: 1716.3, 1: 1721.1. Samples: 38641542. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 02:58:11,211][50642] Avg episode reward: [(0, '22.700'), (1, '24.300')] [2023-10-08 02:58:11,595][52060] Updated weights for policy 0, policy_version 74980 (0.0010) [2023-10-08 02:58:11,967][52060] Updated weights for policy 0, policy_version 74990 (0.0009) [2023-10-08 02:58:12,347][52060] Updated weights for policy 0, policy_version 75000 (0.0008) [2023-10-08 02:58:13,903][52059] Updated weights for policy 1, policy_version 75942 (0.0008) [2023-10-08 02:58:14,257][52059] Updated weights for policy 1, policy_version 75952 (0.0010) [2023-10-08 02:58:14,619][52059] Updated weights for policy 1, policy_version 75962 (0.0007) [2023-10-08 02:58:16,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 154599424. Throughput: 0: 1710.1, 1: 1717.8. Samples: 38662516. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 02:58:16,211][50642] Avg episode reward: [(0, '19.460'), (1, '23.840')] [2023-10-08 02:58:16,301][52060] Updated weights for policy 0, policy_version 75010 (0.0010) [2023-10-08 02:58:16,677][52060] Updated weights for policy 0, policy_version 75020 (0.0009) [2023-10-08 02:58:17,039][52060] Updated weights for policy 0, policy_version 75030 (0.0007) [2023-10-08 02:58:17,418][52060] Updated weights for policy 0, policy_version 75040 (0.0007) [2023-10-08 02:58:18,424][52059] Updated weights for policy 1, policy_version 75972 (0.0008) [2023-10-08 02:58:18,785][52059] Updated weights for policy 1, policy_version 75982 (0.0008) [2023-10-08 02:58:19,145][52059] Updated weights for policy 1, policy_version 75992 (0.0008) [2023-10-08 02:58:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 154664960. Throughput: 0: 1708.8, 1: 1736.6. Samples: 38672760. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 02:58:21,211][50642] Avg episode reward: [(0, '17.060'), (1, '24.570')] [2023-10-08 02:58:21,390][52060] Updated weights for policy 0, policy_version 75050 (0.0010) [2023-10-08 02:58:21,761][52060] Updated weights for policy 0, policy_version 75060 (0.0009) [2023-10-08 02:58:22,137][52060] Updated weights for policy 0, policy_version 75070 (0.0008) [2023-10-08 02:58:23,095][52059] Updated weights for policy 1, policy_version 76002 (0.0007) [2023-10-08 02:58:23,457][52059] Updated weights for policy 1, policy_version 76012 (0.0007) [2023-10-08 02:58:23,819][52059] Updated weights for policy 1, policy_version 76022 (0.0007) [2023-10-08 02:58:24,186][52059] Updated weights for policy 1, policy_version 76032 (0.0008) [2023-10-08 02:58:25,990][52060] Updated weights for policy 0, policy_version 75080 (0.0007) [2023-10-08 02:58:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 154730496. Throughput: 0: 1710.9, 1: 1723.4. Samples: 38693440. Policy #0 lag: (min: 21.0, avg: 21.0, max: 21.0) [2023-10-08 02:58:26,211][50642] Avg episode reward: [(0, '20.330'), (1, '24.870')] [2023-10-08 02:58:26,359][52060] Updated weights for policy 0, policy_version 75090 (0.0008) [2023-10-08 02:58:26,735][52060] Updated weights for policy 0, policy_version 75100 (0.0007) [2023-10-08 02:58:28,169][52059] Updated weights for policy 1, policy_version 76042 (0.0009) [2023-10-08 02:58:28,534][52059] Updated weights for policy 1, policy_version 76052 (0.0010) [2023-10-08 02:58:28,887][52059] Updated weights for policy 1, policy_version 76062 (0.0009) [2023-10-08 02:58:30,655][52060] Updated weights for policy 0, policy_version 75110 (0.0008) [2023-10-08 02:58:31,016][52060] Updated weights for policy 0, policy_version 75120 (0.0008) [2023-10-08 02:58:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 154796032. Throughput: 0: 1707.2, 1: 1733.0. Samples: 38714150. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:58:31,211][50642] Avg episode reward: [(0, '23.270'), (1, '23.220')] [2023-10-08 02:58:31,394][52060] Updated weights for policy 0, policy_version 75130 (0.0008) [2023-10-08 02:58:32,835][52059] Updated weights for policy 1, policy_version 76072 (0.0008) [2023-10-08 02:58:33,196][52059] Updated weights for policy 1, policy_version 76082 (0.0007) [2023-10-08 02:58:33,561][52059] Updated weights for policy 1, policy_version 76092 (0.0007) [2023-10-08 02:58:35,366][52060] Updated weights for policy 0, policy_version 75140 (0.0009) [2023-10-08 02:58:35,733][52060] Updated weights for policy 0, policy_version 75150 (0.0010) [2023-10-08 02:58:36,098][52060] Updated weights for policy 0, policy_version 75160 (0.0011) [2023-10-08 02:58:36,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 154861568. Throughput: 0: 1717.3, 1: 1721.1. Samples: 38723936. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:58:36,211][50642] Avg episode reward: [(0, '19.340'), (1, '22.800')] [2023-10-08 02:58:37,479][52059] Updated weights for policy 1, policy_version 76102 (0.0008) [2023-10-08 02:58:37,847][52059] Updated weights for policy 1, policy_version 76112 (0.0008) [2023-10-08 02:58:38,208][52059] Updated weights for policy 1, policy_version 76122 (0.0008) [2023-10-08 02:58:40,065][52060] Updated weights for policy 0, policy_version 75170 (0.0007) [2023-10-08 02:58:40,441][52060] Updated weights for policy 0, policy_version 75180 (0.0008) [2023-10-08 02:58:40,806][52060] Updated weights for policy 0, policy_version 75190 (0.0008) [2023-10-08 02:58:41,178][52060] Updated weights for policy 0, policy_version 75200 (0.0009) [2023-10-08 02:58:41,210][50642] Fps is (10 sec: 16383.6, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 154959872. Throughput: 0: 1722.3, 1: 1724.2. Samples: 38745404. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:58:41,211][50642] Avg episode reward: [(0, '19.680'), (1, '21.610')] [2023-10-08 02:58:42,095][52059] Updated weights for policy 1, policy_version 76132 (0.0007) [2023-10-08 02:58:42,464][52059] Updated weights for policy 1, policy_version 76142 (0.0007) [2023-10-08 02:58:42,822][52059] Updated weights for policy 1, policy_version 76152 (0.0008) [2023-10-08 02:58:45,075][52060] Updated weights for policy 0, policy_version 75210 (0.0007) [2023-10-08 02:58:45,446][52060] Updated weights for policy 0, policy_version 75220 (0.0009) [2023-10-08 02:58:45,806][52060] Updated weights for policy 0, policy_version 75230 (0.0011) [2023-10-08 02:58:46,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 155025408. Throughput: 0: 1692.6, 1: 1757.2. Samples: 38765538. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:58:46,211][50642] Avg episode reward: [(0, '20.050'), (1, '19.190')] [2023-10-08 02:58:46,221][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000075232_77037568.pth... [2023-10-08 02:58:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000076160_77987840.pth... [2023-10-08 02:58:46,258][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000074560_76349440.pth [2023-10-08 02:58:46,260][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000073632_75399168.pth [2023-10-08 02:58:46,698][52059] Updated weights for policy 1, policy_version 76162 (0.0010) [2023-10-08 02:58:47,060][52059] Updated weights for policy 1, policy_version 76172 (0.0009) [2023-10-08 02:58:47,429][52059] Updated weights for policy 1, policy_version 76182 (0.0009) [2023-10-08 02:58:47,789][52059] Updated weights for policy 1, policy_version 76192 (0.0011) [2023-10-08 02:58:49,795][52060] Updated weights for policy 0, policy_version 75240 (0.0007) [2023-10-08 02:58:50,175][52060] Updated weights for policy 0, policy_version 75250 (0.0007) [2023-10-08 02:58:50,533][52060] Updated weights for policy 0, policy_version 75260 (0.0007) [2023-10-08 02:58:51,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 155090944. Throughput: 0: 1720.4, 1: 1721.1. Samples: 38776106. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:58:51,211][50642] Avg episode reward: [(0, '19.770'), (1, '16.990')] [2023-10-08 02:58:51,943][52059] Updated weights for policy 1, policy_version 76202 (0.0009) [2023-10-08 02:58:52,304][52059] Updated weights for policy 1, policy_version 76212 (0.0010) [2023-10-08 02:58:52,682][52059] Updated weights for policy 1, policy_version 76222 (0.0010) [2023-10-08 02:58:54,587][52060] Updated weights for policy 0, policy_version 75270 (0.0008) [2023-10-08 02:58:54,959][52060] Updated weights for policy 0, policy_version 75280 (0.0007) [2023-10-08 02:58:55,325][52060] Updated weights for policy 0, policy_version 75290 (0.0010) [2023-10-08 02:58:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 155156480. Throughput: 0: 1707.6, 1: 1742.5. Samples: 38796796. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:58:56,211][50642] Avg episode reward: [(0, '18.440'), (1, '17.370')] [2023-10-08 02:58:56,471][52059] Updated weights for policy 1, policy_version 76232 (0.0008) [2023-10-08 02:58:56,834][52059] Updated weights for policy 1, policy_version 76242 (0.0007) [2023-10-08 02:58:57,185][52059] Updated weights for policy 1, policy_version 76252 (0.0007) [2023-10-08 02:58:59,306][52060] Updated weights for policy 0, policy_version 75300 (0.0008) [2023-10-08 02:58:59,667][52060] Updated weights for policy 0, policy_version 75310 (0.0008) [2023-10-08 02:59:00,035][52060] Updated weights for policy 0, policy_version 75320 (0.0008) [2023-10-08 02:59:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 155222016. Throughput: 0: 1690.3, 1: 1749.0. Samples: 38817282. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:59:01,211][50642] Avg episode reward: [(0, '19.360'), (1, '14.550')] [2023-10-08 02:59:01,279][52059] Updated weights for policy 1, policy_version 76262 (0.0010) [2023-10-08 02:59:01,647][52059] Updated weights for policy 1, policy_version 76272 (0.0009) [2023-10-08 02:59:02,002][52059] Updated weights for policy 1, policy_version 76282 (0.0008) [2023-10-08 02:59:04,075][52060] Updated weights for policy 0, policy_version 75330 (0.0009) [2023-10-08 02:59:04,443][52060] Updated weights for policy 0, policy_version 75340 (0.0007) [2023-10-08 02:59:04,810][52060] Updated weights for policy 0, policy_version 75350 (0.0007) [2023-10-08 02:59:05,183][52060] Updated weights for policy 0, policy_version 75360 (0.0007) [2023-10-08 02:59:06,009][52059] Updated weights for policy 1, policy_version 76292 (0.0008) [2023-10-08 02:59:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 155287552. Throughput: 0: 1719.1, 1: 1730.3. Samples: 38827980. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:59:06,211][50642] Avg episode reward: [(0, '19.920'), (1, '15.790')] [2023-10-08 02:59:06,366][52059] Updated weights for policy 1, policy_version 76302 (0.0007) [2023-10-08 02:59:06,737][52059] Updated weights for policy 1, policy_version 76312 (0.0009) [2023-10-08 02:59:09,110][52060] Updated weights for policy 0, policy_version 75370 (0.0010) [2023-10-08 02:59:09,477][52060] Updated weights for policy 0, policy_version 75380 (0.0008) [2023-10-08 02:59:09,855][52060] Updated weights for policy 0, policy_version 75390 (0.0007) [2023-10-08 02:59:10,605][52059] Updated weights for policy 1, policy_version 76322 (0.0008) [2023-10-08 02:59:10,976][52059] Updated weights for policy 1, policy_version 76332 (0.0009) [2023-10-08 02:59:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 155353088. Throughput: 0: 1696.2, 1: 1743.7. Samples: 38848232. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 02:59:11,211][50642] Avg episode reward: [(0, '20.130'), (1, '14.810')] [2023-10-08 02:59:11,345][52059] Updated weights for policy 1, policy_version 76342 (0.0007) [2023-10-08 02:59:11,706][52059] Updated weights for policy 1, policy_version 76352 (0.0007) [2023-10-08 02:59:13,808][52060] Updated weights for policy 0, policy_version 75400 (0.0010) [2023-10-08 02:59:14,168][52060] Updated weights for policy 0, policy_version 75410 (0.0011) [2023-10-08 02:59:14,547][52060] Updated weights for policy 0, policy_version 75420 (0.0010) [2023-10-08 02:59:15,637][52059] Updated weights for policy 1, policy_version 76362 (0.0009) [2023-10-08 02:59:16,008][52059] Updated weights for policy 1, policy_version 76372 (0.0010) [2023-10-08 02:59:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 155418624. Throughput: 0: 1704.0, 1: 1736.4. Samples: 38868968. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 02:59:16,211][50642] Avg episode reward: [(0, '19.500'), (1, '14.960')] [2023-10-08 02:59:16,359][52059] Updated weights for policy 1, policy_version 76382 (0.0008) [2023-10-08 02:59:18,591][52060] Updated weights for policy 0, policy_version 75430 (0.0008) [2023-10-08 02:59:18,970][52060] Updated weights for policy 0, policy_version 75440 (0.0008) [2023-10-08 02:59:19,332][52060] Updated weights for policy 0, policy_version 75450 (0.0010) [2023-10-08 02:59:20,054][52059] Updated weights for policy 1, policy_version 76392 (0.0009) [2023-10-08 02:59:20,407][52059] Updated weights for policy 1, policy_version 76402 (0.0009) [2023-10-08 02:59:20,767][52059] Updated weights for policy 1, policy_version 76412 (0.0008) [2023-10-08 02:59:21,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 155516928. Throughput: 0: 1711.7, 1: 1755.2. Samples: 38879950. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 02:59:21,211][50642] Avg episode reward: [(0, '20.800'), (1, '15.850')] [2023-10-08 02:59:23,362][52060] Updated weights for policy 0, policy_version 75460 (0.0009) [2023-10-08 02:59:23,728][52060] Updated weights for policy 0, policy_version 75470 (0.0010) [2023-10-08 02:59:24,093][52060] Updated weights for policy 0, policy_version 75480 (0.0010) [2023-10-08 02:59:24,737][52059] Updated weights for policy 1, policy_version 76422 (0.0008) [2023-10-08 02:59:25,101][52059] Updated weights for policy 1, policy_version 76432 (0.0007) [2023-10-08 02:59:25,459][52059] Updated weights for policy 1, policy_version 76442 (0.0008) [2023-10-08 02:59:26,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 155582464. Throughput: 0: 1684.8, 1: 1747.6. Samples: 38899858. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 02:59:26,211][50642] Avg episode reward: [(0, '20.350'), (1, '16.080')] [2023-10-08 02:59:28,012][52060] Updated weights for policy 0, policy_version 75490 (0.0010) [2023-10-08 02:59:28,379][52060] Updated weights for policy 0, policy_version 75500 (0.0008) [2023-10-08 02:59:28,739][52060] Updated weights for policy 0, policy_version 75510 (0.0007) [2023-10-08 02:59:29,110][52060] Updated weights for policy 0, policy_version 75520 (0.0009) [2023-10-08 02:59:29,459][52059] Updated weights for policy 1, policy_version 76452 (0.0009) [2023-10-08 02:59:29,852][52059] Updated weights for policy 1, policy_version 76462 (0.0009) [2023-10-08 02:59:30,220][52059] Updated weights for policy 1, policy_version 76472 (0.0009) [2023-10-08 02:59:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 155648000. Throughput: 0: 1720.4, 1: 1724.6. Samples: 38920560. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 02:59:31,211][50642] Avg episode reward: [(0, '17.430'), (1, '16.210')] [2023-10-08 02:59:32,867][52060] Updated weights for policy 0, policy_version 75530 (0.0008) [2023-10-08 02:59:33,237][52060] Updated weights for policy 0, policy_version 75540 (0.0009) [2023-10-08 02:59:33,614][52060] Updated weights for policy 0, policy_version 75550 (0.0009) [2023-10-08 02:59:34,144][52059] Updated weights for policy 1, policy_version 76482 (0.0009) [2023-10-08 02:59:34,511][52059] Updated weights for policy 1, policy_version 76492 (0.0007) [2023-10-08 02:59:34,871][52059] Updated weights for policy 1, policy_version 76502 (0.0007) [2023-10-08 02:59:35,232][52059] Updated weights for policy 1, policy_version 76512 (0.0007) [2023-10-08 02:59:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 155713536. Throughput: 0: 1689.2, 1: 1758.4. Samples: 38931252. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 02:59:36,211][50642] Avg episode reward: [(0, '17.740'), (1, '15.090')] [2023-10-08 02:59:37,602][52060] Updated weights for policy 0, policy_version 75560 (0.0008) [2023-10-08 02:59:37,967][52060] Updated weights for policy 0, policy_version 75570 (0.0009) [2023-10-08 02:59:38,332][52060] Updated weights for policy 0, policy_version 75580 (0.0009) [2023-10-08 02:59:39,003][52059] Updated weights for policy 1, policy_version 76522 (0.0008) [2023-10-08 02:59:39,371][52059] Updated weights for policy 1, policy_version 76532 (0.0010) [2023-10-08 02:59:39,743][52059] Updated weights for policy 1, policy_version 76542 (0.0010) [2023-10-08 02:59:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 155779072. Throughput: 0: 1708.8, 1: 1732.3. Samples: 38951642. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 02:59:41,211][50642] Avg episode reward: [(0, '20.510'), (1, '16.550')] [2023-10-08 02:59:42,362][52060] Updated weights for policy 0, policy_version 75590 (0.0009) [2023-10-08 02:59:42,734][52060] Updated weights for policy 0, policy_version 75600 (0.0007) [2023-10-08 02:59:43,108][52060] Updated weights for policy 0, policy_version 75610 (0.0007) [2023-10-08 02:59:43,680][52059] Updated weights for policy 1, policy_version 76552 (0.0009) [2023-10-08 02:59:44,047][52059] Updated weights for policy 1, policy_version 76562 (0.0010) [2023-10-08 02:59:44,411][52059] Updated weights for policy 1, policy_version 76572 (0.0007) [2023-10-08 02:59:46,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 155844608. Throughput: 0: 1723.1, 1: 1735.1. Samples: 38972902. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 02:59:46,211][50642] Avg episode reward: [(0, '20.970'), (1, '15.640')] [2023-10-08 02:59:47,223][52060] Updated weights for policy 0, policy_version 75620 (0.0010) [2023-10-08 02:59:47,597][52060] Updated weights for policy 0, policy_version 75630 (0.0008) [2023-10-08 02:59:47,960][52060] Updated weights for policy 0, policy_version 75640 (0.0007) [2023-10-08 02:59:48,245][52059] Updated weights for policy 1, policy_version 76582 (0.0007) [2023-10-08 02:59:48,608][52059] Updated weights for policy 1, policy_version 76592 (0.0008) [2023-10-08 02:59:48,971][52059] Updated weights for policy 1, policy_version 76602 (0.0010) [2023-10-08 02:59:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 155910144. Throughput: 0: 1692.0, 1: 1746.1. Samples: 38982694. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 02:59:51,211][50642] Avg episode reward: [(0, '17.980'), (1, '15.140')] [2023-10-08 02:59:51,950][52060] Updated weights for policy 0, policy_version 75650 (0.0008) [2023-10-08 02:59:52,317][52060] Updated weights for policy 0, policy_version 75660 (0.0008) [2023-10-08 02:59:52,690][52060] Updated weights for policy 0, policy_version 75670 (0.0008) [2023-10-08 02:59:52,725][52059] Updated weights for policy 1, policy_version 76612 (0.0009) [2023-10-08 02:59:53,051][52060] Updated weights for policy 0, policy_version 75680 (0.0008) [2023-10-08 02:59:53,087][52059] Updated weights for policy 1, policy_version 76622 (0.0008) [2023-10-08 02:59:53,449][52059] Updated weights for policy 1, policy_version 76632 (0.0007) [2023-10-08 02:59:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 155975680. Throughput: 0: 1719.6, 1: 1742.4. Samples: 39004024. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 02:59:56,211][50642] Avg episode reward: [(0, '19.520'), (1, '16.830')] [2023-10-08 02:59:57,003][52060] Updated weights for policy 0, policy_version 75690 (0.0007) [2023-10-08 02:59:57,369][52059] Updated weights for policy 1, policy_version 76642 (0.0008) [2023-10-08 02:59:57,370][52060] Updated weights for policy 0, policy_version 75700 (0.0008) [2023-10-08 02:59:57,729][52060] Updated weights for policy 0, policy_version 75710 (0.0007) [2023-10-08 02:59:57,739][52059] Updated weights for policy 1, policy_version 76652 (0.0007) [2023-10-08 02:59:58,101][52059] Updated weights for policy 1, policy_version 76662 (0.0009) [2023-10-08 02:59:58,462][52059] Updated weights for policy 1, policy_version 76672 (0.0010) [2023-10-08 03:00:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 156041216. Throughput: 0: 1723.7, 1: 1755.1. Samples: 39025512. Policy #0 lag: (min: 31.0, avg: 38.8, max: 63.0) [2023-10-08 03:00:01,211][50642] Avg episode reward: [(0, '21.360'), (1, '14.890')] [2023-10-08 03:00:01,569][52060] Updated weights for policy 0, policy_version 75720 (0.0010) [2023-10-08 03:00:01,937][52060] Updated weights for policy 0, policy_version 75730 (0.0008) [2023-10-08 03:00:02,311][52060] Updated weights for policy 0, policy_version 75740 (0.0007) [2023-10-08 03:00:02,435][52059] Updated weights for policy 1, policy_version 76682 (0.0009) [2023-10-08 03:00:02,806][52059] Updated weights for policy 1, policy_version 76692 (0.0007) [2023-10-08 03:00:03,172][52059] Updated weights for policy 1, policy_version 76702 (0.0007) [2023-10-08 03:00:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 156106752. Throughput: 0: 1708.9, 1: 1736.4. Samples: 39034988. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) [2023-10-08 03:00:06,211][50642] Avg episode reward: [(0, '21.230'), (1, '16.170')] [2023-10-08 03:00:06,383][52060] Updated weights for policy 0, policy_version 75750 (0.0007) [2023-10-08 03:00:06,763][52060] Updated weights for policy 0, policy_version 75760 (0.0008) [2023-10-08 03:00:06,856][52059] Updated weights for policy 1, policy_version 76712 (0.0008) [2023-10-08 03:00:07,123][52060] Updated weights for policy 0, policy_version 75770 (0.0009) [2023-10-08 03:00:07,217][52059] Updated weights for policy 1, policy_version 76722 (0.0007) [2023-10-08 03:00:07,577][52059] Updated weights for policy 1, policy_version 76732 (0.0009) [2023-10-08 03:00:11,135][52060] Updated weights for policy 0, policy_version 75780 (0.0008) [2023-10-08 03:00:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 156172288. Throughput: 0: 1732.8, 1: 1744.2. Samples: 39056324. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) [2023-10-08 03:00:11,211][50642] Avg episode reward: [(0, '19.390'), (1, '15.640')] [2023-10-08 03:00:11,498][52060] Updated weights for policy 0, policy_version 75790 (0.0009) [2023-10-08 03:00:11,519][52059] Updated weights for policy 1, policy_version 76742 (0.0007) [2023-10-08 03:00:11,855][52060] Updated weights for policy 0, policy_version 75800 (0.0007) [2023-10-08 03:00:11,881][52059] Updated weights for policy 1, policy_version 76752 (0.0008) [2023-10-08 03:00:12,252][52059] Updated weights for policy 1, policy_version 76762 (0.0009) [2023-10-08 03:00:15,816][52060] Updated weights for policy 0, policy_version 75810 (0.0009) [2023-10-08 03:00:16,191][52060] Updated weights for policy 0, policy_version 75820 (0.0011) [2023-10-08 03:00:16,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 156237824. Throughput: 0: 1726.7, 1: 1762.1. Samples: 39077556. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) [2023-10-08 03:00:16,211][50642] Avg episode reward: [(0, '20.120'), (1, '15.900')] [2023-10-08 03:00:16,390][52059] Updated weights for policy 1, policy_version 76772 (0.0008) [2023-10-08 03:00:16,556][52060] Updated weights for policy 0, policy_version 75830 (0.0010) [2023-10-08 03:00:16,782][52059] Updated weights for policy 1, policy_version 76782 (0.0007) [2023-10-08 03:00:16,925][52060] Updated weights for policy 0, policy_version 75840 (0.0009) [2023-10-08 03:00:17,152][52059] Updated weights for policy 1, policy_version 76792 (0.0008) [2023-10-08 03:00:20,948][52060] Updated weights for policy 0, policy_version 75850 (0.0008) [2023-10-08 03:00:20,986][52059] Updated weights for policy 1, policy_version 76802 (0.0009) [2023-10-08 03:00:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 156303360. Throughput: 0: 1730.3, 1: 1727.7. Samples: 39086862. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) [2023-10-08 03:00:21,211][50642] Avg episode reward: [(0, '24.010'), (1, '16.150')] [2023-10-08 03:00:21,308][52060] Updated weights for policy 0, policy_version 75860 (0.0007) [2023-10-08 03:00:21,353][52059] Updated weights for policy 1, policy_version 76812 (0.0007) [2023-10-08 03:00:21,675][52060] Updated weights for policy 0, policy_version 75870 (0.0008) [2023-10-08 03:00:21,726][52059] Updated weights for policy 1, policy_version 76822 (0.0007) [2023-10-08 03:00:22,089][52059] Updated weights for policy 1, policy_version 76832 (0.0007) [2023-10-08 03:00:25,643][52060] Updated weights for policy 0, policy_version 75880 (0.0008) [2023-10-08 03:00:26,008][52060] Updated weights for policy 0, policy_version 75890 (0.0008) [2023-10-08 03:00:26,025][52059] Updated weights for policy 1, policy_version 76842 (0.0010) [2023-10-08 03:00:26,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 156368896. Throughput: 0: 1728.5, 1: 1754.2. Samples: 39108364. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) [2023-10-08 03:00:26,211][50642] Avg episode reward: [(0, '20.190'), (1, '15.890')] [2023-10-08 03:00:26,381][52060] Updated weights for policy 0, policy_version 75900 (0.0007) [2023-10-08 03:00:26,388][52059] Updated weights for policy 1, policy_version 76852 (0.0008) [2023-10-08 03:00:26,758][52059] Updated weights for policy 1, policy_version 76862 (0.0008) [2023-10-08 03:00:30,208][52060] Updated weights for policy 0, policy_version 75910 (0.0008) [2023-10-08 03:00:30,577][52060] Updated weights for policy 0, policy_version 75920 (0.0007) [2023-10-08 03:00:30,856][52059] Updated weights for policy 1, policy_version 76872 (0.0009) [2023-10-08 03:00:30,951][52060] Updated weights for policy 0, policy_version 75930 (0.0007) [2023-10-08 03:00:31,207][52059] Updated weights for policy 1, policy_version 76882 (0.0008) [2023-10-08 03:00:31,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 156467200. Throughput: 0: 1715.9, 1: 1739.0. Samples: 39128372. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) [2023-10-08 03:00:31,211][50642] Avg episode reward: [(0, '18.470'), (1, '15.770')] [2023-10-08 03:00:31,578][52059] Updated weights for policy 1, policy_version 76892 (0.0010) [2023-10-08 03:00:35,018][52060] Updated weights for policy 0, policy_version 75940 (0.0008) [2023-10-08 03:00:35,381][52060] Updated weights for policy 0, policy_version 75950 (0.0009) [2023-10-08 03:00:35,564][52059] Updated weights for policy 1, policy_version 76902 (0.0009) [2023-10-08 03:00:35,741][52060] Updated weights for policy 0, policy_version 75960 (0.0009) [2023-10-08 03:00:35,923][52059] Updated weights for policy 1, policy_version 76912 (0.0009) [2023-10-08 03:00:36,210][50642] Fps is (10 sec: 16384.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 156532736. Throughput: 0: 1736.5, 1: 1738.2. Samples: 39139056. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) [2023-10-08 03:00:36,211][50642] Avg episode reward: [(0, '20.020'), (1, '17.070')] [2023-10-08 03:00:36,292][52059] Updated weights for policy 1, policy_version 76922 (0.0007) [2023-10-08 03:00:39,701][52060] Updated weights for policy 0, policy_version 75970 (0.0009) [2023-10-08 03:00:40,068][52060] Updated weights for policy 0, policy_version 75980 (0.0009) [2023-10-08 03:00:40,173][52059] Updated weights for policy 1, policy_version 76932 (0.0008) [2023-10-08 03:00:40,442][52060] Updated weights for policy 0, policy_version 75990 (0.0008) [2023-10-08 03:00:40,533][52059] Updated weights for policy 1, policy_version 76942 (0.0008) [2023-10-08 03:00:40,806][52060] Updated weights for policy 0, policy_version 76000 (0.0009) [2023-10-08 03:00:40,893][52059] Updated weights for policy 1, policy_version 76952 (0.0007) [2023-10-08 03:00:41,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 156631040. Throughput: 0: 1723.7, 1: 1741.8. Samples: 39159972. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) [2023-10-08 03:00:41,211][50642] Avg episode reward: [(0, '22.740'), (1, '15.480')] [2023-10-08 03:00:44,819][52060] Updated weights for policy 0, policy_version 76010 (0.0008) [2023-10-08 03:00:44,919][52059] Updated weights for policy 1, policy_version 76962 (0.0008) [2023-10-08 03:00:45,170][52060] Updated weights for policy 0, policy_version 76020 (0.0009) [2023-10-08 03:00:45,278][52059] Updated weights for policy 1, policy_version 76972 (0.0009) [2023-10-08 03:00:45,546][52060] Updated weights for policy 0, policy_version 76030 (0.0008) [2023-10-08 03:00:45,649][52059] Updated weights for policy 1, policy_version 76982 (0.0008) [2023-10-08 03:00:46,011][52059] Updated weights for policy 1, policy_version 76992 (0.0009) [2023-10-08 03:00:46,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 156696576. Throughput: 0: 1697.2, 1: 1713.3. Samples: 39178984. Policy #0 lag: (min: 31.0, avg: 33.8, max: 63.0) [2023-10-08 03:00:46,211][50642] Avg episode reward: [(0, '20.200'), (1, '16.830')] [2023-10-08 03:00:46,220][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000076992_78839808.pth... [2023-10-08 03:00:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000076032_77856768.pth... [2023-10-08 03:00:46,251][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000075360_77168640.pth [2023-10-08 03:00:46,257][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000074432_76218368.pth [2023-10-08 03:00:49,403][52060] Updated weights for policy 0, policy_version 76040 (0.0009) [2023-10-08 03:00:49,775][52060] Updated weights for policy 0, policy_version 76050 (0.0007) [2023-10-08 03:00:50,019][52059] Updated weights for policy 1, policy_version 77002 (0.0008) [2023-10-08 03:00:50,153][52060] Updated weights for policy 0, policy_version 76060 (0.0008) [2023-10-08 03:00:50,382][52059] Updated weights for policy 1, policy_version 77012 (0.0008) [2023-10-08 03:00:50,746][52059] Updated weights for policy 1, policy_version 77022 (0.0007) [2023-10-08 03:00:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 156762112. Throughput: 0: 1727.8, 1: 1735.3. Samples: 39190828. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:00:51,211][50642] Avg episode reward: [(0, '18.760'), (1, '17.150')] [2023-10-08 03:00:54,301][52060] Updated weights for policy 0, policy_version 76070 (0.0008) [2023-10-08 03:00:54,686][52060] Updated weights for policy 0, policy_version 76080 (0.0008) [2023-10-08 03:00:54,962][52059] Updated weights for policy 1, policy_version 77032 (0.0008) [2023-10-08 03:00:55,055][52060] Updated weights for policy 0, policy_version 76090 (0.0007) [2023-10-08 03:00:55,329][52059] Updated weights for policy 1, policy_version 77042 (0.0009) [2023-10-08 03:00:55,685][52059] Updated weights for policy 1, policy_version 77052 (0.0011) [2023-10-08 03:00:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 156827648. Throughput: 0: 1706.8, 1: 1718.9. Samples: 39210480. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:00:56,212][50642] Avg episode reward: [(0, '20.670'), (1, '14.910')] [2023-10-08 03:00:58,812][52060] Updated weights for policy 0, policy_version 76100 (0.0007) [2023-10-08 03:00:59,188][52060] Updated weights for policy 0, policy_version 76110 (0.0008) [2023-10-08 03:00:59,553][52059] Updated weights for policy 1, policy_version 77062 (0.0009) [2023-10-08 03:00:59,556][52060] Updated weights for policy 0, policy_version 76120 (0.0008) [2023-10-08 03:00:59,924][52059] Updated weights for policy 1, policy_version 77072 (0.0010) [2023-10-08 03:01:00,283][52059] Updated weights for policy 1, policy_version 77082 (0.0011) [2023-10-08 03:01:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 156893184. Throughput: 0: 1699.1, 1: 1696.8. Samples: 39230370. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:01:01,211][50642] Avg episode reward: [(0, '21.500'), (1, '17.170')] [2023-10-08 03:01:03,386][52060] Updated weights for policy 0, policy_version 76130 (0.0008) [2023-10-08 03:01:03,753][52060] Updated weights for policy 0, policy_version 76140 (0.0010) [2023-10-08 03:01:04,113][52060] Updated weights for policy 0, policy_version 76150 (0.0009) [2023-10-08 03:01:04,232][52059] Updated weights for policy 1, policy_version 77092 (0.0009) [2023-10-08 03:01:04,484][52060] Updated weights for policy 0, policy_version 76160 (0.0007) [2023-10-08 03:01:04,641][52059] Updated weights for policy 1, policy_version 77102 (0.0008) [2023-10-08 03:01:05,010][52059] Updated weights for policy 1, policy_version 77112 (0.0008) [2023-10-08 03:01:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 156958720. Throughput: 0: 1716.8, 1: 1729.5. Samples: 39241946. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:01:06,211][50642] Avg episode reward: [(0, '20.080'), (1, '16.420')] [2023-10-08 03:01:08,451][52060] Updated weights for policy 0, policy_version 76170 (0.0011) [2023-10-08 03:01:08,821][52060] Updated weights for policy 0, policy_version 76180 (0.0010) [2023-10-08 03:01:08,880][52059] Updated weights for policy 1, policy_version 77122 (0.0007) [2023-10-08 03:01:09,185][52060] Updated weights for policy 0, policy_version 76190 (0.0009) [2023-10-08 03:01:09,238][52059] Updated weights for policy 1, policy_version 77132 (0.0007) [2023-10-08 03:01:09,600][52059] Updated weights for policy 1, policy_version 77142 (0.0007) [2023-10-08 03:01:09,967][52059] Updated weights for policy 1, policy_version 77152 (0.0008) [2023-10-08 03:01:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 157024256. Throughput: 0: 1700.2, 1: 1698.5. Samples: 39261306. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:01:11,211][50642] Avg episode reward: [(0, '20.890'), (1, '16.480')] [2023-10-08 03:01:13,261][52060] Updated weights for policy 0, policy_version 76200 (0.0008) [2023-10-08 03:01:13,639][52060] Updated weights for policy 0, policy_version 76210 (0.0008) [2023-10-08 03:01:13,852][52059] Updated weights for policy 1, policy_version 77162 (0.0008) [2023-10-08 03:01:13,997][52060] Updated weights for policy 0, policy_version 76220 (0.0009) [2023-10-08 03:01:14,225][52059] Updated weights for policy 1, policy_version 77172 (0.0008) [2023-10-08 03:01:14,597][52059] Updated weights for policy 1, policy_version 77182 (0.0007) [2023-10-08 03:01:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 157089792. Throughput: 0: 1715.8, 1: 1707.2. Samples: 39282408. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:01:16,211][50642] Avg episode reward: [(0, '20.920'), (1, '17.700')] [2023-10-08 03:01:17,986][52060] Updated weights for policy 0, policy_version 76230 (0.0009) [2023-10-08 03:01:18,351][52060] Updated weights for policy 0, policy_version 76240 (0.0009) [2023-10-08 03:01:18,568][52059] Updated weights for policy 1, policy_version 77192 (0.0009) [2023-10-08 03:01:18,728][52060] Updated weights for policy 0, policy_version 76250 (0.0009) [2023-10-08 03:01:18,930][52059] Updated weights for policy 1, policy_version 77202 (0.0009) [2023-10-08 03:01:19,286][52059] Updated weights for policy 1, policy_version 77212 (0.0008) [2023-10-08 03:01:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 157155328. Throughput: 0: 1701.3, 1: 1712.4. Samples: 39292674. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:01:21,211][50642] Avg episode reward: [(0, '21.010'), (1, '19.380')] [2023-10-08 03:01:22,757][52060] Updated weights for policy 0, policy_version 76260 (0.0008) [2023-10-08 03:01:23,033][52059] Updated weights for policy 1, policy_version 77222 (0.0009) [2023-10-08 03:01:23,131][52060] Updated weights for policy 0, policy_version 76270 (0.0008) [2023-10-08 03:01:23,395][52059] Updated weights for policy 1, policy_version 77232 (0.0009) [2023-10-08 03:01:23,494][52060] Updated weights for policy 0, policy_version 76280 (0.0008) [2023-10-08 03:01:23,758][52059] Updated weights for policy 1, policy_version 77242 (0.0008) [2023-10-08 03:01:26,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 157220864. Throughput: 0: 1703.6, 1: 1700.6. Samples: 39313160. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:01:26,211][50642] Avg episode reward: [(0, '19.080'), (1, '20.160')] [2023-10-08 03:01:27,508][52060] Updated weights for policy 0, policy_version 76290 (0.0008) [2023-10-08 03:01:27,775][52059] Updated weights for policy 1, policy_version 77252 (0.0009) [2023-10-08 03:01:27,875][52060] Updated weights for policy 0, policy_version 76300 (0.0008) [2023-10-08 03:01:28,142][52059] Updated weights for policy 1, policy_version 77262 (0.0007) [2023-10-08 03:01:28,244][52060] Updated weights for policy 0, policy_version 76310 (0.0009) [2023-10-08 03:01:28,506][52059] Updated weights for policy 1, policy_version 77272 (0.0007) [2023-10-08 03:01:28,598][52060] Updated weights for policy 0, policy_version 76320 (0.0009) [2023-10-08 03:01:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 157286400. Throughput: 0: 1731.9, 1: 1720.0. Samples: 39334318. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:01:31,211][50642] Avg episode reward: [(0, '20.560'), (1, '21.530')] [2023-10-08 03:01:32,478][52059] Updated weights for policy 1, policy_version 77282 (0.0007) [2023-10-08 03:01:32,584][52060] Updated weights for policy 0, policy_version 76330 (0.0008) [2023-10-08 03:01:32,842][52059] Updated weights for policy 1, policy_version 77292 (0.0007) [2023-10-08 03:01:32,952][52060] Updated weights for policy 0, policy_version 76340 (0.0009) [2023-10-08 03:01:33,211][52059] Updated weights for policy 1, policy_version 77302 (0.0007) [2023-10-08 03:01:33,321][52060] Updated weights for policy 0, policy_version 76350 (0.0008) [2023-10-08 03:01:33,575][52059] Updated weights for policy 1, policy_version 77312 (0.0007) [2023-10-08 03:01:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 157351936. Throughput: 0: 1696.3, 1: 1697.1. Samples: 39343532. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:01:36,211][50642] Avg episode reward: [(0, '22.850'), (1, '19.180')] [2023-10-08 03:01:37,167][52060] Updated weights for policy 0, policy_version 76360 (0.0008) [2023-10-08 03:01:37,423][52059] Updated weights for policy 1, policy_version 77322 (0.0008) [2023-10-08 03:01:37,528][52060] Updated weights for policy 0, policy_version 76370 (0.0007) [2023-10-08 03:01:37,793][52059] Updated weights for policy 1, policy_version 77332 (0.0009) [2023-10-08 03:01:37,899][52060] Updated weights for policy 0, policy_version 76380 (0.0008) [2023-10-08 03:01:38,153][52059] Updated weights for policy 1, policy_version 77342 (0.0008) [2023-10-08 03:01:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 157417472. Throughput: 0: 1722.6, 1: 1715.5. Samples: 39365192. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:01:41,211][50642] Avg episode reward: [(0, '21.200'), (1, '19.260')] [2023-10-08 03:01:41,831][52060] Updated weights for policy 0, policy_version 76390 (0.0009) [2023-10-08 03:01:42,092][52059] Updated weights for policy 1, policy_version 77352 (0.0007) [2023-10-08 03:01:42,219][52060] Updated weights for policy 0, policy_version 76400 (0.0008) [2023-10-08 03:01:42,449][52059] Updated weights for policy 1, policy_version 77362 (0.0008) [2023-10-08 03:01:42,585][52060] Updated weights for policy 0, policy_version 76410 (0.0008) [2023-10-08 03:01:42,804][52059] Updated weights for policy 1, policy_version 77372 (0.0007) [2023-10-08 03:01:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 157483008. Throughput: 0: 1729.2, 1: 1737.0. Samples: 39386350. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:01:46,211][50642] Avg episode reward: [(0, '19.850'), (1, '19.830')] [2023-10-08 03:01:46,509][52060] Updated weights for policy 0, policy_version 76420 (0.0008) [2023-10-08 03:01:46,836][52059] Updated weights for policy 1, policy_version 77382 (0.0009) [2023-10-08 03:01:46,881][52060] Updated weights for policy 0, policy_version 76430 (0.0008) [2023-10-08 03:01:47,191][52059] Updated weights for policy 1, policy_version 77392 (0.0007) [2023-10-08 03:01:47,250][52060] Updated weights for policy 0, policy_version 76440 (0.0008) [2023-10-08 03:01:47,554][52059] Updated weights for policy 1, policy_version 77402 (0.0007) [2023-10-08 03:01:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 157548544. Throughput: 0: 1710.7, 1: 1706.4. Samples: 39395716. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:01:51,211][50642] Avg episode reward: [(0, '20.870'), (1, '19.340')] [2023-10-08 03:01:51,220][52060] Updated weights for policy 0, policy_version 76450 (0.0008) [2023-10-08 03:01:51,528][52059] Updated weights for policy 1, policy_version 77412 (0.0008) [2023-10-08 03:01:51,591][52060] Updated weights for policy 0, policy_version 76460 (0.0008) [2023-10-08 03:01:51,898][52059] Updated weights for policy 1, policy_version 77422 (0.0007) [2023-10-08 03:01:51,949][52060] Updated weights for policy 0, policy_version 76470 (0.0008) [2023-10-08 03:01:52,268][52059] Updated weights for policy 1, policy_version 77432 (0.0009) [2023-10-08 03:01:52,314][52060] Updated weights for policy 0, policy_version 76480 (0.0008) [2023-10-08 03:01:56,073][52059] Updated weights for policy 1, policy_version 77442 (0.0007) [2023-10-08 03:01:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 157614080. Throughput: 0: 1721.2, 1: 1740.7. Samples: 39417092. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:01:56,211][50642] Avg episode reward: [(0, '21.670'), (1, '20.290')] [2023-10-08 03:01:56,367][52060] Updated weights for policy 0, policy_version 76490 (0.0007) [2023-10-08 03:01:56,432][52059] Updated weights for policy 1, policy_version 77452 (0.0007) [2023-10-08 03:01:56,739][52060] Updated weights for policy 0, policy_version 76500 (0.0008) [2023-10-08 03:01:56,794][52059] Updated weights for policy 1, policy_version 77462 (0.0007) [2023-10-08 03:01:57,103][52060] Updated weights for policy 0, policy_version 76510 (0.0008) [2023-10-08 03:01:57,163][52059] Updated weights for policy 1, policy_version 77472 (0.0007) [2023-10-08 03:02:01,197][52060] Updated weights for policy 0, policy_version 76520 (0.0009) [2023-10-08 03:02:01,203][52059] Updated weights for policy 1, policy_version 77482 (0.0009) [2023-10-08 03:02:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 157679616. Throughput: 0: 1718.6, 1: 1744.0. Samples: 39438224. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:02:01,211][50642] Avg episode reward: [(0, '21.550'), (1, '21.040')] [2023-10-08 03:02:01,568][52060] Updated weights for policy 0, policy_version 76530 (0.0010) [2023-10-08 03:02:01,569][52059] Updated weights for policy 1, policy_version 77492 (0.0009) [2023-10-08 03:02:01,929][52059] Updated weights for policy 1, policy_version 77502 (0.0007) [2023-10-08 03:02:01,938][52060] Updated weights for policy 0, policy_version 76540 (0.0009) [2023-10-08 03:02:05,782][52059] Updated weights for policy 1, policy_version 77512 (0.0008) [2023-10-08 03:02:05,870][52060] Updated weights for policy 0, policy_version 76550 (0.0008) [2023-10-08 03:02:06,152][52059] Updated weights for policy 1, policy_version 77522 (0.0007) [2023-10-08 03:02:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 157745152. Throughput: 0: 1715.7, 1: 1729.6. Samples: 39447712. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:02:06,211][50642] Avg episode reward: [(0, '19.150'), (1, '23.400')] [2023-10-08 03:02:06,239][52060] Updated weights for policy 0, policy_version 76560 (0.0009) [2023-10-08 03:02:06,520][52059] Updated weights for policy 1, policy_version 77532 (0.0007) [2023-10-08 03:02:06,614][52060] Updated weights for policy 0, policy_version 76570 (0.0007) [2023-10-08 03:02:10,461][52059] Updated weights for policy 1, policy_version 77542 (0.0009) [2023-10-08 03:02:10,585][52060] Updated weights for policy 0, policy_version 76580 (0.0007) [2023-10-08 03:02:10,830][52059] Updated weights for policy 1, policy_version 77552 (0.0008) [2023-10-08 03:02:10,946][52060] Updated weights for policy 0, policy_version 76590 (0.0007) [2023-10-08 03:02:11,188][52059] Updated weights for policy 1, policy_version 77562 (0.0010) [2023-10-08 03:02:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 157810688. Throughput: 0: 1719.5, 1: 1745.0. Samples: 39469064. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:02:11,211][50642] Avg episode reward: [(0, '19.990'), (1, '22.650')] [2023-10-08 03:02:11,310][52060] Updated weights for policy 0, policy_version 76600 (0.0008) [2023-10-08 03:02:15,029][52059] Updated weights for policy 1, policy_version 77572 (0.0008) [2023-10-08 03:02:15,386][52059] Updated weights for policy 1, policy_version 77582 (0.0007) [2023-10-08 03:02:15,414][52060] Updated weights for policy 0, policy_version 76610 (0.0008) [2023-10-08 03:02:15,749][52059] Updated weights for policy 1, policy_version 77592 (0.0008) [2023-10-08 03:02:15,778][52060] Updated weights for policy 0, policy_version 76620 (0.0007) [2023-10-08 03:02:16,150][52060] Updated weights for policy 0, policy_version 76630 (0.0009) [2023-10-08 03:02:16,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 157908992. Throughput: 0: 1702.6, 1: 1733.7. Samples: 39488952. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:02:16,211][50642] Avg episode reward: [(0, '20.410'), (1, '22.990')] [2023-10-08 03:02:16,503][52060] Updated weights for policy 0, policy_version 76640 (0.0011) [2023-10-08 03:02:19,713][52059] Updated weights for policy 1, policy_version 77602 (0.0008) [2023-10-08 03:02:20,078][52059] Updated weights for policy 1, policy_version 77612 (0.0008) [2023-10-08 03:02:20,445][52059] Updated weights for policy 1, policy_version 77622 (0.0008) [2023-10-08 03:02:20,467][52060] Updated weights for policy 0, policy_version 76650 (0.0009) [2023-10-08 03:02:20,809][52059] Updated weights for policy 1, policy_version 77632 (0.0009) [2023-10-08 03:02:20,839][52060] Updated weights for policy 0, policy_version 76660 (0.0008) [2023-10-08 03:02:21,209][52060] Updated weights for policy 0, policy_version 76670 (0.0011) [2023-10-08 03:02:21,210][50642] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 157974528. Throughput: 0: 1718.6, 1: 1755.3. Samples: 39499860. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:02:21,211][50642] Avg episode reward: [(0, '22.460'), (1, '23.490')] [2023-10-08 03:02:24,522][52059] Updated weights for policy 1, policy_version 77642 (0.0008) [2023-10-08 03:02:24,880][52059] Updated weights for policy 1, policy_version 77652 (0.0007) [2023-10-08 03:02:25,149][52060] Updated weights for policy 0, policy_version 76680 (0.0007) [2023-10-08 03:02:25,250][52059] Updated weights for policy 1, policy_version 77662 (0.0008) [2023-10-08 03:02:25,508][52060] Updated weights for policy 0, policy_version 76690 (0.0010) [2023-10-08 03:02:25,887][52060] Updated weights for policy 0, policy_version 76700 (0.0008) [2023-10-08 03:02:26,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 158072832. Throughput: 0: 1714.0, 1: 1737.5. Samples: 39520506. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:02:26,211][50642] Avg episode reward: [(0, '20.190'), (1, '26.110')] [2023-10-08 03:02:29,236][52059] Updated weights for policy 1, policy_version 77672 (0.0009) [2023-10-08 03:02:29,606][52059] Updated weights for policy 1, policy_version 77682 (0.0007) [2023-10-08 03:02:29,885][52060] Updated weights for policy 0, policy_version 76710 (0.0008) [2023-10-08 03:02:29,960][52059] Updated weights for policy 1, policy_version 77692 (0.0007) [2023-10-08 03:02:30,253][52060] Updated weights for policy 0, policy_version 76720 (0.0008) [2023-10-08 03:02:30,625][52060] Updated weights for policy 0, policy_version 76730 (0.0008) [2023-10-08 03:02:31,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 158138368. Throughput: 0: 1686.4, 1: 1729.5. Samples: 39540066. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:02:31,211][50642] Avg episode reward: [(0, '20.250'), (1, '20.890')] [2023-10-08 03:02:34,007][52059] Updated weights for policy 1, policy_version 77702 (0.0008) [2023-10-08 03:02:34,370][52059] Updated weights for policy 1, policy_version 77712 (0.0009) [2023-10-08 03:02:34,573][52060] Updated weights for policy 0, policy_version 76740 (0.0008) [2023-10-08 03:02:34,725][52059] Updated weights for policy 1, policy_version 77722 (0.0008) [2023-10-08 03:02:34,929][52060] Updated weights for policy 0, policy_version 76750 (0.0008) [2023-10-08 03:02:35,291][52060] Updated weights for policy 0, policy_version 76760 (0.0007) [2023-10-08 03:02:36,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 158203904. Throughput: 0: 1714.0, 1: 1756.1. Samples: 39551872. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:02:36,211][50642] Avg episode reward: [(0, '19.960'), (1, '21.920')] [2023-10-08 03:02:38,920][52059] Updated weights for policy 1, policy_version 77732 (0.0008) [2023-10-08 03:02:39,167][52060] Updated weights for policy 0, policy_version 76770 (0.0008) [2023-10-08 03:02:39,306][52059] Updated weights for policy 1, policy_version 77742 (0.0008) [2023-10-08 03:02:39,527][52060] Updated weights for policy 0, policy_version 76780 (0.0008) [2023-10-08 03:02:39,675][52059] Updated weights for policy 1, policy_version 77752 (0.0007) [2023-10-08 03:02:39,892][52060] Updated weights for policy 0, policy_version 76790 (0.0008) [2023-10-08 03:02:40,257][52060] Updated weights for policy 0, policy_version 76800 (0.0009) [2023-10-08 03:02:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 158269440. Throughput: 0: 1699.6, 1: 1724.6. Samples: 39571182. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:02:41,211][50642] Avg episode reward: [(0, '21.960'), (1, '20.560')] [2023-10-08 03:02:43,368][52059] Updated weights for policy 1, policy_version 77762 (0.0007) [2023-10-08 03:02:43,729][52059] Updated weights for policy 1, policy_version 77772 (0.0007) [2023-10-08 03:02:44,099][52059] Updated weights for policy 1, policy_version 77782 (0.0008) [2023-10-08 03:02:44,321][52060] Updated weights for policy 0, policy_version 76810 (0.0007) [2023-10-08 03:02:44,451][52059] Updated weights for policy 1, policy_version 77792 (0.0008) [2023-10-08 03:02:44,684][52060] Updated weights for policy 0, policy_version 76820 (0.0007) [2023-10-08 03:02:45,051][52060] Updated weights for policy 0, policy_version 76830 (0.0009) [2023-10-08 03:02:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 158334976. Throughput: 0: 1691.4, 1: 1724.6. Samples: 39591942. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:02:46,211][50642] Avg episode reward: [(0, '21.060'), (1, '23.920')] [2023-10-08 03:02:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000076832_78675968.pth... [2023-10-08 03:02:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000077792_79659008.pth... [2023-10-08 03:02:46,250][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000076160_77987840.pth [2023-10-08 03:02:46,252][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000075232_77037568.pth [2023-10-08 03:02:48,387][52059] Updated weights for policy 1, policy_version 77802 (0.0008) [2023-10-08 03:02:48,748][52059] Updated weights for policy 1, policy_version 77812 (0.0007) [2023-10-08 03:02:49,113][52059] Updated weights for policy 1, policy_version 77822 (0.0007) [2023-10-08 03:02:49,117][52060] Updated weights for policy 0, policy_version 76840 (0.0008) [2023-10-08 03:02:49,487][52060] Updated weights for policy 0, policy_version 76850 (0.0007) [2023-10-08 03:02:49,854][52060] Updated weights for policy 0, policy_version 76860 (0.0007) [2023-10-08 03:02:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 158400512. Throughput: 0: 1716.2, 1: 1732.3. Samples: 39602896. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:02:51,211][50642] Avg episode reward: [(0, '22.580'), (1, '22.700')] [2023-10-08 03:02:52,987][52059] Updated weights for policy 1, policy_version 77832 (0.0007) [2023-10-08 03:02:53,343][52059] Updated weights for policy 1, policy_version 77842 (0.0011) [2023-10-08 03:02:53,703][52059] Updated weights for policy 1, policy_version 77852 (0.0009) [2023-10-08 03:02:53,884][52060] Updated weights for policy 0, policy_version 76870 (0.0009) [2023-10-08 03:02:54,259][52060] Updated weights for policy 0, policy_version 76880 (0.0008) [2023-10-08 03:02:54,628][52060] Updated weights for policy 0, policy_version 76890 (0.0009) [2023-10-08 03:02:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 158466048. Throughput: 0: 1689.1, 1: 1723.8. Samples: 39622644. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:02:56,211][50642] Avg episode reward: [(0, '20.920'), (1, '20.930')] [2023-10-08 03:02:57,671][52059] Updated weights for policy 1, policy_version 77862 (0.0008) [2023-10-08 03:02:58,038][52059] Updated weights for policy 1, policy_version 77872 (0.0010) [2023-10-08 03:02:58,398][52059] Updated weights for policy 1, policy_version 77882 (0.0009) [2023-10-08 03:02:58,404][52060] Updated weights for policy 0, policy_version 76900 (0.0008) [2023-10-08 03:02:58,770][52060] Updated weights for policy 0, policy_version 76910 (0.0007) [2023-10-08 03:02:59,129][52060] Updated weights for policy 0, policy_version 76920 (0.0010) [2023-10-08 03:03:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 158531584. Throughput: 0: 1708.8, 1: 1744.4. Samples: 39644350. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:03:01,211][50642] Avg episode reward: [(0, '18.960'), (1, '21.720')] [2023-10-08 03:03:02,338][52059] Updated weights for policy 1, policy_version 77892 (0.0008) [2023-10-08 03:03:02,695][52059] Updated weights for policy 1, policy_version 77902 (0.0008) [2023-10-08 03:03:03,060][52059] Updated weights for policy 1, policy_version 77912 (0.0009) [2023-10-08 03:03:03,118][52060] Updated weights for policy 0, policy_version 76930 (0.0008) [2023-10-08 03:03:03,493][52060] Updated weights for policy 0, policy_version 76940 (0.0010) [2023-10-08 03:03:03,861][52060] Updated weights for policy 0, policy_version 76950 (0.0008) [2023-10-08 03:03:04,223][52060] Updated weights for policy 0, policy_version 76960 (0.0009) [2023-10-08 03:03:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 158597120. Throughput: 0: 1704.2, 1: 1724.3. Samples: 39654140. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:03:06,211][50642] Avg episode reward: [(0, '19.030'), (1, '24.990')] [2023-10-08 03:03:06,908][52059] Updated weights for policy 1, policy_version 77922 (0.0008) [2023-10-08 03:03:07,270][52059] Updated weights for policy 1, policy_version 77932 (0.0007) [2023-10-08 03:03:07,637][52059] Updated weights for policy 1, policy_version 77942 (0.0007) [2023-10-08 03:03:07,996][52059] Updated weights for policy 1, policy_version 77952 (0.0010) [2023-10-08 03:03:08,207][52060] Updated weights for policy 0, policy_version 76970 (0.0008) [2023-10-08 03:03:08,577][52060] Updated weights for policy 0, policy_version 76980 (0.0009) [2023-10-08 03:03:08,944][52060] Updated weights for policy 0, policy_version 76990 (0.0009) [2023-10-08 03:03:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 158662656. Throughput: 0: 1694.7, 1: 1743.9. Samples: 39675240. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:03:11,211][50642] Avg episode reward: [(0, '22.300'), (1, '25.420')] [2023-10-08 03:03:11,913][52059] Updated weights for policy 1, policy_version 77962 (0.0010) [2023-10-08 03:03:12,272][52059] Updated weights for policy 1, policy_version 77972 (0.0009) [2023-10-08 03:03:12,627][52059] Updated weights for policy 1, policy_version 77982 (0.0007) [2023-10-08 03:03:12,993][52060] Updated weights for policy 0, policy_version 77000 (0.0010) [2023-10-08 03:03:13,361][52060] Updated weights for policy 0, policy_version 77010 (0.0008) [2023-10-08 03:03:13,725][52060] Updated weights for policy 0, policy_version 77020 (0.0010) [2023-10-08 03:03:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 158728192. Throughput: 0: 1722.8, 1: 1757.5. Samples: 39696678. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:03:16,211][50642] Avg episode reward: [(0, '20.590'), (1, '22.010')] [2023-10-08 03:03:16,463][52059] Updated weights for policy 1, policy_version 77992 (0.0008) [2023-10-08 03:03:16,821][52059] Updated weights for policy 1, policy_version 78002 (0.0007) [2023-10-08 03:03:17,184][52059] Updated weights for policy 1, policy_version 78012 (0.0011) [2023-10-08 03:03:17,610][52060] Updated weights for policy 0, policy_version 77030 (0.0008) [2023-10-08 03:03:17,992][52060] Updated weights for policy 0, policy_version 77040 (0.0009) [2023-10-08 03:03:18,361][52060] Updated weights for policy 0, policy_version 77050 (0.0009) [2023-10-08 03:03:21,138][52059] Updated weights for policy 1, policy_version 78022 (0.0008) [2023-10-08 03:03:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 158793728. Throughput: 0: 1694.3, 1: 1731.1. Samples: 39706016. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:03:21,211][50642] Avg episode reward: [(0, '21.020'), (1, '23.170')] [2023-10-08 03:03:21,506][52059] Updated weights for policy 1, policy_version 78032 (0.0007) [2023-10-08 03:03:21,880][52059] Updated weights for policy 1, policy_version 78042 (0.0008) [2023-10-08 03:03:22,337][52060] Updated weights for policy 0, policy_version 77060 (0.0010) [2023-10-08 03:03:22,717][52060] Updated weights for policy 0, policy_version 77070 (0.0010) [2023-10-08 03:03:23,073][52060] Updated weights for policy 0, policy_version 77080 (0.0008) [2023-10-08 03:03:25,774][52059] Updated weights for policy 1, policy_version 78052 (0.0009) [2023-10-08 03:03:26,160][52059] Updated weights for policy 1, policy_version 78062 (0.0007) [2023-10-08 03:03:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 158859264. Throughput: 0: 1712.6, 1: 1763.8. Samples: 39727620. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:03:26,211][50642] Avg episode reward: [(0, '20.060'), (1, '23.580')] [2023-10-08 03:03:26,527][52059] Updated weights for policy 1, policy_version 78072 (0.0007) [2023-10-08 03:03:26,852][52060] Updated weights for policy 0, policy_version 77090 (0.0007) [2023-10-08 03:03:27,218][52060] Updated weights for policy 0, policy_version 77100 (0.0007) [2023-10-08 03:03:27,585][52060] Updated weights for policy 0, policy_version 77110 (0.0007) [2023-10-08 03:03:27,945][52060] Updated weights for policy 0, policy_version 77120 (0.0007) [2023-10-08 03:03:30,374][52059] Updated weights for policy 1, policy_version 78082 (0.0007) [2023-10-08 03:03:30,729][52059] Updated weights for policy 1, policy_version 78092 (0.0009) [2023-10-08 03:03:31,097][52059] Updated weights for policy 1, policy_version 78102 (0.0009) [2023-10-08 03:03:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 158924800. Throughput: 0: 1733.6, 1: 1748.4. Samples: 39748628. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:03:31,211][50642] Avg episode reward: [(0, '20.590'), (1, '23.400')] [2023-10-08 03:03:31,467][52059] Updated weights for policy 1, policy_version 78112 (0.0008) [2023-10-08 03:03:31,868][52060] Updated weights for policy 0, policy_version 77130 (0.0008) [2023-10-08 03:03:32,241][52060] Updated weights for policy 0, policy_version 77140 (0.0007) [2023-10-08 03:03:32,601][52060] Updated weights for policy 0, policy_version 77150 (0.0011) [2023-10-08 03:03:35,395][52059] Updated weights for policy 1, policy_version 78122 (0.0007) [2023-10-08 03:03:35,753][52059] Updated weights for policy 1, policy_version 78132 (0.0008) [2023-10-08 03:03:36,122][52059] Updated weights for policy 1, policy_version 78142 (0.0011) [2023-10-08 03:03:36,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 159023104. Throughput: 0: 1707.4, 1: 1755.7. Samples: 39758734. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:03:36,211][50642] Avg episode reward: [(0, '22.020'), (1, '23.610')] [2023-10-08 03:03:36,567][52060] Updated weights for policy 0, policy_version 77160 (0.0008) [2023-10-08 03:03:36,944][52060] Updated weights for policy 0, policy_version 77170 (0.0009) [2023-10-08 03:03:37,313][52060] Updated weights for policy 0, policy_version 77180 (0.0009) [2023-10-08 03:03:39,985][52059] Updated weights for policy 1, policy_version 78152 (0.0009) [2023-10-08 03:03:40,343][52059] Updated weights for policy 1, policy_version 78162 (0.0007) [2023-10-08 03:03:40,709][52059] Updated weights for policy 1, policy_version 78172 (0.0009) [2023-10-08 03:03:41,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 159088640. Throughput: 0: 1735.0, 1: 1760.7. Samples: 39779948. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:03:41,211][50642] Avg episode reward: [(0, '22.520'), (1, '23.610')] [2023-10-08 03:03:41,332][52060] Updated weights for policy 0, policy_version 77190 (0.0007) [2023-10-08 03:03:41,692][52060] Updated weights for policy 0, policy_version 77200 (0.0007) [2023-10-08 03:03:42,064][52060] Updated weights for policy 0, policy_version 77210 (0.0012) [2023-10-08 03:03:44,476][52059] Updated weights for policy 1, policy_version 78182 (0.0009) [2023-10-08 03:03:44,839][52059] Updated weights for policy 1, policy_version 78192 (0.0007) [2023-10-08 03:03:45,205][52059] Updated weights for policy 1, policy_version 78202 (0.0007) [2023-10-08 03:03:46,081][52060] Updated weights for policy 0, policy_version 77220 (0.0008) [2023-10-08 03:03:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 159154176. Throughput: 0: 1727.9, 1: 1735.5. Samples: 39800202. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:03:46,211][50642] Avg episode reward: [(0, '20.210'), (1, '25.050')] [2023-10-08 03:03:46,448][52060] Updated weights for policy 0, policy_version 77230 (0.0007) [2023-10-08 03:03:46,817][52060] Updated weights for policy 0, policy_version 77240 (0.0009) [2023-10-08 03:03:48,986][52059] Updated weights for policy 1, policy_version 78212 (0.0008) [2023-10-08 03:03:49,358][52059] Updated weights for policy 1, policy_version 78222 (0.0007) [2023-10-08 03:03:49,717][52059] Updated weights for policy 1, policy_version 78232 (0.0008) [2023-10-08 03:03:51,027][52060] Updated weights for policy 0, policy_version 77250 (0.0007) [2023-10-08 03:03:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 159219712. Throughput: 0: 1717.6, 1: 1764.6. Samples: 39810836. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:03:51,211][50642] Avg episode reward: [(0, '19.430'), (1, '22.820')] [2023-10-08 03:03:51,404][52060] Updated weights for policy 0, policy_version 77260 (0.0008) [2023-10-08 03:03:51,765][52060] Updated weights for policy 0, policy_version 77270 (0.0010) [2023-10-08 03:03:52,131][52060] Updated weights for policy 0, policy_version 77280 (0.0009) [2023-10-08 03:03:53,564][52059] Updated weights for policy 1, policy_version 78242 (0.0009) [2023-10-08 03:03:53,928][52059] Updated weights for policy 1, policy_version 78252 (0.0008) [2023-10-08 03:03:54,290][52059] Updated weights for policy 1, policy_version 78262 (0.0010) [2023-10-08 03:03:54,651][52059] Updated weights for policy 1, policy_version 78272 (0.0008) [2023-10-08 03:03:55,964][52060] Updated weights for policy 0, policy_version 77290 (0.0009) [2023-10-08 03:03:56,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 159285248. Throughput: 0: 1733.6, 1: 1734.3. Samples: 39831294. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:03:56,211][50642] Avg episode reward: [(0, '19.010'), (1, '22.220')] [2023-10-08 03:03:56,324][52060] Updated weights for policy 0, policy_version 77300 (0.0007) [2023-10-08 03:03:56,693][52060] Updated weights for policy 0, policy_version 77310 (0.0008) [2023-10-08 03:03:58,597][52059] Updated weights for policy 1, policy_version 78282 (0.0009) [2023-10-08 03:03:58,958][52059] Updated weights for policy 1, policy_version 78292 (0.0008) [2023-10-08 03:03:59,327][52059] Updated weights for policy 1, policy_version 78302 (0.0007) [2023-10-08 03:04:00,525][52060] Updated weights for policy 0, policy_version 77320 (0.0007) [2023-10-08 03:04:00,884][52060] Updated weights for policy 0, policy_version 77330 (0.0009) [2023-10-08 03:04:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 159350784. Throughput: 0: 1722.5, 1: 1734.0. Samples: 39852222. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:04:01,211][50642] Avg episode reward: [(0, '19.830'), (1, '23.780')] [2023-10-08 03:04:01,253][52060] Updated weights for policy 0, policy_version 77340 (0.0008) [2023-10-08 03:04:03,237][52059] Updated weights for policy 1, policy_version 78312 (0.0010) [2023-10-08 03:04:03,606][52059] Updated weights for policy 1, policy_version 78322 (0.0011) [2023-10-08 03:04:03,972][52059] Updated weights for policy 1, policy_version 78332 (0.0009) [2023-10-08 03:04:05,303][52060] Updated weights for policy 0, policy_version 77350 (0.0009) [2023-10-08 03:04:05,677][52060] Updated weights for policy 0, policy_version 77360 (0.0009) [2023-10-08 03:04:06,050][52060] Updated weights for policy 0, policy_version 77370 (0.0008) [2023-10-08 03:04:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 159416320. Throughput: 0: 1739.6, 1: 1741.0. Samples: 39862644. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:04:06,211][50642] Avg episode reward: [(0, '19.070'), (1, '25.690')] [2023-10-08 03:04:08,008][52059] Updated weights for policy 1, policy_version 78342 (0.0008) [2023-10-08 03:04:08,370][52059] Updated weights for policy 1, policy_version 78352 (0.0008) [2023-10-08 03:04:08,738][52059] Updated weights for policy 1, policy_version 78362 (0.0010) [2023-10-08 03:04:09,856][52060] Updated weights for policy 0, policy_version 77380 (0.0008) [2023-10-08 03:04:10,227][52060] Updated weights for policy 0, policy_version 77390 (0.0009) [2023-10-08 03:04:10,601][52060] Updated weights for policy 0, policy_version 77400 (0.0008) [2023-10-08 03:04:11,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 159514624. Throughput: 0: 1739.6, 1: 1729.0. Samples: 39883706. Policy #0 lag: (min: 2.0, avg: 7.6, max: 34.0) [2023-10-08 03:04:11,211][50642] Avg episode reward: [(0, '19.380'), (1, '23.970')] [2023-10-08 03:04:12,566][52059] Updated weights for policy 1, policy_version 78372 (0.0007) [2023-10-08 03:04:12,961][52059] Updated weights for policy 1, policy_version 78382 (0.0007) [2023-10-08 03:04:13,318][52059] Updated weights for policy 1, policy_version 78392 (0.0008) [2023-10-08 03:04:14,469][52060] Updated weights for policy 0, policy_version 77410 (0.0008) [2023-10-08 03:04:14,825][52060] Updated weights for policy 0, policy_version 77420 (0.0007) [2023-10-08 03:04:15,195][52060] Updated weights for policy 0, policy_version 77430 (0.0007) [2023-10-08 03:04:15,561][52060] Updated weights for policy 0, policy_version 77440 (0.0008) [2023-10-08 03:04:16,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 159580160. Throughput: 0: 1707.0, 1: 1745.9. Samples: 39904008. Policy #0 lag: (min: 2.0, avg: 7.6, max: 34.0) [2023-10-08 03:04:16,211][50642] Avg episode reward: [(0, '19.790'), (1, '21.050')] [2023-10-08 03:04:17,298][52059] Updated weights for policy 1, policy_version 78402 (0.0007) [2023-10-08 03:04:17,652][52059] Updated weights for policy 1, policy_version 78412 (0.0009) [2023-10-08 03:04:18,011][52059] Updated weights for policy 1, policy_version 78422 (0.0009) [2023-10-08 03:04:18,375][52059] Updated weights for policy 1, policy_version 78432 (0.0008) [2023-10-08 03:04:19,564][52060] Updated weights for policy 0, policy_version 77450 (0.0009) [2023-10-08 03:04:19,931][52060] Updated weights for policy 0, policy_version 77460 (0.0008) [2023-10-08 03:04:20,296][52060] Updated weights for policy 0, policy_version 77470 (0.0009) [2023-10-08 03:04:21,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 159645696. Throughput: 0: 1736.4, 1: 1729.3. Samples: 39914692. Policy #0 lag: (min: 2.0, avg: 7.6, max: 34.0) [2023-10-08 03:04:21,211][50642] Avg episode reward: [(0, '19.460'), (1, '24.470')] [2023-10-08 03:04:22,402][52059] Updated weights for policy 1, policy_version 78442 (0.0009) [2023-10-08 03:04:22,772][52059] Updated weights for policy 1, policy_version 78452 (0.0009) [2023-10-08 03:04:23,132][52059] Updated weights for policy 1, policy_version 78462 (0.0010) [2023-10-08 03:04:24,193][52060] Updated weights for policy 0, policy_version 77480 (0.0007) [2023-10-08 03:04:24,566][52060] Updated weights for policy 0, policy_version 77490 (0.0008) [2023-10-08 03:04:24,930][52060] Updated weights for policy 0, policy_version 77500 (0.0009) [2023-10-08 03:04:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 159711232. Throughput: 0: 1716.4, 1: 1728.0. Samples: 39934946. Policy #0 lag: (min: 2.0, avg: 7.6, max: 34.0) [2023-10-08 03:04:26,211][50642] Avg episode reward: [(0, '21.840'), (1, '27.790')] [2023-10-08 03:04:26,966][52059] Updated weights for policy 1, policy_version 78472 (0.0008) [2023-10-08 03:04:27,322][52059] Updated weights for policy 1, policy_version 78482 (0.0010) [2023-10-08 03:04:27,691][52059] Updated weights for policy 1, policy_version 78492 (0.0010) [2023-10-08 03:04:28,901][52060] Updated weights for policy 0, policy_version 77510 (0.0009) [2023-10-08 03:04:29,265][52060] Updated weights for policy 0, policy_version 77520 (0.0009) [2023-10-08 03:04:29,635][52060] Updated weights for policy 0, policy_version 77530 (0.0007) [2023-10-08 03:04:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 159776768. Throughput: 0: 1712.1, 1: 1752.7. Samples: 39956116. Policy #0 lag: (min: 2.0, avg: 7.6, max: 34.0) [2023-10-08 03:04:31,211][50642] Avg episode reward: [(0, '20.270'), (1, '24.320')] [2023-10-08 03:04:31,699][52059] Updated weights for policy 1, policy_version 78502 (0.0010) [2023-10-08 03:04:32,062][52059] Updated weights for policy 1, policy_version 78512 (0.0009) [2023-10-08 03:04:32,435][52059] Updated weights for policy 1, policy_version 78522 (0.0009) [2023-10-08 03:04:33,729][52060] Updated weights for policy 0, policy_version 77540 (0.0007) [2023-10-08 03:04:34,094][52060] Updated weights for policy 0, policy_version 77550 (0.0008) [2023-10-08 03:04:34,461][52060] Updated weights for policy 0, policy_version 77560 (0.0007) [2023-10-08 03:04:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 159842304. Throughput: 0: 1735.4, 1: 1719.2. Samples: 39966294. Policy #0 lag: (min: 2.0, avg: 7.6, max: 34.0) [2023-10-08 03:04:36,211][50642] Avg episode reward: [(0, '22.070'), (1, '20.990')] [2023-10-08 03:04:36,325][52059] Updated weights for policy 1, policy_version 78532 (0.0010) [2023-10-08 03:04:36,688][52059] Updated weights for policy 1, policy_version 78542 (0.0008) [2023-10-08 03:04:37,056][52059] Updated weights for policy 1, policy_version 78552 (0.0007) [2023-10-08 03:04:38,453][52060] Updated weights for policy 0, policy_version 77570 (0.0007) [2023-10-08 03:04:38,811][52060] Updated weights for policy 0, policy_version 77580 (0.0010) [2023-10-08 03:04:39,187][52060] Updated weights for policy 0, policy_version 77590 (0.0008) [2023-10-08 03:04:39,555][52060] Updated weights for policy 0, policy_version 77600 (0.0011) [2023-10-08 03:04:41,027][52059] Updated weights for policy 1, policy_version 78562 (0.0008) [2023-10-08 03:04:41,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 159907840. Throughput: 0: 1706.2, 1: 1748.4. Samples: 39986754. Policy #0 lag: (min: 2.0, avg: 7.6, max: 34.0) [2023-10-08 03:04:41,211][50642] Avg episode reward: [(0, '19.190'), (1, '23.470')] [2023-10-08 03:04:41,393][52059] Updated weights for policy 1, policy_version 78572 (0.0007) [2023-10-08 03:04:41,753][52059] Updated weights for policy 1, policy_version 78582 (0.0010) [2023-10-08 03:04:42,117][52059] Updated weights for policy 1, policy_version 78592 (0.0009) [2023-10-08 03:04:43,562][52060] Updated weights for policy 0, policy_version 77610 (0.0010) [2023-10-08 03:04:43,917][52060] Updated weights for policy 0, policy_version 77620 (0.0011) [2023-10-08 03:04:44,295][52060] Updated weights for policy 0, policy_version 77630 (0.0011) [2023-10-08 03:04:46,032][52059] Updated weights for policy 1, policy_version 78602 (0.0007) [2023-10-08 03:04:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 159973376. Throughput: 0: 1713.5, 1: 1741.0. Samples: 40007672. Policy #0 lag: (min: 2.0, avg: 7.6, max: 34.0) [2023-10-08 03:04:46,211][50642] Avg episode reward: [(0, '18.330'), (1, '26.900')] [2023-10-08 03:04:46,217][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000077632_79495168.pth... [2023-10-08 03:04:46,245][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000076032_77856768.pth [2023-10-08 03:04:46,388][52059] Updated weights for policy 1, policy_version 78612 (0.0007) [2023-10-08 03:04:46,761][52059] Updated weights for policy 1, policy_version 78622 (0.0007) [2023-10-08 03:04:46,827][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000078624_80510976.pth... [2023-10-08 03:04:46,857][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000076992_78839808.pth [2023-10-08 03:04:48,311][52060] Updated weights for policy 0, policy_version 77640 (0.0011) [2023-10-08 03:04:48,674][52060] Updated weights for policy 0, policy_version 77650 (0.0011) [2023-10-08 03:04:49,046][52060] Updated weights for policy 0, policy_version 77660 (0.0011) [2023-10-08 03:04:50,761][52059] Updated weights for policy 1, policy_version 78632 (0.0008) [2023-10-08 03:04:51,121][52059] Updated weights for policy 1, policy_version 78642 (0.0008) [2023-10-08 03:04:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 160038912. Throughput: 0: 1706.4, 1: 1737.3. Samples: 40017612. Policy #0 lag: (min: 2.0, avg: 7.6, max: 34.0) [2023-10-08 03:04:51,211][50642] Avg episode reward: [(0, '22.230'), (1, '23.810')] [2023-10-08 03:04:51,488][52059] Updated weights for policy 1, policy_version 78652 (0.0008) [2023-10-08 03:04:53,060][52060] Updated weights for policy 0, policy_version 77670 (0.0008) [2023-10-08 03:04:53,427][52060] Updated weights for policy 0, policy_version 77680 (0.0008) [2023-10-08 03:04:53,791][52060] Updated weights for policy 0, policy_version 77690 (0.0008) [2023-10-08 03:04:55,484][52059] Updated weights for policy 1, policy_version 78662 (0.0009) [2023-10-08 03:04:55,846][52059] Updated weights for policy 1, policy_version 78672 (0.0011) [2023-10-08 03:04:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 160104448. Throughput: 0: 1697.0, 1: 1741.1. Samples: 40038418. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) [2023-10-08 03:04:56,210][50642] Avg episode reward: [(0, '18.740'), (1, '22.400')] [2023-10-08 03:04:56,214][52059] Updated weights for policy 1, policy_version 78682 (0.0009) [2023-10-08 03:04:57,673][52060] Updated weights for policy 0, policy_version 77700 (0.0007) [2023-10-08 03:04:58,054][52060] Updated weights for policy 0, policy_version 77710 (0.0009) [2023-10-08 03:04:58,422][52060] Updated weights for policy 0, policy_version 77720 (0.0007) [2023-10-08 03:05:00,177][52059] Updated weights for policy 1, policy_version 78692 (0.0007) [2023-10-08 03:05:00,578][52059] Updated weights for policy 1, policy_version 78702 (0.0009) [2023-10-08 03:05:00,943][52059] Updated weights for policy 1, policy_version 78712 (0.0008) [2023-10-08 03:05:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 160169984. Throughput: 0: 1722.4, 1: 1715.7. Samples: 40058726. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) [2023-10-08 03:05:01,211][50642] Avg episode reward: [(0, '19.560'), (1, '22.920')] [2023-10-08 03:05:02,299][52060] Updated weights for policy 0, policy_version 77730 (0.0007) [2023-10-08 03:05:02,676][52060] Updated weights for policy 0, policy_version 77740 (0.0008) [2023-10-08 03:05:03,039][52060] Updated weights for policy 0, policy_version 77750 (0.0007) [2023-10-08 03:05:03,406][52060] Updated weights for policy 0, policy_version 77760 (0.0008) [2023-10-08 03:05:04,780][52059] Updated weights for policy 1, policy_version 78722 (0.0009) [2023-10-08 03:05:05,138][52059] Updated weights for policy 1, policy_version 78732 (0.0009) [2023-10-08 03:05:05,497][52059] Updated weights for policy 1, policy_version 78742 (0.0007) [2023-10-08 03:05:05,861][52059] Updated weights for policy 1, policy_version 78752 (0.0011) [2023-10-08 03:05:06,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 160268288. Throughput: 0: 1693.7, 1: 1739.2. Samples: 40069170. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) [2023-10-08 03:05:06,211][50642] Avg episode reward: [(0, '18.500'), (1, '24.780')] [2023-10-08 03:05:07,244][52060] Updated weights for policy 0, policy_version 77770 (0.0009) [2023-10-08 03:05:07,618][52060] Updated weights for policy 0, policy_version 77780 (0.0010) [2023-10-08 03:05:07,977][52060] Updated weights for policy 0, policy_version 77790 (0.0009) [2023-10-08 03:05:09,851][52059] Updated weights for policy 1, policy_version 78762 (0.0009) [2023-10-08 03:05:10,218][52059] Updated weights for policy 1, policy_version 78772 (0.0009) [2023-10-08 03:05:10,586][52059] Updated weights for policy 1, policy_version 78782 (0.0007) [2023-10-08 03:05:11,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 160333824. Throughput: 0: 1714.7, 1: 1736.7. Samples: 40090260. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) [2023-10-08 03:05:11,211][50642] Avg episode reward: [(0, '23.510'), (1, '24.820')] [2023-10-08 03:05:12,098][52060] Updated weights for policy 0, policy_version 77800 (0.0009) [2023-10-08 03:05:12,480][52060] Updated weights for policy 0, policy_version 77810 (0.0012) [2023-10-08 03:05:12,843][52060] Updated weights for policy 0, policy_version 77820 (0.0009) [2023-10-08 03:05:14,396][52059] Updated weights for policy 1, policy_version 78792 (0.0009) [2023-10-08 03:05:14,762][52059] Updated weights for policy 1, policy_version 78802 (0.0009) [2023-10-08 03:05:15,130][52059] Updated weights for policy 1, policy_version 78812 (0.0010) [2023-10-08 03:05:16,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 160399360. Throughput: 0: 1726.6, 1: 1715.6. Samples: 40111016. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) [2023-10-08 03:05:16,211][50642] Avg episode reward: [(0, '17.460'), (1, '22.330')] [2023-10-08 03:05:16,867][52060] Updated weights for policy 0, policy_version 77830 (0.0009) [2023-10-08 03:05:17,246][52060] Updated weights for policy 0, policy_version 77840 (0.0011) [2023-10-08 03:05:17,621][52060] Updated weights for policy 0, policy_version 77850 (0.0011) [2023-10-08 03:05:19,100][52059] Updated weights for policy 1, policy_version 78822 (0.0010) [2023-10-08 03:05:19,456][52059] Updated weights for policy 1, policy_version 78832 (0.0007) [2023-10-08 03:05:19,823][52059] Updated weights for policy 1, policy_version 78842 (0.0008) [2023-10-08 03:05:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13884.8). Total num frames: 160464896. Throughput: 0: 1705.4, 1: 1751.1. Samples: 40121836. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) [2023-10-08 03:05:21,211][50642] Avg episode reward: [(0, '17.550'), (1, '23.300')] [2023-10-08 03:05:21,525][52060] Updated weights for policy 0, policy_version 77860 (0.0010) [2023-10-08 03:05:21,891][52060] Updated weights for policy 0, policy_version 77870 (0.0008) [2023-10-08 03:05:22,255][52060] Updated weights for policy 0, policy_version 77880 (0.0007) [2023-10-08 03:05:23,648][52059] Updated weights for policy 1, policy_version 78852 (0.0009) [2023-10-08 03:05:24,015][52059] Updated weights for policy 1, policy_version 78862 (0.0008) [2023-10-08 03:05:24,380][52059] Updated weights for policy 1, policy_version 78872 (0.0009) [2023-10-08 03:05:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 160530432. Throughput: 0: 1728.8, 1: 1719.4. Samples: 40141922. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) [2023-10-08 03:05:26,211][50642] Avg episode reward: [(0, '19.900'), (1, '24.620')] [2023-10-08 03:05:26,366][52060] Updated weights for policy 0, policy_version 77890 (0.0008) [2023-10-08 03:05:26,740][52060] Updated weights for policy 0, policy_version 77900 (0.0007) [2023-10-08 03:05:27,112][52060] Updated weights for policy 0, policy_version 77910 (0.0009) [2023-10-08 03:05:27,477][52060] Updated weights for policy 0, policy_version 77920 (0.0010) [2023-10-08 03:05:28,281][52059] Updated weights for policy 1, policy_version 78882 (0.0008) [2023-10-08 03:05:28,642][52059] Updated weights for policy 1, policy_version 78892 (0.0008) [2023-10-08 03:05:29,002][52059] Updated weights for policy 1, policy_version 78902 (0.0010) [2023-10-08 03:05:29,374][52059] Updated weights for policy 1, policy_version 78912 (0.0007) [2023-10-08 03:05:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 160595968. Throughput: 0: 1729.8, 1: 1725.8. Samples: 40163174. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) [2023-10-08 03:05:31,211][50642] Avg episode reward: [(0, '21.950'), (1, '24.510')] [2023-10-08 03:05:31,457][52060] Updated weights for policy 0, policy_version 77930 (0.0010) [2023-10-08 03:05:31,831][52060] Updated weights for policy 0, policy_version 77940 (0.0011) [2023-10-08 03:05:32,202][52060] Updated weights for policy 0, policy_version 77950 (0.0011) [2023-10-08 03:05:33,319][52059] Updated weights for policy 1, policy_version 78922 (0.0007) [2023-10-08 03:05:33,682][52059] Updated weights for policy 1, policy_version 78932 (0.0008) [2023-10-08 03:05:34,049][52059] Updated weights for policy 1, policy_version 78942 (0.0007) [2023-10-08 03:05:36,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 160661504. Throughput: 0: 1716.8, 1: 1733.2. Samples: 40172864. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) [2023-10-08 03:05:36,211][50642] Avg episode reward: [(0, '19.290'), (1, '24.620')] [2023-10-08 03:05:36,218][52060] Updated weights for policy 0, policy_version 77960 (0.0007) [2023-10-08 03:05:36,580][52060] Updated weights for policy 0, policy_version 77970 (0.0008) [2023-10-08 03:05:36,945][52060] Updated weights for policy 0, policy_version 77980 (0.0007) [2023-10-08 03:05:37,862][52059] Updated weights for policy 1, policy_version 78952 (0.0008) [2023-10-08 03:05:38,225][52059] Updated weights for policy 1, policy_version 78962 (0.0008) [2023-10-08 03:05:38,593][52059] Updated weights for policy 1, policy_version 78972 (0.0008) [2023-10-08 03:05:41,006][52060] Updated weights for policy 0, policy_version 77990 (0.0008) [2023-10-08 03:05:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 160727040. Throughput: 0: 1726.2, 1: 1730.8. Samples: 40193982. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) [2023-10-08 03:05:41,211][50642] Avg episode reward: [(0, '18.410'), (1, '23.820')] [2023-10-08 03:05:41,359][52060] Updated weights for policy 0, policy_version 78000 (0.0010) [2023-10-08 03:05:41,731][52060] Updated weights for policy 0, policy_version 78010 (0.0008) [2023-10-08 03:05:42,319][52059] Updated weights for policy 1, policy_version 78982 (0.0009) [2023-10-08 03:05:42,677][52059] Updated weights for policy 1, policy_version 78992 (0.0008) [2023-10-08 03:05:43,038][52059] Updated weights for policy 1, policy_version 79002 (0.0009) [2023-10-08 03:05:45,634][52060] Updated weights for policy 0, policy_version 78020 (0.0009) [2023-10-08 03:05:46,035][52060] Updated weights for policy 0, policy_version 78030 (0.0008) [2023-10-08 03:05:46,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 160792576. Throughput: 0: 1718.7, 1: 1762.0. Samples: 40215356. Policy #0 lag: (min: 31.0, avg: 32.3, max: 56.0) [2023-10-08 03:05:46,211][50642] Avg episode reward: [(0, '20.930'), (1, '25.110')] [2023-10-08 03:05:46,407][52060] Updated weights for policy 0, policy_version 78040 (0.0008) [2023-10-08 03:05:46,804][52059] Updated weights for policy 1, policy_version 79012 (0.0008) [2023-10-08 03:05:47,210][52059] Updated weights for policy 1, policy_version 79022 (0.0009) [2023-10-08 03:05:47,579][52059] Updated weights for policy 1, policy_version 79032 (0.0008) [2023-10-08 03:05:50,310][52060] Updated weights for policy 0, policy_version 78050 (0.0008) [2023-10-08 03:05:50,669][52060] Updated weights for policy 0, policy_version 78060 (0.0009) [2023-10-08 03:05:51,030][52060] Updated weights for policy 0, policy_version 78070 (0.0010) [2023-10-08 03:05:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 160858112. Throughput: 0: 1723.0, 1: 1733.8. Samples: 40224728. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:05:51,211][50642] Avg episode reward: [(0, '22.550'), (1, '26.120')] [2023-10-08 03:05:51,406][52060] Updated weights for policy 0, policy_version 78080 (0.0010) [2023-10-08 03:05:51,639][52059] Updated weights for policy 1, policy_version 79042 (0.0009) [2023-10-08 03:05:52,004][52059] Updated weights for policy 1, policy_version 79052 (0.0008) [2023-10-08 03:05:52,365][52059] Updated weights for policy 1, policy_version 79062 (0.0008) [2023-10-08 03:05:52,732][52059] Updated weights for policy 1, policy_version 79072 (0.0008) [2023-10-08 03:05:55,397][52060] Updated weights for policy 0, policy_version 78090 (0.0008) [2023-10-08 03:05:55,762][52060] Updated weights for policy 0, policy_version 78100 (0.0009) [2023-10-08 03:05:56,131][52060] Updated weights for policy 0, policy_version 78110 (0.0008) [2023-10-08 03:05:56,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 160956416. Throughput: 0: 1722.9, 1: 1737.7. Samples: 40245988. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:05:56,211][50642] Avg episode reward: [(0, '18.330'), (1, '22.280')] [2023-10-08 03:05:56,694][52059] Updated weights for policy 1, policy_version 79082 (0.0009) [2023-10-08 03:05:57,054][52059] Updated weights for policy 1, policy_version 79092 (0.0007) [2023-10-08 03:05:57,431][52059] Updated weights for policy 1, policy_version 79102 (0.0009) [2023-10-08 03:06:00,041][52060] Updated weights for policy 0, policy_version 78120 (0.0010) [2023-10-08 03:06:00,406][52060] Updated weights for policy 0, policy_version 78130 (0.0007) [2023-10-08 03:06:00,771][52060] Updated weights for policy 0, policy_version 78140 (0.0009) [2023-10-08 03:06:01,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 161021952. Throughput: 0: 1694.3, 1: 1760.1. Samples: 40266464. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:06:01,211][50642] Avg episode reward: [(0, '18.620'), (1, '22.990')] [2023-10-08 03:06:01,315][52059] Updated weights for policy 1, policy_version 79112 (0.0009) [2023-10-08 03:06:01,673][52059] Updated weights for policy 1, policy_version 79122 (0.0009) [2023-10-08 03:06:02,037][52059] Updated weights for policy 1, policy_version 79132 (0.0010) [2023-10-08 03:06:04,630][52060] Updated weights for policy 0, policy_version 78150 (0.0009) [2023-10-08 03:06:04,995][52060] Updated weights for policy 0, policy_version 78160 (0.0008) [2023-10-08 03:06:05,366][52060] Updated weights for policy 0, policy_version 78170 (0.0008) [2023-10-08 03:06:06,000][52059] Updated weights for policy 1, policy_version 79142 (0.0010) [2023-10-08 03:06:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 161087488. Throughput: 0: 1720.6, 1: 1726.9. Samples: 40276972. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:06:06,211][50642] Avg episode reward: [(0, '18.500'), (1, '24.700')] [2023-10-08 03:06:06,367][52059] Updated weights for policy 1, policy_version 79152 (0.0009) [2023-10-08 03:06:06,733][52059] Updated weights for policy 1, policy_version 79162 (0.0008) [2023-10-08 03:06:09,280][52060] Updated weights for policy 0, policy_version 78180 (0.0010) [2023-10-08 03:06:09,641][52060] Updated weights for policy 0, policy_version 78190 (0.0010) [2023-10-08 03:06:10,009][52060] Updated weights for policy 0, policy_version 78200 (0.0010) [2023-10-08 03:06:10,691][52059] Updated weights for policy 1, policy_version 79172 (0.0009) [2023-10-08 03:06:11,063][52059] Updated weights for policy 1, policy_version 79182 (0.0010) [2023-10-08 03:06:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 161153024. Throughput: 0: 1704.7, 1: 1758.3. Samples: 40297758. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:06:11,211][50642] Avg episode reward: [(0, '19.210'), (1, '27.330')] [2023-10-08 03:06:11,417][52059] Updated weights for policy 1, policy_version 79192 (0.0009) [2023-10-08 03:06:13,934][52060] Updated weights for policy 0, policy_version 78210 (0.0011) [2023-10-08 03:06:14,292][52060] Updated weights for policy 0, policy_version 78220 (0.0008) [2023-10-08 03:06:14,666][52060] Updated weights for policy 0, policy_version 78230 (0.0008) [2023-10-08 03:06:15,044][52060] Updated weights for policy 0, policy_version 78240 (0.0011) [2023-10-08 03:06:15,281][52059] Updated weights for policy 1, policy_version 79202 (0.0008) [2023-10-08 03:06:15,643][52059] Updated weights for policy 1, policy_version 79212 (0.0010) [2023-10-08 03:06:16,016][52059] Updated weights for policy 1, policy_version 79222 (0.0008) [2023-10-08 03:06:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 161218560. Throughput: 0: 1695.7, 1: 1746.1. Samples: 40318054. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:06:16,211][50642] Avg episode reward: [(0, '17.830'), (1, '23.190')] [2023-10-08 03:06:16,383][52059] Updated weights for policy 1, policy_version 79232 (0.0007) [2023-10-08 03:06:19,107][52060] Updated weights for policy 0, policy_version 78250 (0.0008) [2023-10-08 03:06:19,464][52060] Updated weights for policy 0, policy_version 78260 (0.0008) [2023-10-08 03:06:19,834][52060] Updated weights for policy 0, policy_version 78270 (0.0010) [2023-10-08 03:06:20,272][52059] Updated weights for policy 1, policy_version 79242 (0.0009) [2023-10-08 03:06:20,644][52059] Updated weights for policy 1, policy_version 79252 (0.0011) [2023-10-08 03:06:21,008][52059] Updated weights for policy 1, policy_version 79262 (0.0009) [2023-10-08 03:06:21,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 161316864. Throughput: 0: 1724.8, 1: 1754.4. Samples: 40329428. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:06:21,211][50642] Avg episode reward: [(0, '18.810'), (1, '22.560')] [2023-10-08 03:06:23,839][52060] Updated weights for policy 0, policy_version 78280 (0.0009) [2023-10-08 03:06:24,207][52060] Updated weights for policy 0, policy_version 78290 (0.0011) [2023-10-08 03:06:24,584][52060] Updated weights for policy 0, policy_version 78300 (0.0009) [2023-10-08 03:06:25,012][52059] Updated weights for policy 1, policy_version 79272 (0.0007) [2023-10-08 03:06:25,362][52059] Updated weights for policy 1, policy_version 79282 (0.0009) [2023-10-08 03:06:25,731][52059] Updated weights for policy 1, policy_version 79292 (0.0008) [2023-10-08 03:06:26,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 161382400. Throughput: 0: 1697.1, 1: 1754.8. Samples: 40349318. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:06:26,211][50642] Avg episode reward: [(0, '19.730'), (1, '24.110')] [2023-10-08 03:06:28,437][52060] Updated weights for policy 0, policy_version 78310 (0.0009) [2023-10-08 03:06:28,809][52060] Updated weights for policy 0, policy_version 78320 (0.0009) [2023-10-08 03:06:29,183][52060] Updated weights for policy 0, policy_version 78330 (0.0009) [2023-10-08 03:06:29,593][52059] Updated weights for policy 1, policy_version 79302 (0.0008) [2023-10-08 03:06:29,961][52059] Updated weights for policy 1, policy_version 79312 (0.0010) [2023-10-08 03:06:30,332][52059] Updated weights for policy 1, policy_version 79322 (0.0008) [2023-10-08 03:06:31,211][50642] Fps is (10 sec: 13106.7, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 161447936. Throughput: 0: 1707.6, 1: 1725.4. Samples: 40369842. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:06:31,212][50642] Avg episode reward: [(0, '22.660'), (1, '25.760')] [2023-10-08 03:06:33,245][52060] Updated weights for policy 0, policy_version 78340 (0.0010) [2023-10-08 03:06:33,624][52060] Updated weights for policy 0, policy_version 78350 (0.0008) [2023-10-08 03:06:33,997][52060] Updated weights for policy 0, policy_version 78360 (0.0009) [2023-10-08 03:06:34,195][52059] Updated weights for policy 1, policy_version 79332 (0.0007) [2023-10-08 03:06:34,555][52059] Updated weights for policy 1, policy_version 79342 (0.0008) [2023-10-08 03:06:34,917][52059] Updated weights for policy 1, policy_version 79352 (0.0011) [2023-10-08 03:06:36,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 161513472. Throughput: 0: 1709.8, 1: 1763.4. Samples: 40381020. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:06:36,211][50642] Avg episode reward: [(0, '19.030'), (1, '26.860')] [2023-10-08 03:06:37,980][52060] Updated weights for policy 0, policy_version 78370 (0.0010) [2023-10-08 03:06:38,348][52060] Updated weights for policy 0, policy_version 78380 (0.0010) [2023-10-08 03:06:38,705][52060] Updated weights for policy 0, policy_version 78390 (0.0010) [2023-10-08 03:06:38,824][52059] Updated weights for policy 1, policy_version 79362 (0.0009) [2023-10-08 03:06:39,077][52060] Updated weights for policy 0, policy_version 78400 (0.0009) [2023-10-08 03:06:39,197][52059] Updated weights for policy 1, policy_version 79372 (0.0008) [2023-10-08 03:06:39,565][52059] Updated weights for policy 1, policy_version 79382 (0.0008) [2023-10-08 03:06:39,929][52059] Updated weights for policy 1, policy_version 79392 (0.0007) [2023-10-08 03:06:41,210][50642] Fps is (10 sec: 13107.7, 60 sec: 14199.4, 300 sec: 13884.8). Total num frames: 161579008. Throughput: 0: 1694.8, 1: 1739.0. Samples: 40400510. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:06:41,211][50642] Avg episode reward: [(0, '21.550'), (1, '23.520')] [2023-10-08 03:06:43,026][52060] Updated weights for policy 0, policy_version 78410 (0.0010) [2023-10-08 03:06:43,399][52060] Updated weights for policy 0, policy_version 78420 (0.0009) [2023-10-08 03:06:43,761][52060] Updated weights for policy 0, policy_version 78430 (0.0009) [2023-10-08 03:06:43,917][52059] Updated weights for policy 1, policy_version 79402 (0.0010) [2023-10-08 03:06:44,291][52059] Updated weights for policy 1, policy_version 79412 (0.0010) [2023-10-08 03:06:44,655][52059] Updated weights for policy 1, policy_version 79422 (0.0009) [2023-10-08 03:06:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 161644544. Throughput: 0: 1718.8, 1: 1728.8. Samples: 40421608. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:06:46,211][50642] Avg episode reward: [(0, '19.840'), (1, '24.140')] [2023-10-08 03:06:46,218][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000079424_81330176.pth... [2023-10-08 03:06:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000078432_80314368.pth... [2023-10-08 03:06:46,249][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000077792_79659008.pth [2023-10-08 03:06:46,252][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000076832_78675968.pth [2023-10-08 03:06:47,782][52060] Updated weights for policy 0, policy_version 78440 (0.0009) [2023-10-08 03:06:48,144][52060] Updated weights for policy 0, policy_version 78450 (0.0007) [2023-10-08 03:06:48,463][52059] Updated weights for policy 1, policy_version 79432 (0.0008) [2023-10-08 03:06:48,514][52060] Updated weights for policy 0, policy_version 78460 (0.0008) [2023-10-08 03:06:48,834][52059] Updated weights for policy 1, policy_version 79442 (0.0008) [2023-10-08 03:06:49,199][52059] Updated weights for policy 1, policy_version 79452 (0.0008) [2023-10-08 03:06:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 161710080. Throughput: 0: 1692.7, 1: 1745.3. Samples: 40431682. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:06:51,211][50642] Avg episode reward: [(0, '20.940'), (1, '24.680')] [2023-10-08 03:06:52,409][52060] Updated weights for policy 0, policy_version 78470 (0.0009) [2023-10-08 03:06:52,777][52060] Updated weights for policy 0, policy_version 78480 (0.0010) [2023-10-08 03:06:53,149][52060] Updated weights for policy 0, policy_version 78490 (0.0008) [2023-10-08 03:06:53,215][52059] Updated weights for policy 1, policy_version 79462 (0.0008) [2023-10-08 03:06:53,577][52059] Updated weights for policy 1, policy_version 79472 (0.0008) [2023-10-08 03:06:53,940][52059] Updated weights for policy 1, policy_version 79482 (0.0007) [2023-10-08 03:06:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 161775616. Throughput: 0: 1713.2, 1: 1726.6. Samples: 40452548. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:06:56,211][50642] Avg episode reward: [(0, '19.150'), (1, '26.120')] [2023-10-08 03:06:57,184][52060] Updated weights for policy 0, policy_version 78500 (0.0009) [2023-10-08 03:06:57,560][52060] Updated weights for policy 0, policy_version 78510 (0.0010) [2023-10-08 03:06:57,634][52059] Updated weights for policy 1, policy_version 79492 (0.0008) [2023-10-08 03:06:57,928][52060] Updated weights for policy 0, policy_version 78520 (0.0008) [2023-10-08 03:06:57,991][52059] Updated weights for policy 1, policy_version 79502 (0.0007) [2023-10-08 03:06:58,354][52059] Updated weights for policy 1, policy_version 79512 (0.0007) [2023-10-08 03:07:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 161841152. Throughput: 0: 1721.5, 1: 1746.6. Samples: 40474120. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:07:01,211][50642] Avg episode reward: [(0, '20.900'), (1, '25.530')] [2023-10-08 03:07:01,958][52060] Updated weights for policy 0, policy_version 78530 (0.0007) [2023-10-08 03:07:02,177][52059] Updated weights for policy 1, policy_version 79522 (0.0008) [2023-10-08 03:07:02,329][52060] Updated weights for policy 0, policy_version 78540 (0.0009) [2023-10-08 03:07:02,544][52059] Updated weights for policy 1, policy_version 79532 (0.0008) [2023-10-08 03:07:02,695][52060] Updated weights for policy 0, policy_version 78550 (0.0007) [2023-10-08 03:07:02,907][52059] Updated weights for policy 1, policy_version 79542 (0.0007) [2023-10-08 03:07:03,057][52060] Updated weights for policy 0, policy_version 78560 (0.0007) [2023-10-08 03:07:03,271][52059] Updated weights for policy 1, policy_version 79552 (0.0008) [2023-10-08 03:07:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 161906688. Throughput: 0: 1694.1, 1: 1727.9. Samples: 40483418. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:07:06,211][50642] Avg episode reward: [(0, '21.190'), (1, '22.930')] [2023-10-08 03:07:07,099][52060] Updated weights for policy 0, policy_version 78570 (0.0007) [2023-10-08 03:07:07,201][52059] Updated weights for policy 1, policy_version 79562 (0.0009) [2023-10-08 03:07:07,461][52060] Updated weights for policy 0, policy_version 78580 (0.0007) [2023-10-08 03:07:07,569][52059] Updated weights for policy 1, policy_version 79572 (0.0009) [2023-10-08 03:07:07,832][52060] Updated weights for policy 0, policy_version 78590 (0.0009) [2023-10-08 03:07:07,933][52059] Updated weights for policy 1, policy_version 79582 (0.0007) [2023-10-08 03:07:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 161972224. Throughput: 0: 1724.0, 1: 1734.5. Samples: 40504948. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:07:11,211][50642] Avg episode reward: [(0, '21.230'), (1, '25.520')] [2023-10-08 03:07:11,848][52060] Updated weights for policy 0, policy_version 78600 (0.0008) [2023-10-08 03:07:11,868][52059] Updated weights for policy 1, policy_version 79592 (0.0009) [2023-10-08 03:07:12,219][52060] Updated weights for policy 0, policy_version 78610 (0.0008) [2023-10-08 03:07:12,230][52059] Updated weights for policy 1, policy_version 79602 (0.0008) [2023-10-08 03:07:12,585][52060] Updated weights for policy 0, policy_version 78620 (0.0008) [2023-10-08 03:07:12,602][52059] Updated weights for policy 1, policy_version 79612 (0.0008) [2023-10-08 03:07:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 162037760. Throughput: 0: 1721.1, 1: 1757.6. Samples: 40526382. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:07:16,211][50642] Avg episode reward: [(0, '21.150'), (1, '24.620')] [2023-10-08 03:07:16,513][52060] Updated weights for policy 0, policy_version 78630 (0.0009) [2023-10-08 03:07:16,531][52059] Updated weights for policy 1, policy_version 79622 (0.0008) [2023-10-08 03:07:16,882][52060] Updated weights for policy 0, policy_version 78640 (0.0007) [2023-10-08 03:07:16,888][52059] Updated weights for policy 1, policy_version 79632 (0.0007) [2023-10-08 03:07:17,247][52060] Updated weights for policy 0, policy_version 78650 (0.0007) [2023-10-08 03:07:17,250][52059] Updated weights for policy 1, policy_version 79642 (0.0007) [2023-10-08 03:07:21,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 162103296. Throughput: 0: 1711.3, 1: 1723.9. Samples: 40535608. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:07:21,211][50642] Avg episode reward: [(0, '20.280'), (1, '24.140')] [2023-10-08 03:07:21,262][52059] Updated weights for policy 1, policy_version 79652 (0.0009) [2023-10-08 03:07:21,309][52060] Updated weights for policy 0, policy_version 78660 (0.0010) [2023-10-08 03:07:21,624][52059] Updated weights for policy 1, policy_version 79662 (0.0008) [2023-10-08 03:07:21,684][52060] Updated weights for policy 0, policy_version 78670 (0.0008) [2023-10-08 03:07:21,993][52059] Updated weights for policy 1, policy_version 79672 (0.0009) [2023-10-08 03:07:22,042][52060] Updated weights for policy 0, policy_version 78680 (0.0008) [2023-10-08 03:07:25,851][52059] Updated weights for policy 1, policy_version 79682 (0.0009) [2023-10-08 03:07:25,996][52060] Updated weights for policy 0, policy_version 78690 (0.0008) [2023-10-08 03:07:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 162168832. Throughput: 0: 1719.3, 1: 1748.7. Samples: 40556568. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:07:26,211][50642] Avg episode reward: [(0, '22.780'), (1, '23.840')] [2023-10-08 03:07:26,211][52059] Updated weights for policy 1, policy_version 79692 (0.0009) [2023-10-08 03:07:26,372][52060] Updated weights for policy 0, policy_version 78700 (0.0008) [2023-10-08 03:07:26,574][52059] Updated weights for policy 1, policy_version 79702 (0.0008) [2023-10-08 03:07:26,739][52060] Updated weights for policy 0, policy_version 78710 (0.0008) [2023-10-08 03:07:26,942][52059] Updated weights for policy 1, policy_version 79712 (0.0008) [2023-10-08 03:07:27,105][52060] Updated weights for policy 0, policy_version 78720 (0.0008) [2023-10-08 03:07:30,759][52059] Updated weights for policy 1, policy_version 79722 (0.0008) [2023-10-08 03:07:31,121][52059] Updated weights for policy 1, policy_version 79732 (0.0008) [2023-10-08 03:07:31,173][52060] Updated weights for policy 0, policy_version 78730 (0.0008) [2023-10-08 03:07:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 162234368. Throughput: 0: 1720.4, 1: 1746.5. Samples: 40577620. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:07:31,211][50642] Avg episode reward: [(0, '20.820'), (1, '25.030')] [2023-10-08 03:07:31,493][52059] Updated weights for policy 1, policy_version 79742 (0.0009) [2023-10-08 03:07:31,541][52060] Updated weights for policy 0, policy_version 78740 (0.0008) [2023-10-08 03:07:31,914][52060] Updated weights for policy 0, policy_version 78750 (0.0010) [2023-10-08 03:07:35,437][52059] Updated weights for policy 1, policy_version 79752 (0.0009) [2023-10-08 03:07:35,729][52060] Updated weights for policy 0, policy_version 78760 (0.0008) [2023-10-08 03:07:35,812][52059] Updated weights for policy 1, policy_version 79762 (0.0009) [2023-10-08 03:07:36,092][52060] Updated weights for policy 0, policy_version 78770 (0.0008) [2023-10-08 03:07:36,177][52059] Updated weights for policy 1, policy_version 79772 (0.0009) [2023-10-08 03:07:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 162299904. Throughput: 0: 1722.4, 1: 1740.4. Samples: 40587508. Policy #0 lag: (min: 31.0, avg: 34.8, max: 63.0) [2023-10-08 03:07:36,211][50642] Avg episode reward: [(0, '19.170'), (1, '25.020')] [2023-10-08 03:07:36,451][52060] Updated weights for policy 0, policy_version 78780 (0.0007) [2023-10-08 03:07:40,052][52059] Updated weights for policy 1, policy_version 79782 (0.0007) [2023-10-08 03:07:40,313][52060] Updated weights for policy 0, policy_version 78790 (0.0009) [2023-10-08 03:07:40,410][52059] Updated weights for policy 1, policy_version 79792 (0.0008) [2023-10-08 03:07:40,676][52060] Updated weights for policy 0, policy_version 78800 (0.0008) [2023-10-08 03:07:40,779][52059] Updated weights for policy 1, policy_version 79802 (0.0008) [2023-10-08 03:07:41,046][52060] Updated weights for policy 0, policy_version 78810 (0.0008) [2023-10-08 03:07:41,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 162398208. Throughput: 0: 1717.1, 1: 1758.0. Samples: 40608926. Policy #0 lag: (min: 31.0, avg: 34.8, max: 63.0) [2023-10-08 03:07:41,211][50642] Avg episode reward: [(0, '20.850'), (1, '24.040')] [2023-10-08 03:07:44,711][52059] Updated weights for policy 1, policy_version 79812 (0.0009) [2023-10-08 03:07:45,074][52060] Updated weights for policy 0, policy_version 78820 (0.0008) [2023-10-08 03:07:45,079][52059] Updated weights for policy 1, policy_version 79822 (0.0008) [2023-10-08 03:07:45,437][52060] Updated weights for policy 0, policy_version 78830 (0.0007) [2023-10-08 03:07:45,443][52059] Updated weights for policy 1, policy_version 79832 (0.0009) [2023-10-08 03:07:45,816][52060] Updated weights for policy 0, policy_version 78840 (0.0007) [2023-10-08 03:07:46,210][50642] Fps is (10 sec: 19660.7, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 162496512. Throughput: 0: 1695.8, 1: 1723.4. Samples: 40627982. Policy #0 lag: (min: 31.0, avg: 34.8, max: 63.0) [2023-10-08 03:07:46,211][50642] Avg episode reward: [(0, '22.530'), (1, '24.750')] [2023-10-08 03:07:49,512][52059] Updated weights for policy 1, policy_version 79842 (0.0009) [2023-10-08 03:07:49,787][52060] Updated weights for policy 0, policy_version 78850 (0.0007) [2023-10-08 03:07:49,877][52059] Updated weights for policy 1, policy_version 79852 (0.0007) [2023-10-08 03:07:50,164][52060] Updated weights for policy 0, policy_version 78860 (0.0008) [2023-10-08 03:07:50,231][52059] Updated weights for policy 1, policy_version 79862 (0.0007) [2023-10-08 03:07:50,531][52060] Updated weights for policy 0, policy_version 78870 (0.0008) [2023-10-08 03:07:50,592][52059] Updated weights for policy 1, policy_version 79872 (0.0007) [2023-10-08 03:07:50,896][52060] Updated weights for policy 0, policy_version 78880 (0.0008) [2023-10-08 03:07:51,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 162562048. Throughput: 0: 1722.1, 1: 1752.1. Samples: 40639760. Policy #0 lag: (min: 31.0, avg: 34.8, max: 63.0) [2023-10-08 03:07:51,211][50642] Avg episode reward: [(0, '20.900'), (1, '24.650')] [2023-10-08 03:07:54,725][52059] Updated weights for policy 1, policy_version 79882 (0.0008) [2023-10-08 03:07:54,923][52060] Updated weights for policy 0, policy_version 78890 (0.0010) [2023-10-08 03:07:55,080][52059] Updated weights for policy 1, policy_version 79892 (0.0008) [2023-10-08 03:07:55,291][52060] Updated weights for policy 0, policy_version 78900 (0.0007) [2023-10-08 03:07:55,446][52059] Updated weights for policy 1, policy_version 79902 (0.0010) [2023-10-08 03:07:55,659][52060] Updated weights for policy 0, policy_version 78910 (0.0009) [2023-10-08 03:07:56,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 162627584. Throughput: 0: 1713.7, 1: 1734.7. Samples: 40660124. Policy #0 lag: (min: 31.0, avg: 34.8, max: 63.0) [2023-10-08 03:07:56,211][50642] Avg episode reward: [(0, '20.060'), (1, '25.690')] [2023-10-08 03:07:59,269][52059] Updated weights for policy 1, policy_version 79912 (0.0011) [2023-10-08 03:07:59,623][52060] Updated weights for policy 0, policy_version 78920 (0.0008) [2023-10-08 03:07:59,638][52059] Updated weights for policy 1, policy_version 79922 (0.0009) [2023-10-08 03:08:00,001][52059] Updated weights for policy 1, policy_version 79932 (0.0008) [2023-10-08 03:08:00,002][52060] Updated weights for policy 0, policy_version 78930 (0.0008) [2023-10-08 03:08:00,363][52060] Updated weights for policy 0, policy_version 78940 (0.0010) [2023-10-08 03:08:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 162693120. Throughput: 0: 1690.2, 1: 1718.0. Samples: 40679752. Policy #0 lag: (min: 31.0, avg: 34.8, max: 63.0) [2023-10-08 03:08:01,211][50642] Avg episode reward: [(0, '20.520'), (1, '25.190')] [2023-10-08 03:08:03,825][52059] Updated weights for policy 1, policy_version 79942 (0.0008) [2023-10-08 03:08:04,191][52059] Updated weights for policy 1, policy_version 79952 (0.0008) [2023-10-08 03:08:04,407][52060] Updated weights for policy 0, policy_version 78950 (0.0009) [2023-10-08 03:08:04,554][52059] Updated weights for policy 1, policy_version 79962 (0.0008) [2023-10-08 03:08:04,774][52060] Updated weights for policy 0, policy_version 78960 (0.0008) [2023-10-08 03:08:05,145][52060] Updated weights for policy 0, policy_version 78970 (0.0007) [2023-10-08 03:08:06,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 162758656. Throughput: 0: 1720.8, 1: 1742.4. Samples: 40691452. Policy #0 lag: (min: 31.0, avg: 34.8, max: 63.0) [2023-10-08 03:08:06,211][50642] Avg episode reward: [(0, '23.690'), (1, '22.600')] [2023-10-08 03:08:08,575][52059] Updated weights for policy 1, policy_version 79972 (0.0008) [2023-10-08 03:08:08,957][52059] Updated weights for policy 1, policy_version 79982 (0.0008) [2023-10-08 03:08:09,123][52060] Updated weights for policy 0, policy_version 78980 (0.0008) [2023-10-08 03:08:09,311][52059] Updated weights for policy 1, policy_version 79992 (0.0008) [2023-10-08 03:08:09,519][52060] Updated weights for policy 0, policy_version 78990 (0.0008) [2023-10-08 03:08:09,885][52060] Updated weights for policy 0, policy_version 79000 (0.0009) [2023-10-08 03:08:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 162824192. Throughput: 0: 1704.1, 1: 1718.4. Samples: 40710582. Policy #0 lag: (min: 31.0, avg: 34.8, max: 63.0) [2023-10-08 03:08:11,211][50642] Avg episode reward: [(0, '20.310'), (1, '23.240')] [2023-10-08 03:08:13,003][52059] Updated weights for policy 1, policy_version 80002 (0.0010) [2023-10-08 03:08:13,370][52059] Updated weights for policy 1, policy_version 80012 (0.0008) [2023-10-08 03:08:13,740][52059] Updated weights for policy 1, policy_version 80022 (0.0007) [2023-10-08 03:08:13,968][52060] Updated weights for policy 0, policy_version 79010 (0.0011) [2023-10-08 03:08:14,097][52059] Updated weights for policy 1, policy_version 80032 (0.0008) [2023-10-08 03:08:14,333][52060] Updated weights for policy 0, policy_version 79020 (0.0007) [2023-10-08 03:08:14,708][52060] Updated weights for policy 0, policy_version 79030 (0.0008) [2023-10-08 03:08:15,070][52060] Updated weights for policy 0, policy_version 79040 (0.0008) [2023-10-08 03:08:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 162889728. Throughput: 0: 1687.9, 1: 1735.2. Samples: 40731658. Policy #0 lag: (min: 31.0, avg: 34.8, max: 63.0) [2023-10-08 03:08:16,211][50642] Avg episode reward: [(0, '19.330'), (1, '23.970')] [2023-10-08 03:08:17,978][52059] Updated weights for policy 1, policy_version 80042 (0.0008) [2023-10-08 03:08:18,346][52059] Updated weights for policy 1, policy_version 80052 (0.0009) [2023-10-08 03:08:18,719][52059] Updated weights for policy 1, policy_version 80062 (0.0010) [2023-10-08 03:08:19,083][52060] Updated weights for policy 0, policy_version 79050 (0.0010) [2023-10-08 03:08:19,445][52060] Updated weights for policy 0, policy_version 79060 (0.0007) [2023-10-08 03:08:19,825][52060] Updated weights for policy 0, policy_version 79070 (0.0009) [2023-10-08 03:08:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 162955264. Throughput: 0: 1711.9, 1: 1726.0. Samples: 40742214. Policy #0 lag: (min: 31.0, avg: 34.8, max: 63.0) [2023-10-08 03:08:21,211][50642] Avg episode reward: [(0, '20.740'), (1, '25.610')] [2023-10-08 03:08:22,770][52059] Updated weights for policy 1, policy_version 80072 (0.0010) [2023-10-08 03:08:23,135][52059] Updated weights for policy 1, policy_version 80082 (0.0011) [2023-10-08 03:08:23,501][52059] Updated weights for policy 1, policy_version 80092 (0.0010) [2023-10-08 03:08:23,849][52060] Updated weights for policy 0, policy_version 79080 (0.0009) [2023-10-08 03:08:24,214][52060] Updated weights for policy 0, policy_version 79090 (0.0008) [2023-10-08 03:08:24,585][52060] Updated weights for policy 0, policy_version 79100 (0.0007) [2023-10-08 03:08:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 163020800. Throughput: 0: 1686.9, 1: 1719.6. Samples: 40762220. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) [2023-10-08 03:08:26,211][50642] Avg episode reward: [(0, '21.550'), (1, '22.760')] [2023-10-08 03:08:27,326][52059] Updated weights for policy 1, policy_version 80102 (0.0008) [2023-10-08 03:08:27,701][52059] Updated weights for policy 1, policy_version 80112 (0.0007) [2023-10-08 03:08:28,066][52059] Updated weights for policy 1, policy_version 80122 (0.0008) [2023-10-08 03:08:28,542][52060] Updated weights for policy 0, policy_version 79110 (0.0009) [2023-10-08 03:08:28,909][52060] Updated weights for policy 0, policy_version 79120 (0.0008) [2023-10-08 03:08:29,282][52060] Updated weights for policy 0, policy_version 79130 (0.0008) [2023-10-08 03:08:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 163086336. Throughput: 0: 1710.2, 1: 1751.9. Samples: 40783774. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) [2023-10-08 03:08:31,211][50642] Avg episode reward: [(0, '17.560'), (1, '23.120')] [2023-10-08 03:08:31,877][52059] Updated weights for policy 1, policy_version 80132 (0.0010) [2023-10-08 03:08:32,232][52059] Updated weights for policy 1, policy_version 80142 (0.0010) [2023-10-08 03:08:32,592][52059] Updated weights for policy 1, policy_version 80152 (0.0009) [2023-10-08 03:08:33,220][52060] Updated weights for policy 0, policy_version 79140 (0.0008) [2023-10-08 03:08:33,604][52060] Updated weights for policy 0, policy_version 79150 (0.0009) [2023-10-08 03:08:33,971][52060] Updated weights for policy 0, policy_version 79160 (0.0007) [2023-10-08 03:08:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 163151872. Throughput: 0: 1698.8, 1: 1722.4. Samples: 40793712. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) [2023-10-08 03:08:36,211][50642] Avg episode reward: [(0, '19.050'), (1, '25.590')] [2023-10-08 03:08:36,507][52059] Updated weights for policy 1, policy_version 80162 (0.0008) [2023-10-08 03:08:36,869][52059] Updated weights for policy 1, policy_version 80172 (0.0007) [2023-10-08 03:08:37,238][52059] Updated weights for policy 1, policy_version 80182 (0.0009) [2023-10-08 03:08:37,592][52059] Updated weights for policy 1, policy_version 80192 (0.0009) [2023-10-08 03:08:37,895][52060] Updated weights for policy 0, policy_version 79170 (0.0009) [2023-10-08 03:08:38,262][52060] Updated weights for policy 0, policy_version 79180 (0.0008) [2023-10-08 03:08:38,629][52060] Updated weights for policy 0, policy_version 79190 (0.0008) [2023-10-08 03:08:39,011][52060] Updated weights for policy 0, policy_version 79200 (0.0009) [2023-10-08 03:08:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 163217408. Throughput: 0: 1691.9, 1: 1735.0. Samples: 40814334. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) [2023-10-08 03:08:41,211][50642] Avg episode reward: [(0, '19.390'), (1, '25.090')] [2023-10-08 03:08:41,540][52059] Updated weights for policy 1, policy_version 80202 (0.0008) [2023-10-08 03:08:41,907][52059] Updated weights for policy 1, policy_version 80212 (0.0008) [2023-10-08 03:08:42,271][52059] Updated weights for policy 1, policy_version 80222 (0.0008) [2023-10-08 03:08:42,882][52060] Updated weights for policy 0, policy_version 79210 (0.0011) [2023-10-08 03:08:43,255][52060] Updated weights for policy 0, policy_version 79220 (0.0008) [2023-10-08 03:08:43,622][52060] Updated weights for policy 0, policy_version 79230 (0.0007) [2023-10-08 03:08:46,182][52059] Updated weights for policy 1, policy_version 80232 (0.0007) [2023-10-08 03:08:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 163282944. Throughput: 0: 1714.1, 1: 1755.1. Samples: 40835868. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) [2023-10-08 03:08:46,211][50642] Avg episode reward: [(0, '19.370'), (1, '24.870')] [2023-10-08 03:08:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000079232_81133568.pth... [2023-10-08 03:08:46,247][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000077632_79495168.pth [2023-10-08 03:08:46,542][52059] Updated weights for policy 1, policy_version 80242 (0.0007) [2023-10-08 03:08:46,906][52059] Updated weights for policy 1, policy_version 80252 (0.0007) [2023-10-08 03:08:47,042][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000080256_82182144.pth... [2023-10-08 03:08:47,082][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000078624_80510976.pth [2023-10-08 03:08:47,679][52060] Updated weights for policy 0, policy_version 79240 (0.0010) [2023-10-08 03:08:48,043][52060] Updated weights for policy 0, policy_version 79250 (0.0009) [2023-10-08 03:08:48,420][52060] Updated weights for policy 0, policy_version 79260 (0.0009) [2023-10-08 03:08:50,874][52059] Updated weights for policy 1, policy_version 80262 (0.0008) [2023-10-08 03:08:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 163348480. Throughput: 0: 1684.8, 1: 1733.6. Samples: 40845282. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) [2023-10-08 03:08:51,211][50642] Avg episode reward: [(0, '18.180'), (1, '21.950')] [2023-10-08 03:08:51,238][52059] Updated weights for policy 1, policy_version 80272 (0.0008) [2023-10-08 03:08:51,596][52059] Updated weights for policy 1, policy_version 80282 (0.0009) [2023-10-08 03:08:52,511][52060] Updated weights for policy 0, policy_version 79270 (0.0009) [2023-10-08 03:08:52,878][52060] Updated weights for policy 0, policy_version 79280 (0.0010) [2023-10-08 03:08:53,242][52060] Updated weights for policy 0, policy_version 79290 (0.0008) [2023-10-08 03:08:55,628][52059] Updated weights for policy 1, policy_version 80292 (0.0010) [2023-10-08 03:08:56,028][52059] Updated weights for policy 1, policy_version 80302 (0.0009) [2023-10-08 03:08:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13773.7). Total num frames: 163414016. Throughput: 0: 1710.5, 1: 1763.7. Samples: 40866922. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) [2023-10-08 03:08:56,211][50642] Avg episode reward: [(0, '19.450'), (1, '25.460')] [2023-10-08 03:08:56,394][52059] Updated weights for policy 1, policy_version 80312 (0.0011) [2023-10-08 03:08:57,257][52060] Updated weights for policy 0, policy_version 79300 (0.0010) [2023-10-08 03:08:57,642][52060] Updated weights for policy 0, policy_version 79310 (0.0009) [2023-10-08 03:08:58,011][52060] Updated weights for policy 0, policy_version 79320 (0.0008) [2023-10-08 03:09:00,282][52059] Updated weights for policy 1, policy_version 80322 (0.0008) [2023-10-08 03:09:00,643][52059] Updated weights for policy 1, policy_version 80332 (0.0007) [2023-10-08 03:09:01,009][52059] Updated weights for policy 1, policy_version 80342 (0.0009) [2023-10-08 03:09:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 163479552. Throughput: 0: 1721.6, 1: 1736.7. Samples: 40887280. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) [2023-10-08 03:09:01,211][50642] Avg episode reward: [(0, '21.720'), (1, '24.540')] [2023-10-08 03:09:01,369][52059] Updated weights for policy 1, policy_version 80352 (0.0008) [2023-10-08 03:09:01,916][52060] Updated weights for policy 0, policy_version 79330 (0.0008) [2023-10-08 03:09:02,284][52060] Updated weights for policy 0, policy_version 79340 (0.0007) [2023-10-08 03:09:02,662][52060] Updated weights for policy 0, policy_version 79350 (0.0011) [2023-10-08 03:09:03,028][52060] Updated weights for policy 0, policy_version 79360 (0.0011) [2023-10-08 03:09:05,365][52059] Updated weights for policy 1, policy_version 80362 (0.0008) [2023-10-08 03:09:05,731][52059] Updated weights for policy 1, policy_version 80372 (0.0010) [2023-10-08 03:09:06,094][52059] Updated weights for policy 1, policy_version 80382 (0.0008) [2023-10-08 03:09:06,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 163577856. Throughput: 0: 1694.0, 1: 1752.8. Samples: 40897318. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) [2023-10-08 03:09:06,211][50642] Avg episode reward: [(0, '19.200'), (1, '22.440')] [2023-10-08 03:09:07,146][52060] Updated weights for policy 0, policy_version 79370 (0.0007) [2023-10-08 03:09:07,524][52060] Updated weights for policy 0, policy_version 79380 (0.0007) [2023-10-08 03:09:07,887][52060] Updated weights for policy 0, policy_version 79390 (0.0007) [2023-10-08 03:09:09,855][52059] Updated weights for policy 1, policy_version 80392 (0.0010) [2023-10-08 03:09:10,215][52059] Updated weights for policy 1, policy_version 80402 (0.0008) [2023-10-08 03:09:10,578][52059] Updated weights for policy 1, policy_version 80412 (0.0007) [2023-10-08 03:09:11,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 163643392. Throughput: 0: 1717.5, 1: 1749.6. Samples: 40918238. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) [2023-10-08 03:09:11,211][50642] Avg episode reward: [(0, '20.100'), (1, '22.450')] [2023-10-08 03:09:11,831][52060] Updated weights for policy 0, policy_version 79400 (0.0007) [2023-10-08 03:09:12,191][52060] Updated weights for policy 0, policy_version 79410 (0.0007) [2023-10-08 03:09:12,572][52060] Updated weights for policy 0, policy_version 79420 (0.0007) [2023-10-08 03:09:14,612][52059] Updated weights for policy 1, policy_version 80422 (0.0008) [2023-10-08 03:09:14,986][52059] Updated weights for policy 1, policy_version 80432 (0.0008) [2023-10-08 03:09:15,352][52059] Updated weights for policy 1, policy_version 80442 (0.0008) [2023-10-08 03:09:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 163708928. Throughput: 0: 1722.0, 1: 1719.5. Samples: 40938644. Policy #0 lag: (min: 31.0, avg: 33.7, max: 63.0) [2023-10-08 03:09:16,211][50642] Avg episode reward: [(0, '21.990'), (1, '22.570')] [2023-10-08 03:09:16,409][52060] Updated weights for policy 0, policy_version 79430 (0.0008) [2023-10-08 03:09:16,781][52060] Updated weights for policy 0, policy_version 79440 (0.0007) [2023-10-08 03:09:17,152][52060] Updated weights for policy 0, policy_version 79450 (0.0008) [2023-10-08 03:09:19,249][52059] Updated weights for policy 1, policy_version 80452 (0.0008) [2023-10-08 03:09:19,613][52059] Updated weights for policy 1, policy_version 80462 (0.0008) [2023-10-08 03:09:19,977][52059] Updated weights for policy 1, policy_version 80472 (0.0008) [2023-10-08 03:09:21,087][52060] Updated weights for policy 0, policy_version 79460 (0.0010) [2023-10-08 03:09:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 163774464. Throughput: 0: 1710.0, 1: 1751.5. Samples: 40949482. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:09:21,211][50642] Avg episode reward: [(0, '19.700'), (1, '24.540')] [2023-10-08 03:09:21,453][52060] Updated weights for policy 0, policy_version 79470 (0.0007) [2023-10-08 03:09:21,823][52060] Updated weights for policy 0, policy_version 79480 (0.0009) [2023-10-08 03:09:23,728][52059] Updated weights for policy 1, policy_version 80482 (0.0009) [2023-10-08 03:09:24,094][52059] Updated weights for policy 1, policy_version 80492 (0.0007) [2023-10-08 03:09:24,467][52059] Updated weights for policy 1, policy_version 80502 (0.0009) [2023-10-08 03:09:24,831][52059] Updated weights for policy 1, policy_version 80512 (0.0008) [2023-10-08 03:09:25,838][52060] Updated weights for policy 0, policy_version 79490 (0.0008) [2023-10-08 03:09:26,194][52060] Updated weights for policy 0, policy_version 79500 (0.0007) [2023-10-08 03:09:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 163840000. Throughput: 0: 1719.5, 1: 1721.3. Samples: 40969170. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:09:26,211][50642] Avg episode reward: [(0, '18.600'), (1, '25.980')] [2023-10-08 03:09:26,569][52060] Updated weights for policy 0, policy_version 79510 (0.0007) [2023-10-08 03:09:26,932][52060] Updated weights for policy 0, policy_version 79520 (0.0010) [2023-10-08 03:09:28,558][52059] Updated weights for policy 1, policy_version 80522 (0.0008) [2023-10-08 03:09:28,918][52059] Updated weights for policy 1, policy_version 80532 (0.0009) [2023-10-08 03:09:29,289][52059] Updated weights for policy 1, policy_version 80542 (0.0009) [2023-10-08 03:09:30,921][52060] Updated weights for policy 0, policy_version 79530 (0.0009) [2023-10-08 03:09:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 163905536. Throughput: 0: 1710.2, 1: 1719.4. Samples: 40990198. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:09:31,211][50642] Avg episode reward: [(0, '19.230'), (1, '24.650')] [2023-10-08 03:09:31,285][52060] Updated weights for policy 0, policy_version 79540 (0.0008) [2023-10-08 03:09:31,658][52060] Updated weights for policy 0, policy_version 79550 (0.0008) [2023-10-08 03:09:33,350][52059] Updated weights for policy 1, policy_version 80552 (0.0009) [2023-10-08 03:09:33,712][52059] Updated weights for policy 1, policy_version 80562 (0.0010) [2023-10-08 03:09:34,076][52059] Updated weights for policy 1, policy_version 80572 (0.0009) [2023-10-08 03:09:35,697][52060] Updated weights for policy 0, policy_version 79560 (0.0008) [2023-10-08 03:09:36,069][52060] Updated weights for policy 0, policy_version 79570 (0.0008) [2023-10-08 03:09:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 163971072. Throughput: 0: 1716.7, 1: 1726.6. Samples: 41000232. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:09:36,211][50642] Avg episode reward: [(0, '20.830'), (1, '24.100')] [2023-10-08 03:09:36,442][52060] Updated weights for policy 0, policy_version 79580 (0.0010) [2023-10-08 03:09:38,179][52059] Updated weights for policy 1, policy_version 80582 (0.0008) [2023-10-08 03:09:38,551][52059] Updated weights for policy 1, policy_version 80592 (0.0007) [2023-10-08 03:09:38,915][52059] Updated weights for policy 1, policy_version 80602 (0.0009) [2023-10-08 03:09:40,383][52060] Updated weights for policy 0, policy_version 79590 (0.0009) [2023-10-08 03:09:40,745][52060] Updated weights for policy 0, policy_version 79600 (0.0010) [2023-10-08 03:09:41,115][52060] Updated weights for policy 0, policy_version 79610 (0.0007) [2023-10-08 03:09:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 164036608. Throughput: 0: 1717.6, 1: 1711.1. Samples: 41021216. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:09:41,211][50642] Avg episode reward: [(0, '19.410'), (1, '26.390')] [2023-10-08 03:09:42,878][52059] Updated weights for policy 1, policy_version 80612 (0.0010) [2023-10-08 03:09:43,286][52059] Updated weights for policy 1, policy_version 80622 (0.0007) [2023-10-08 03:09:43,643][52059] Updated weights for policy 1, policy_version 80632 (0.0008) [2023-10-08 03:09:45,054][52060] Updated weights for policy 0, policy_version 79620 (0.0009) [2023-10-08 03:09:45,442][52060] Updated weights for policy 0, policy_version 79630 (0.0010) [2023-10-08 03:09:45,809][52060] Updated weights for policy 0, policy_version 79640 (0.0010) [2023-10-08 03:09:46,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 164134912. Throughput: 0: 1702.9, 1: 1728.9. Samples: 41041710. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:09:46,211][50642] Avg episode reward: [(0, '17.880'), (1, '27.820')] [2023-10-08 03:09:47,557][52059] Updated weights for policy 1, policy_version 80642 (0.0008) [2023-10-08 03:09:47,920][52059] Updated weights for policy 1, policy_version 80652 (0.0009) [2023-10-08 03:09:48,284][52059] Updated weights for policy 1, policy_version 80662 (0.0007) [2023-10-08 03:09:48,655][52059] Updated weights for policy 1, policy_version 80672 (0.0009) [2023-10-08 03:09:49,786][52060] Updated weights for policy 0, policy_version 79650 (0.0010) [2023-10-08 03:09:50,160][52060] Updated weights for policy 0, policy_version 79660 (0.0009) [2023-10-08 03:09:50,526][52060] Updated weights for policy 0, policy_version 79670 (0.0011) [2023-10-08 03:09:50,891][52060] Updated weights for policy 0, policy_version 79680 (0.0009) [2023-10-08 03:09:51,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 164200448. Throughput: 0: 1727.0, 1: 1708.9. Samples: 41051936. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:09:51,211][50642] Avg episode reward: [(0, '19.750'), (1, '23.620')] [2023-10-08 03:09:52,720][52059] Updated weights for policy 1, policy_version 80682 (0.0008) [2023-10-08 03:09:53,085][52059] Updated weights for policy 1, policy_version 80692 (0.0008) [2023-10-08 03:09:53,454][52059] Updated weights for policy 1, policy_version 80702 (0.0007) [2023-10-08 03:09:54,734][52060] Updated weights for policy 0, policy_version 79690 (0.0008) [2023-10-08 03:09:55,093][52060] Updated weights for policy 0, policy_version 79700 (0.0009) [2023-10-08 03:09:55,468][52060] Updated weights for policy 0, policy_version 79710 (0.0010) [2023-10-08 03:09:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 164265984. Throughput: 0: 1721.0, 1: 1713.2. Samples: 41072776. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:09:56,211][50642] Avg episode reward: [(0, '22.500'), (1, '22.590')] [2023-10-08 03:09:57,340][52059] Updated weights for policy 1, policy_version 80712 (0.0007) [2023-10-08 03:09:57,701][52059] Updated weights for policy 1, policy_version 80722 (0.0007) [2023-10-08 03:09:58,067][52059] Updated weights for policy 1, policy_version 80732 (0.0010) [2023-10-08 03:09:59,411][52060] Updated weights for policy 0, policy_version 79720 (0.0010) [2023-10-08 03:09:59,786][52060] Updated weights for policy 0, policy_version 79730 (0.0008) [2023-10-08 03:10:00,154][52060] Updated weights for policy 0, policy_version 79740 (0.0010) [2023-10-08 03:10:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 164331520. Throughput: 0: 1696.0, 1: 1739.6. Samples: 41093246. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:10:01,211][50642] Avg episode reward: [(0, '20.940'), (1, '23.310')] [2023-10-08 03:10:02,077][52059] Updated weights for policy 1, policy_version 80742 (0.0008) [2023-10-08 03:10:02,443][52059] Updated weights for policy 1, policy_version 80752 (0.0008) [2023-10-08 03:10:02,803][52059] Updated weights for policy 1, policy_version 80762 (0.0007) [2023-10-08 03:10:04,007][52060] Updated weights for policy 0, policy_version 79750 (0.0008) [2023-10-08 03:10:04,372][52060] Updated weights for policy 0, policy_version 79760 (0.0008) [2023-10-08 03:10:04,730][52060] Updated weights for policy 0, policy_version 79770 (0.0008) [2023-10-08 03:10:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 164397056. Throughput: 0: 1726.4, 1: 1706.3. Samples: 41103954. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:10:06,211][50642] Avg episode reward: [(0, '18.520'), (1, '24.890')] [2023-10-08 03:10:06,721][52059] Updated weights for policy 1, policy_version 80772 (0.0007) [2023-10-08 03:10:07,085][52059] Updated weights for policy 1, policy_version 80782 (0.0009) [2023-10-08 03:10:07,452][52059] Updated weights for policy 1, policy_version 80792 (0.0008) [2023-10-08 03:10:08,759][52060] Updated weights for policy 0, policy_version 79780 (0.0010) [2023-10-08 03:10:09,124][52060] Updated weights for policy 0, policy_version 79790 (0.0011) [2023-10-08 03:10:09,505][52060] Updated weights for policy 0, policy_version 79800 (0.0011) [2023-10-08 03:10:11,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 164462592. Throughput: 0: 1700.0, 1: 1741.9. Samples: 41124054. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:10:11,211][50642] Avg episode reward: [(0, '19.680'), (1, '24.280')] [2023-10-08 03:10:11,294][52059] Updated weights for policy 1, policy_version 80802 (0.0008) [2023-10-08 03:10:11,659][52059] Updated weights for policy 1, policy_version 80812 (0.0007) [2023-10-08 03:10:12,030][52059] Updated weights for policy 1, policy_version 80822 (0.0007) [2023-10-08 03:10:12,392][52059] Updated weights for policy 1, policy_version 80832 (0.0008) [2023-10-08 03:10:13,528][52060] Updated weights for policy 0, policy_version 79810 (0.0011) [2023-10-08 03:10:13,888][52060] Updated weights for policy 0, policy_version 79820 (0.0010) [2023-10-08 03:10:14,260][52060] Updated weights for policy 0, policy_version 79830 (0.0007) [2023-10-08 03:10:14,621][52060] Updated weights for policy 0, policy_version 79840 (0.0010) [2023-10-08 03:10:16,200][52059] Updated weights for policy 1, policy_version 80842 (0.0011) [2023-10-08 03:10:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 164528128. Throughput: 0: 1708.1, 1: 1742.4. Samples: 41145470. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:10:16,211][50642] Avg episode reward: [(0, '21.010'), (1, '22.380')] [2023-10-08 03:10:16,565][52059] Updated weights for policy 1, policy_version 80852 (0.0010) [2023-10-08 03:10:16,928][52059] Updated weights for policy 1, policy_version 80862 (0.0008) [2023-10-08 03:10:18,612][52060] Updated weights for policy 0, policy_version 79850 (0.0009) [2023-10-08 03:10:18,989][52060] Updated weights for policy 0, policy_version 79860 (0.0009) [2023-10-08 03:10:19,353][52060] Updated weights for policy 0, policy_version 79870 (0.0007) [2023-10-08 03:10:20,810][52059] Updated weights for policy 1, policy_version 80872 (0.0008) [2023-10-08 03:10:21,174][52059] Updated weights for policy 1, policy_version 80882 (0.0009) [2023-10-08 03:10:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 164593664. Throughput: 0: 1719.9, 1: 1737.3. Samples: 41155806. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:10:21,211][50642] Avg episode reward: [(0, '19.230'), (1, '24.890')] [2023-10-08 03:10:21,543][52059] Updated weights for policy 1, policy_version 80892 (0.0010) [2023-10-08 03:10:23,349][52060] Updated weights for policy 0, policy_version 79880 (0.0008) [2023-10-08 03:10:23,714][52060] Updated weights for policy 0, policy_version 79890 (0.0009) [2023-10-08 03:10:24,089][52060] Updated weights for policy 0, policy_version 79900 (0.0008) [2023-10-08 03:10:25,342][52059] Updated weights for policy 1, policy_version 80902 (0.0010) [2023-10-08 03:10:25,711][52059] Updated weights for policy 1, policy_version 80912 (0.0011) [2023-10-08 03:10:26,069][52059] Updated weights for policy 1, policy_version 80922 (0.0011) [2023-10-08 03:10:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 164659200. Throughput: 0: 1700.8, 1: 1753.8. Samples: 41176672. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:10:26,211][50642] Avg episode reward: [(0, '19.850'), (1, '26.130')] [2023-10-08 03:10:28,038][52060] Updated weights for policy 0, policy_version 79910 (0.0007) [2023-10-08 03:10:28,402][52060] Updated weights for policy 0, policy_version 79920 (0.0010) [2023-10-08 03:10:28,765][52060] Updated weights for policy 0, policy_version 79930 (0.0009) [2023-10-08 03:10:30,160][52059] Updated weights for policy 1, policy_version 80932 (0.0009) [2023-10-08 03:10:30,552][52059] Updated weights for policy 1, policy_version 80942 (0.0009) [2023-10-08 03:10:30,925][52059] Updated weights for policy 1, policy_version 80952 (0.0009) [2023-10-08 03:10:31,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 164757504. Throughput: 0: 1724.8, 1: 1727.8. Samples: 41197076. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:10:31,211][50642] Avg episode reward: [(0, '20.860'), (1, '25.560')] [2023-10-08 03:10:32,819][52060] Updated weights for policy 0, policy_version 79940 (0.0009) [2023-10-08 03:10:33,214][52060] Updated weights for policy 0, policy_version 79950 (0.0008) [2023-10-08 03:10:33,575][52060] Updated weights for policy 0, policy_version 79960 (0.0010) [2023-10-08 03:10:34,682][52059] Updated weights for policy 1, policy_version 80962 (0.0008) [2023-10-08 03:10:35,050][52059] Updated weights for policy 1, policy_version 80972 (0.0008) [2023-10-08 03:10:35,409][52059] Updated weights for policy 1, policy_version 80982 (0.0008) [2023-10-08 03:10:35,770][52059] Updated weights for policy 1, policy_version 80992 (0.0009) [2023-10-08 03:10:36,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 164823040. Throughput: 0: 1698.9, 1: 1752.6. Samples: 41207254. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:10:36,211][50642] Avg episode reward: [(0, '22.200'), (1, '23.560')] [2023-10-08 03:10:37,549][52060] Updated weights for policy 0, policy_version 79970 (0.0008) [2023-10-08 03:10:37,914][52060] Updated weights for policy 0, policy_version 79980 (0.0010) [2023-10-08 03:10:38,283][52060] Updated weights for policy 0, policy_version 79990 (0.0009) [2023-10-08 03:10:38,655][52060] Updated weights for policy 0, policy_version 80000 (0.0008) [2023-10-08 03:10:39,590][52059] Updated weights for policy 1, policy_version 81002 (0.0007) [2023-10-08 03:10:39,948][52059] Updated weights for policy 1, policy_version 81012 (0.0008) [2023-10-08 03:10:40,315][52059] Updated weights for policy 1, policy_version 81022 (0.0007) [2023-10-08 03:10:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 164888576. Throughput: 0: 1705.3, 1: 1740.0. Samples: 41227812. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:10:41,211][50642] Avg episode reward: [(0, '18.310'), (1, '20.790')] [2023-10-08 03:10:42,648][52060] Updated weights for policy 0, policy_version 80010 (0.0010) [2023-10-08 03:10:43,013][52060] Updated weights for policy 0, policy_version 80020 (0.0008) [2023-10-08 03:10:43,380][52060] Updated weights for policy 0, policy_version 80030 (0.0009) [2023-10-08 03:10:44,281][52059] Updated weights for policy 1, policy_version 81032 (0.0009) [2023-10-08 03:10:44,653][52059] Updated weights for policy 1, policy_version 81042 (0.0007) [2023-10-08 03:10:45,017][52059] Updated weights for policy 1, policy_version 81052 (0.0007) [2023-10-08 03:10:46,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 164954112. Throughput: 0: 1728.4, 1: 1727.1. Samples: 41248742. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:10:46,211][50642] Avg episode reward: [(0, '19.860'), (1, '23.200')] [2023-10-08 03:10:46,223][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000081056_83001344.pth... [2023-10-08 03:10:46,223][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000080032_81952768.pth... [2023-10-08 03:10:46,252][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000079424_81330176.pth [2023-10-08 03:10:46,256][51710] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p1/milestones/checkpoint_000081056_83001344.pth [2023-10-08 03:10:46,259][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000078432_80314368.pth [2023-10-08 03:10:46,263][51605] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p0/milestones/checkpoint_000080032_81952768.pth [2023-10-08 03:10:47,368][52060] Updated weights for policy 0, policy_version 80040 (0.0009) [2023-10-08 03:10:47,739][52060] Updated weights for policy 0, policy_version 80050 (0.0008) [2023-10-08 03:10:48,110][52060] Updated weights for policy 0, policy_version 80060 (0.0008) [2023-10-08 03:10:48,896][52059] Updated weights for policy 1, policy_version 81062 (0.0008) [2023-10-08 03:10:49,262][52059] Updated weights for policy 1, policy_version 81072 (0.0009) [2023-10-08 03:10:49,627][52059] Updated weights for policy 1, policy_version 81082 (0.0011) [2023-10-08 03:10:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 165019648. Throughput: 0: 1696.3, 1: 1755.4. Samples: 41259282. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:10:51,211][50642] Avg episode reward: [(0, '20.810'), (1, '23.840')] [2023-10-08 03:10:52,087][52060] Updated weights for policy 0, policy_version 80070 (0.0010) [2023-10-08 03:10:52,451][52060] Updated weights for policy 0, policy_version 80080 (0.0010) [2023-10-08 03:10:52,823][52060] Updated weights for policy 0, policy_version 80090 (0.0008) [2023-10-08 03:10:53,673][52059] Updated weights for policy 1, policy_version 81092 (0.0008) [2023-10-08 03:10:54,035][52059] Updated weights for policy 1, policy_version 81102 (0.0010) [2023-10-08 03:10:54,399][52059] Updated weights for policy 1, policy_version 81112 (0.0010) [2023-10-08 03:10:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 165085184. Throughput: 0: 1726.5, 1: 1726.0. Samples: 41279416. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:10:56,211][50642] Avg episode reward: [(0, '20.000'), (1, '22.420')] [2023-10-08 03:10:56,823][52060] Updated weights for policy 0, policy_version 80100 (0.0007) [2023-10-08 03:10:57,191][52060] Updated weights for policy 0, policy_version 80110 (0.0008) [2023-10-08 03:10:57,566][52060] Updated weights for policy 0, policy_version 80120 (0.0009) [2023-10-08 03:10:58,442][52059] Updated weights for policy 1, policy_version 81122 (0.0008) [2023-10-08 03:10:58,797][52059] Updated weights for policy 1, policy_version 81132 (0.0009) [2023-10-08 03:10:59,163][52059] Updated weights for policy 1, policy_version 81142 (0.0011) [2023-10-08 03:10:59,524][52059] Updated weights for policy 1, policy_version 81152 (0.0009) [2023-10-08 03:11:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 165150720. Throughput: 0: 1723.0, 1: 1723.0. Samples: 41300542. Policy #0 lag: (min: 31.0, avg: 37.9, max: 63.0) [2023-10-08 03:11:01,211][50642] Avg episode reward: [(0, '19.100'), (1, '20.830')] [2023-10-08 03:11:01,630][52060] Updated weights for policy 0, policy_version 80130 (0.0008) [2023-10-08 03:11:02,008][52060] Updated weights for policy 0, policy_version 80140 (0.0007) [2023-10-08 03:11:02,365][52060] Updated weights for policy 0, policy_version 80150 (0.0010) [2023-10-08 03:11:02,744][52060] Updated weights for policy 0, policy_version 80160 (0.0010) [2023-10-08 03:11:03,418][52059] Updated weights for policy 1, policy_version 81162 (0.0009) [2023-10-08 03:11:03,771][52059] Updated weights for policy 1, policy_version 81172 (0.0011) [2023-10-08 03:11:04,135][52059] Updated weights for policy 1, policy_version 81182 (0.0009) [2023-10-08 03:11:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 165216256. Throughput: 0: 1704.8, 1: 1731.2. Samples: 41310426. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:11:06,211][50642] Avg episode reward: [(0, '19.700'), (1, '21.860')] [2023-10-08 03:11:06,616][52060] Updated weights for policy 0, policy_version 80170 (0.0008) [2023-10-08 03:11:06,986][52060] Updated weights for policy 0, policy_version 80180 (0.0008) [2023-10-08 03:11:07,350][52060] Updated weights for policy 0, policy_version 80190 (0.0009) [2023-10-08 03:11:08,052][52059] Updated weights for policy 1, policy_version 81192 (0.0007) [2023-10-08 03:11:08,422][52059] Updated weights for policy 1, policy_version 81202 (0.0007) [2023-10-08 03:11:08,794][52059] Updated weights for policy 1, policy_version 81212 (0.0008) [2023-10-08 03:11:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 165281792. Throughput: 0: 1722.8, 1: 1721.3. Samples: 41331656. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:11:11,211][50642] Avg episode reward: [(0, '18.920'), (1, '22.490')] [2023-10-08 03:11:11,415][52060] Updated weights for policy 0, policy_version 80200 (0.0008) [2023-10-08 03:11:11,785][52060] Updated weights for policy 0, policy_version 80210 (0.0009) [2023-10-08 03:11:12,153][52060] Updated weights for policy 0, policy_version 80220 (0.0008) [2023-10-08 03:11:12,518][52059] Updated weights for policy 1, policy_version 81222 (0.0010) [2023-10-08 03:11:12,891][52059] Updated weights for policy 1, policy_version 81232 (0.0009) [2023-10-08 03:11:13,266][52059] Updated weights for policy 1, policy_version 81242 (0.0007) [2023-10-08 03:11:16,115][52060] Updated weights for policy 0, policy_version 80230 (0.0008) [2023-10-08 03:11:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 165347328. Throughput: 0: 1715.5, 1: 1748.6. Samples: 41352958. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:11:16,211][50642] Avg episode reward: [(0, '16.520'), (1, '26.930')] [2023-10-08 03:11:16,483][52060] Updated weights for policy 0, policy_version 80240 (0.0007) [2023-10-08 03:11:16,856][52060] Updated weights for policy 0, policy_version 80250 (0.0010) [2023-10-08 03:11:17,295][52059] Updated weights for policy 1, policy_version 81252 (0.0007) [2023-10-08 03:11:17,656][52059] Updated weights for policy 1, policy_version 81262 (0.0007) [2023-10-08 03:11:18,010][52059] Updated weights for policy 1, policy_version 81272 (0.0009) [2023-10-08 03:11:20,753][52060] Updated weights for policy 0, policy_version 80260 (0.0008) [2023-10-08 03:11:21,121][52060] Updated weights for policy 0, policy_version 80270 (0.0008) [2023-10-08 03:11:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 165412864. Throughput: 0: 1722.7, 1: 1724.5. Samples: 41362378. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:11:21,211][50642] Avg episode reward: [(0, '16.190'), (1, '19.620')] [2023-10-08 03:11:21,492][52060] Updated weights for policy 0, policy_version 80280 (0.0007) [2023-10-08 03:11:22,022][52059] Updated weights for policy 1, policy_version 81282 (0.0009) [2023-10-08 03:11:22,385][52059] Updated weights for policy 1, policy_version 81292 (0.0009) [2023-10-08 03:11:22,757][52059] Updated weights for policy 1, policy_version 81302 (0.0008) [2023-10-08 03:11:23,112][52059] Updated weights for policy 1, policy_version 81312 (0.0010) [2023-10-08 03:11:25,474][52060] Updated weights for policy 0, policy_version 80290 (0.0009) [2023-10-08 03:11:25,838][52060] Updated weights for policy 0, policy_version 80300 (0.0010) [2023-10-08 03:11:26,201][52060] Updated weights for policy 0, policy_version 80310 (0.0011) [2023-10-08 03:11:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 165478400. Throughput: 0: 1724.8, 1: 1740.7. Samples: 41383762. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:11:26,211][50642] Avg episode reward: [(0, '19.640'), (1, '21.690')] [2023-10-08 03:11:26,570][52060] Updated weights for policy 0, policy_version 80320 (0.0008) [2023-10-08 03:11:26,871][52059] Updated weights for policy 1, policy_version 81322 (0.0009) [2023-10-08 03:11:27,236][52059] Updated weights for policy 1, policy_version 81332 (0.0007) [2023-10-08 03:11:27,608][52059] Updated weights for policy 1, policy_version 81342 (0.0007) [2023-10-08 03:11:30,589][52060] Updated weights for policy 0, policy_version 80330 (0.0011) [2023-10-08 03:11:30,957][52060] Updated weights for policy 0, policy_version 80340 (0.0009) [2023-10-08 03:11:31,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 165543936. Throughput: 0: 1702.7, 1: 1758.1. Samples: 41404478. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:11:31,211][50642] Avg episode reward: [(0, '19.110'), (1, '22.690')] [2023-10-08 03:11:31,317][52060] Updated weights for policy 0, policy_version 80350 (0.0008) [2023-10-08 03:11:31,468][52059] Updated weights for policy 1, policy_version 81352 (0.0008) [2023-10-08 03:11:31,839][52059] Updated weights for policy 1, policy_version 81362 (0.0009) [2023-10-08 03:11:32,201][52059] Updated weights for policy 1, policy_version 81372 (0.0009) [2023-10-08 03:11:35,248][52060] Updated weights for policy 0, policy_version 80360 (0.0010) [2023-10-08 03:11:35,616][52060] Updated weights for policy 0, policy_version 80370 (0.0010) [2023-10-08 03:11:35,985][52060] Updated weights for policy 0, policy_version 80380 (0.0009) [2023-10-08 03:11:36,122][52059] Updated weights for policy 1, policy_version 81382 (0.0009) [2023-10-08 03:11:36,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 165642240. Throughput: 0: 1720.6, 1: 1731.2. Samples: 41414614. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:11:36,211][50642] Avg episode reward: [(0, '19.730'), (1, '26.940')] [2023-10-08 03:11:36,480][52059] Updated weights for policy 1, policy_version 81392 (0.0007) [2023-10-08 03:11:36,852][52059] Updated weights for policy 1, policy_version 81402 (0.0007) [2023-10-08 03:11:40,038][52060] Updated weights for policy 0, policy_version 80390 (0.0008) [2023-10-08 03:11:40,408][52060] Updated weights for policy 0, policy_version 80400 (0.0008) [2023-10-08 03:11:40,777][52060] Updated weights for policy 0, policy_version 80410 (0.0007) [2023-10-08 03:11:40,836][52059] Updated weights for policy 1, policy_version 81412 (0.0007) [2023-10-08 03:11:41,209][52059] Updated weights for policy 1, policy_version 81422 (0.0010) [2023-10-08 03:11:41,210][50642] Fps is (10 sec: 16384.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 165707776. Throughput: 0: 1716.2, 1: 1761.0. Samples: 41435890. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:11:41,211][50642] Avg episode reward: [(0, '17.460'), (1, '20.440')] [2023-10-08 03:11:41,572][52059] Updated weights for policy 1, policy_version 81432 (0.0010) [2023-10-08 03:11:44,552][52060] Updated weights for policy 0, policy_version 80420 (0.0007) [2023-10-08 03:11:44,922][52060] Updated weights for policy 0, policy_version 80430 (0.0008) [2023-10-08 03:11:45,292][52060] Updated weights for policy 0, policy_version 80440 (0.0008) [2023-10-08 03:11:45,453][52059] Updated weights for policy 1, policy_version 81442 (0.0009) [2023-10-08 03:11:45,824][52059] Updated weights for policy 1, policy_version 81452 (0.0008) [2023-10-08 03:11:46,189][52059] Updated weights for policy 1, policy_version 81462 (0.0007) [2023-10-08 03:11:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 165773312. Throughput: 0: 1697.3, 1: 1751.5. Samples: 41455738. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:11:46,211][50642] Avg episode reward: [(0, '22.330'), (1, '19.870')] [2023-10-08 03:11:46,557][52059] Updated weights for policy 1, policy_version 81472 (0.0008) [2023-10-08 03:11:49,303][52060] Updated weights for policy 0, policy_version 80450 (0.0008) [2023-10-08 03:11:49,669][52060] Updated weights for policy 0, policy_version 80460 (0.0010) [2023-10-08 03:11:50,038][52060] Updated weights for policy 0, policy_version 80470 (0.0008) [2023-10-08 03:11:50,402][52060] Updated weights for policy 0, policy_version 80480 (0.0010) [2023-10-08 03:11:50,442][52059] Updated weights for policy 1, policy_version 81482 (0.0007) [2023-10-08 03:11:50,804][52059] Updated weights for policy 1, policy_version 81492 (0.0008) [2023-10-08 03:11:51,181][52059] Updated weights for policy 1, policy_version 81502 (0.0008) [2023-10-08 03:11:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 165838848. Throughput: 0: 1729.0, 1: 1750.9. Samples: 41467022. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:11:51,211][50642] Avg episode reward: [(0, '17.740'), (1, '21.540')] [2023-10-08 03:11:54,501][52060] Updated weights for policy 0, policy_version 80490 (0.0008) [2023-10-08 03:11:54,877][52060] Updated weights for policy 0, policy_version 80500 (0.0008) [2023-10-08 03:11:55,074][52059] Updated weights for policy 1, policy_version 81512 (0.0007) [2023-10-08 03:11:55,249][52060] Updated weights for policy 0, policy_version 80510 (0.0007) [2023-10-08 03:11:55,441][52059] Updated weights for policy 1, policy_version 81522 (0.0007) [2023-10-08 03:11:55,806][52059] Updated weights for policy 1, policy_version 81532 (0.0008) [2023-10-08 03:11:56,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 165937152. Throughput: 0: 1707.5, 1: 1755.1. Samples: 41487474. Policy #0 lag: (min: 10.0, avg: 13.3, max: 40.0) [2023-10-08 03:11:56,211][50642] Avg episode reward: [(0, '19.400'), (1, '25.770')] [2023-10-08 03:11:59,147][52060] Updated weights for policy 0, policy_version 80520 (0.0009) [2023-10-08 03:11:59,515][52060] Updated weights for policy 0, policy_version 80530 (0.0008) [2023-10-08 03:11:59,745][52059] Updated weights for policy 1, policy_version 81542 (0.0007) [2023-10-08 03:11:59,883][52060] Updated weights for policy 0, policy_version 80540 (0.0009) [2023-10-08 03:12:00,106][52059] Updated weights for policy 1, policy_version 81552 (0.0007) [2023-10-08 03:12:00,470][52059] Updated weights for policy 1, policy_version 81562 (0.0011) [2023-10-08 03:12:01,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 166002688. Throughput: 0: 1700.0, 1: 1725.2. Samples: 41507090. Policy #0 lag: (min: 10.0, avg: 13.3, max: 40.0) [2023-10-08 03:12:01,211][50642] Avg episode reward: [(0, '18.200'), (1, '24.580')] [2023-10-08 03:12:03,815][52060] Updated weights for policy 0, policy_version 80550 (0.0009) [2023-10-08 03:12:04,188][52060] Updated weights for policy 0, policy_version 80560 (0.0007) [2023-10-08 03:12:04,545][52059] Updated weights for policy 1, policy_version 81572 (0.0008) [2023-10-08 03:12:04,563][52060] Updated weights for policy 0, policy_version 80570 (0.0008) [2023-10-08 03:12:04,907][52059] Updated weights for policy 1, policy_version 81582 (0.0010) [2023-10-08 03:12:05,276][52059] Updated weights for policy 1, policy_version 81592 (0.0008) [2023-10-08 03:12:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 166068224. Throughput: 0: 1722.6, 1: 1755.2. Samples: 41518882. Policy #0 lag: (min: 10.0, avg: 13.3, max: 40.0) [2023-10-08 03:12:06,211][50642] Avg episode reward: [(0, '23.370'), (1, '20.870')] [2023-10-08 03:12:08,447][52060] Updated weights for policy 0, policy_version 80580 (0.0008) [2023-10-08 03:12:08,825][52060] Updated weights for policy 0, policy_version 80590 (0.0010) [2023-10-08 03:12:09,192][52060] Updated weights for policy 0, policy_version 80600 (0.0010) [2023-10-08 03:12:09,280][52059] Updated weights for policy 1, policy_version 81602 (0.0009) [2023-10-08 03:12:09,633][52059] Updated weights for policy 1, policy_version 81612 (0.0007) [2023-10-08 03:12:09,994][52059] Updated weights for policy 1, policy_version 81622 (0.0007) [2023-10-08 03:12:10,363][52059] Updated weights for policy 1, policy_version 81632 (0.0009) [2023-10-08 03:12:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 166133760. Throughput: 0: 1696.7, 1: 1731.8. Samples: 41538044. Policy #0 lag: (min: 10.0, avg: 13.3, max: 40.0) [2023-10-08 03:12:11,211][50642] Avg episode reward: [(0, '17.700'), (1, '21.260')] [2023-10-08 03:12:13,151][52060] Updated weights for policy 0, policy_version 80610 (0.0008) [2023-10-08 03:12:13,520][52060] Updated weights for policy 0, policy_version 80620 (0.0009) [2023-10-08 03:12:13,884][52060] Updated weights for policy 0, policy_version 80630 (0.0008) [2023-10-08 03:12:14,256][52060] Updated weights for policy 0, policy_version 80640 (0.0008) [2023-10-08 03:12:14,363][52059] Updated weights for policy 1, policy_version 81642 (0.0010) [2023-10-08 03:12:14,727][52059] Updated weights for policy 1, policy_version 81652 (0.0009) [2023-10-08 03:12:15,089][52059] Updated weights for policy 1, policy_version 81662 (0.0008) [2023-10-08 03:12:16,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 166199296. Throughput: 0: 1715.5, 1: 1714.5. Samples: 41558824. Policy #0 lag: (min: 10.0, avg: 13.3, max: 40.0) [2023-10-08 03:12:16,211][50642] Avg episode reward: [(0, '17.980'), (1, '25.230')] [2023-10-08 03:12:18,259][52060] Updated weights for policy 0, policy_version 80650 (0.0011) [2023-10-08 03:12:18,627][52060] Updated weights for policy 0, policy_version 80660 (0.0008) [2023-10-08 03:12:18,871][52059] Updated weights for policy 1, policy_version 81672 (0.0009) [2023-10-08 03:12:18,987][52060] Updated weights for policy 0, policy_version 80670 (0.0007) [2023-10-08 03:12:19,240][52059] Updated weights for policy 1, policy_version 81682 (0.0008) [2023-10-08 03:12:19,599][52059] Updated weights for policy 1, policy_version 81692 (0.0008) [2023-10-08 03:12:21,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 166264832. Throughput: 0: 1704.2, 1: 1740.3. Samples: 41569616. Policy #0 lag: (min: 10.0, avg: 13.3, max: 40.0) [2023-10-08 03:12:21,211][50642] Avg episode reward: [(0, '18.820'), (1, '24.540')] [2023-10-08 03:12:22,916][52060] Updated weights for policy 0, policy_version 80680 (0.0009) [2023-10-08 03:12:23,279][52060] Updated weights for policy 0, policy_version 80690 (0.0009) [2023-10-08 03:12:23,539][52059] Updated weights for policy 1, policy_version 81702 (0.0008) [2023-10-08 03:12:23,635][52060] Updated weights for policy 0, policy_version 80700 (0.0007) [2023-10-08 03:12:23,895][52059] Updated weights for policy 1, policy_version 81712 (0.0009) [2023-10-08 03:12:24,269][52059] Updated weights for policy 1, policy_version 81722 (0.0008) [2023-10-08 03:12:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 166330368. Throughput: 0: 1703.2, 1: 1714.6. Samples: 41589690. Policy #0 lag: (min: 10.0, avg: 13.3, max: 40.0) [2023-10-08 03:12:26,211][50642] Avg episode reward: [(0, '23.050'), (1, '21.310')] [2023-10-08 03:12:27,599][52060] Updated weights for policy 0, policy_version 80710 (0.0009) [2023-10-08 03:12:27,975][52060] Updated weights for policy 0, policy_version 80720 (0.0007) [2023-10-08 03:12:28,128][52059] Updated weights for policy 1, policy_version 81732 (0.0009) [2023-10-08 03:12:28,335][52060] Updated weights for policy 0, policy_version 80730 (0.0008) [2023-10-08 03:12:28,497][52059] Updated weights for policy 1, policy_version 81742 (0.0008) [2023-10-08 03:12:28,854][52059] Updated weights for policy 1, policy_version 81752 (0.0009) [2023-10-08 03:12:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 166395904. Throughput: 0: 1729.3, 1: 1728.6. Samples: 41611344. Policy #0 lag: (min: 10.0, avg: 13.3, max: 40.0) [2023-10-08 03:12:31,211][50642] Avg episode reward: [(0, '17.700'), (1, '23.400')] [2023-10-08 03:12:32,260][52060] Updated weights for policy 0, policy_version 80740 (0.0009) [2023-10-08 03:12:32,634][52060] Updated weights for policy 0, policy_version 80750 (0.0008) [2023-10-08 03:12:32,821][52059] Updated weights for policy 1, policy_version 81762 (0.0009) [2023-10-08 03:12:33,000][52060] Updated weights for policy 0, policy_version 80760 (0.0010) [2023-10-08 03:12:33,185][52059] Updated weights for policy 1, policy_version 81772 (0.0008) [2023-10-08 03:12:33,538][52059] Updated weights for policy 1, policy_version 81782 (0.0007) [2023-10-08 03:12:33,901][52059] Updated weights for policy 1, policy_version 81792 (0.0010) [2023-10-08 03:12:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 166461440. Throughput: 0: 1697.9, 1: 1717.2. Samples: 41620702. Policy #0 lag: (min: 10.0, avg: 13.3, max: 40.0) [2023-10-08 03:12:36,211][50642] Avg episode reward: [(0, '18.340'), (1, '26.310')] [2023-10-08 03:12:36,873][52060] Updated weights for policy 0, policy_version 80770 (0.0007) [2023-10-08 03:12:37,236][52060] Updated weights for policy 0, policy_version 80780 (0.0008) [2023-10-08 03:12:37,611][52060] Updated weights for policy 0, policy_version 80790 (0.0007) [2023-10-08 03:12:37,887][52059] Updated weights for policy 1, policy_version 81802 (0.0007) [2023-10-08 03:12:37,969][52060] Updated weights for policy 0, policy_version 80800 (0.0008) [2023-10-08 03:12:38,246][52059] Updated weights for policy 1, policy_version 81812 (0.0010) [2023-10-08 03:12:38,620][52059] Updated weights for policy 1, policy_version 81822 (0.0009) [2023-10-08 03:12:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 166526976. Throughput: 0: 1721.6, 1: 1712.0. Samples: 41641990. Policy #0 lag: (min: 10.0, avg: 13.3, max: 40.0) [2023-10-08 03:12:41,211][50642] Avg episode reward: [(0, '20.830'), (1, '24.120')] [2023-10-08 03:12:41,855][52060] Updated weights for policy 0, policy_version 80810 (0.0010) [2023-10-08 03:12:42,224][52060] Updated weights for policy 0, policy_version 80820 (0.0010) [2023-10-08 03:12:42,590][52060] Updated weights for policy 0, policy_version 80830 (0.0009) [2023-10-08 03:12:42,608][52059] Updated weights for policy 1, policy_version 81832 (0.0009) [2023-10-08 03:12:42,980][52059] Updated weights for policy 1, policy_version 81842 (0.0009) [2023-10-08 03:12:43,344][52059] Updated weights for policy 1, policy_version 81852 (0.0007) [2023-10-08 03:12:46,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 166592512. Throughput: 0: 1730.5, 1: 1740.4. Samples: 41663282. Policy #0 lag: (min: 10.0, avg: 13.3, max: 40.0) [2023-10-08 03:12:46,211][50642] Avg episode reward: [(0, '21.600'), (1, '23.750')] [2023-10-08 03:12:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000080832_82771968.pth... [2023-10-08 03:12:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000081856_83820544.pth... [2023-10-08 03:12:46,256][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000079232_81133568.pth [2023-10-08 03:12:46,260][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000080256_82182144.pth [2023-10-08 03:12:46,690][52060] Updated weights for policy 0, policy_version 80840 (0.0009) [2023-10-08 03:12:47,058][52060] Updated weights for policy 0, policy_version 80850 (0.0009) [2023-10-08 03:12:47,250][52059] Updated weights for policy 1, policy_version 81862 (0.0009) [2023-10-08 03:12:47,433][52060] Updated weights for policy 0, policy_version 80860 (0.0009) [2023-10-08 03:12:47,625][52059] Updated weights for policy 1, policy_version 81872 (0.0009) [2023-10-08 03:12:47,992][52059] Updated weights for policy 1, policy_version 81882 (0.0007) [2023-10-08 03:12:51,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 166658048. Throughput: 0: 1704.1, 1: 1714.4. Samples: 41672714. Policy #0 lag: (min: 22.0, avg: 25.4, max: 54.0) [2023-10-08 03:12:51,211][50642] Avg episode reward: [(0, '17.390'), (1, '23.160')] [2023-10-08 03:12:51,422][52060] Updated weights for policy 0, policy_version 80870 (0.0007) [2023-10-08 03:12:51,786][52060] Updated weights for policy 0, policy_version 80880 (0.0007) [2023-10-08 03:12:51,892][52059] Updated weights for policy 1, policy_version 81892 (0.0008) [2023-10-08 03:12:52,156][52060] Updated weights for policy 0, policy_version 80890 (0.0009) [2023-10-08 03:12:52,252][52059] Updated weights for policy 1, policy_version 81902 (0.0007) [2023-10-08 03:12:52,616][52059] Updated weights for policy 1, policy_version 81912 (0.0007) [2023-10-08 03:12:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 166723584. Throughput: 0: 1731.2, 1: 1732.4. Samples: 41693910. Policy #0 lag: (min: 22.0, avg: 25.4, max: 54.0) [2023-10-08 03:12:56,211][50642] Avg episode reward: [(0, '17.680'), (1, '25.660')] [2023-10-08 03:12:56,278][52060] Updated weights for policy 0, policy_version 80900 (0.0008) [2023-10-08 03:12:56,648][52059] Updated weights for policy 1, policy_version 81922 (0.0011) [2023-10-08 03:12:56,666][52060] Updated weights for policy 0, policy_version 80910 (0.0007) [2023-10-08 03:12:57,029][52060] Updated weights for policy 0, policy_version 80920 (0.0007) [2023-10-08 03:12:57,035][52059] Updated weights for policy 1, policy_version 81932 (0.0007) [2023-10-08 03:12:57,405][52059] Updated weights for policy 1, policy_version 81942 (0.0007) [2023-10-08 03:12:57,760][52059] Updated weights for policy 1, policy_version 81952 (0.0010) [2023-10-08 03:13:00,872][52060] Updated weights for policy 0, policy_version 80930 (0.0008) [2023-10-08 03:13:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 166789120. Throughput: 0: 1731.4, 1: 1737.0. Samples: 41714902. Policy #0 lag: (min: 22.0, avg: 25.4, max: 54.0) [2023-10-08 03:13:01,211][50642] Avg episode reward: [(0, '22.600'), (1, '24.230')] [2023-10-08 03:13:01,242][52060] Updated weights for policy 0, policy_version 80940 (0.0007) [2023-10-08 03:13:01,610][52060] Updated weights for policy 0, policy_version 80950 (0.0007) [2023-10-08 03:13:01,730][52059] Updated weights for policy 1, policy_version 81962 (0.0008) [2023-10-08 03:13:01,982][52060] Updated weights for policy 0, policy_version 80960 (0.0007) [2023-10-08 03:13:02,093][52059] Updated weights for policy 1, policy_version 81972 (0.0007) [2023-10-08 03:13:02,454][52059] Updated weights for policy 1, policy_version 81982 (0.0007) [2023-10-08 03:13:06,080][52060] Updated weights for policy 0, policy_version 80970 (0.0008) [2023-10-08 03:13:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 166854656. Throughput: 0: 1725.1, 1: 1713.2. Samples: 41724336. Policy #0 lag: (min: 22.0, avg: 25.4, max: 54.0) [2023-10-08 03:13:06,211][50642] Avg episode reward: [(0, '22.520'), (1, '23.570')] [2023-10-08 03:13:06,215][52059] Updated weights for policy 1, policy_version 81992 (0.0009) [2023-10-08 03:13:06,458][52060] Updated weights for policy 0, policy_version 80980 (0.0009) [2023-10-08 03:13:06,587][52059] Updated weights for policy 1, policy_version 82002 (0.0008) [2023-10-08 03:13:06,828][52060] Updated weights for policy 0, policy_version 80990 (0.0008) [2023-10-08 03:13:06,949][52059] Updated weights for policy 1, policy_version 82012 (0.0007) [2023-10-08 03:13:10,774][52059] Updated weights for policy 1, policy_version 82022 (0.0008) [2023-10-08 03:13:10,892][52060] Updated weights for policy 0, policy_version 81000 (0.0009) [2023-10-08 03:13:11,134][52059] Updated weights for policy 1, policy_version 82032 (0.0007) [2023-10-08 03:13:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 166920192. Throughput: 0: 1730.6, 1: 1739.1. Samples: 41745826. Policy #0 lag: (min: 22.0, avg: 25.4, max: 54.0) [2023-10-08 03:13:11,211][50642] Avg episode reward: [(0, '19.290'), (1, '24.450')] [2023-10-08 03:13:11,262][52060] Updated weights for policy 0, policy_version 81010 (0.0009) [2023-10-08 03:13:11,495][52059] Updated weights for policy 1, policy_version 82042 (0.0007) [2023-10-08 03:13:11,646][52060] Updated weights for policy 0, policy_version 81020 (0.0008) [2023-10-08 03:13:15,310][52059] Updated weights for policy 1, policy_version 82052 (0.0008) [2023-10-08 03:13:15,463][52060] Updated weights for policy 0, policy_version 81030 (0.0008) [2023-10-08 03:13:15,669][52059] Updated weights for policy 1, policy_version 82062 (0.0009) [2023-10-08 03:13:15,833][52060] Updated weights for policy 0, policy_version 81040 (0.0009) [2023-10-08 03:13:16,019][52059] Updated weights for policy 1, policy_version 82072 (0.0008) [2023-10-08 03:13:16,208][52060] Updated weights for policy 0, policy_version 81050 (0.0008) [2023-10-08 03:13:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 166985728. Throughput: 0: 1715.7, 1: 1719.8. Samples: 41765938. Policy #0 lag: (min: 22.0, avg: 25.4, max: 54.0) [2023-10-08 03:13:16,211][50642] Avg episode reward: [(0, '18.470'), (1, '28.560')] [2023-10-08 03:13:16,311][51710] Saving new best policy, reward=28.560! [2023-10-08 03:13:19,919][52060] Updated weights for policy 0, policy_version 81060 (0.0008) [2023-10-08 03:13:19,956][52059] Updated weights for policy 1, policy_version 82082 (0.0009) [2023-10-08 03:13:20,286][52060] Updated weights for policy 0, policy_version 81070 (0.0008) [2023-10-08 03:13:20,325][52059] Updated weights for policy 1, policy_version 82092 (0.0008) [2023-10-08 03:13:20,650][52060] Updated weights for policy 0, policy_version 81080 (0.0007) [2023-10-08 03:13:20,678][52059] Updated weights for policy 1, policy_version 82102 (0.0009) [2023-10-08 03:13:21,040][52059] Updated weights for policy 1, policy_version 82112 (0.0009) [2023-10-08 03:13:21,210][50642] Fps is (10 sec: 19661.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 167116800. Throughput: 0: 1734.8, 1: 1736.7. Samples: 41776920. Policy #0 lag: (min: 22.0, avg: 25.4, max: 54.0) [2023-10-08 03:13:21,211][50642] Avg episode reward: [(0, '21.780'), (1, '24.930')] [2023-10-08 03:13:24,559][52060] Updated weights for policy 0, policy_version 81090 (0.0007) [2023-10-08 03:13:24,889][52059] Updated weights for policy 1, policy_version 82122 (0.0009) [2023-10-08 03:13:24,926][52060] Updated weights for policy 0, policy_version 81100 (0.0007) [2023-10-08 03:13:25,261][52059] Updated weights for policy 1, policy_version 82132 (0.0007) [2023-10-08 03:13:25,301][52060] Updated weights for policy 0, policy_version 81110 (0.0007) [2023-10-08 03:13:25,627][52059] Updated weights for policy 1, policy_version 82142 (0.0009) [2023-10-08 03:13:25,668][52060] Updated weights for policy 0, policy_version 81120 (0.0007) [2023-10-08 03:13:26,210][50642] Fps is (10 sec: 19660.6, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 167182336. Throughput: 0: 1722.4, 1: 1734.0. Samples: 41797526. Policy #0 lag: (min: 22.0, avg: 25.4, max: 54.0) [2023-10-08 03:13:26,211][50642] Avg episode reward: [(0, '21.240'), (1, '22.840')] [2023-10-08 03:13:29,463][52060] Updated weights for policy 0, policy_version 81130 (0.0008) [2023-10-08 03:13:29,704][52059] Updated weights for policy 1, policy_version 82152 (0.0007) [2023-10-08 03:13:29,834][52060] Updated weights for policy 0, policy_version 81140 (0.0007) [2023-10-08 03:13:30,080][52059] Updated weights for policy 1, policy_version 82162 (0.0009) [2023-10-08 03:13:30,190][52060] Updated weights for policy 0, policy_version 81150 (0.0009) [2023-10-08 03:13:30,445][52059] Updated weights for policy 1, policy_version 82172 (0.0007) [2023-10-08 03:13:31,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 167247872. Throughput: 0: 1708.0, 1: 1708.8. Samples: 41817040. Policy #0 lag: (min: 22.0, avg: 25.4, max: 54.0) [2023-10-08 03:13:31,211][50642] Avg episode reward: [(0, '20.130'), (1, '22.820')] [2023-10-08 03:13:34,277][52060] Updated weights for policy 0, policy_version 81160 (0.0008) [2023-10-08 03:13:34,481][52059] Updated weights for policy 1, policy_version 82182 (0.0007) [2023-10-08 03:13:34,647][52060] Updated weights for policy 0, policy_version 81170 (0.0009) [2023-10-08 03:13:34,847][52059] Updated weights for policy 1, policy_version 82192 (0.0008) [2023-10-08 03:13:35,016][52060] Updated weights for policy 0, policy_version 81180 (0.0007) [2023-10-08 03:13:35,207][52059] Updated weights for policy 1, policy_version 82202 (0.0008) [2023-10-08 03:13:36,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 167313408. Throughput: 0: 1736.0, 1: 1738.1. Samples: 41829050. Policy #0 lag: (min: 22.0, avg: 25.4, max: 54.0) [2023-10-08 03:13:36,211][50642] Avg episode reward: [(0, '20.150'), (1, '25.180')] [2023-10-08 03:13:39,064][52060] Updated weights for policy 0, policy_version 81190 (0.0007) [2023-10-08 03:13:39,216][52059] Updated weights for policy 1, policy_version 82212 (0.0007) [2023-10-08 03:13:39,440][52060] Updated weights for policy 0, policy_version 81200 (0.0007) [2023-10-08 03:13:39,580][52059] Updated weights for policy 1, policy_version 82222 (0.0008) [2023-10-08 03:13:39,809][52060] Updated weights for policy 0, policy_version 81210 (0.0007) [2023-10-08 03:13:39,942][52059] Updated weights for policy 1, policy_version 82232 (0.0007) [2023-10-08 03:13:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 167378944. Throughput: 0: 1707.9, 1: 1720.5. Samples: 41848188. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 03:13:41,211][50642] Avg episode reward: [(0, '21.790'), (1, '22.640')] [2023-10-08 03:13:43,874][52059] Updated weights for policy 1, policy_version 82242 (0.0009) [2023-10-08 03:13:43,886][52060] Updated weights for policy 0, policy_version 81220 (0.0008) [2023-10-08 03:13:44,264][52060] Updated weights for policy 0, policy_version 81230 (0.0008) [2023-10-08 03:13:44,285][52059] Updated weights for policy 1, policy_version 82252 (0.0009) [2023-10-08 03:13:44,630][52060] Updated weights for policy 0, policy_version 81240 (0.0010) [2023-10-08 03:13:44,646][52059] Updated weights for policy 1, policy_version 82262 (0.0009) [2023-10-08 03:13:45,011][52059] Updated weights for policy 1, policy_version 82272 (0.0009) [2023-10-08 03:13:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 167444480. Throughput: 0: 1701.4, 1: 1712.7. Samples: 41868538. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 03:13:46,211][50642] Avg episode reward: [(0, '19.360'), (1, '20.380')] [2023-10-08 03:13:48,601][52060] Updated weights for policy 0, policy_version 81250 (0.0008) [2023-10-08 03:13:48,927][52059] Updated weights for policy 1, policy_version 82282 (0.0007) [2023-10-08 03:13:48,967][52060] Updated weights for policy 0, policy_version 81260 (0.0007) [2023-10-08 03:13:49,286][52059] Updated weights for policy 1, policy_version 82292 (0.0007) [2023-10-08 03:13:49,328][52060] Updated weights for policy 0, policy_version 81270 (0.0008) [2023-10-08 03:13:49,653][52059] Updated weights for policy 1, policy_version 82302 (0.0007) [2023-10-08 03:13:49,688][52060] Updated weights for policy 0, policy_version 81280 (0.0008) [2023-10-08 03:13:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 167510016. Throughput: 0: 1722.4, 1: 1734.9. Samples: 41879916. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 03:13:51,211][50642] Avg episode reward: [(0, '20.240'), (1, '21.010')] [2023-10-08 03:13:53,754][52060] Updated weights for policy 0, policy_version 81290 (0.0008) [2023-10-08 03:13:53,823][52059] Updated weights for policy 1, policy_version 82312 (0.0007) [2023-10-08 03:13:54,117][52060] Updated weights for policy 0, policy_version 81300 (0.0009) [2023-10-08 03:13:54,181][52059] Updated weights for policy 1, policy_version 82322 (0.0007) [2023-10-08 03:13:54,492][52060] Updated weights for policy 0, policy_version 81310 (0.0007) [2023-10-08 03:13:54,537][52059] Updated weights for policy 1, policy_version 82332 (0.0007) [2023-10-08 03:13:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 167575552. Throughput: 0: 1696.0, 1: 1706.3. Samples: 41898930. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 03:13:56,211][50642] Avg episode reward: [(0, '19.840'), (1, '24.310')] [2023-10-08 03:13:58,415][52059] Updated weights for policy 1, policy_version 82342 (0.0009) [2023-10-08 03:13:58,441][52060] Updated weights for policy 0, policy_version 81320 (0.0007) [2023-10-08 03:13:58,786][52059] Updated weights for policy 1, policy_version 82352 (0.0009) [2023-10-08 03:13:58,809][52060] Updated weights for policy 0, policy_version 81330 (0.0007) [2023-10-08 03:13:59,147][52059] Updated weights for policy 1, policy_version 82362 (0.0009) [2023-10-08 03:13:59,170][52060] Updated weights for policy 0, policy_version 81340 (0.0007) [2023-10-08 03:14:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 167641088. Throughput: 0: 1710.0, 1: 1721.9. Samples: 41920374. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 03:14:01,211][50642] Avg episode reward: [(0, '21.590'), (1, '24.170')] [2023-10-08 03:14:02,932][52059] Updated weights for policy 1, policy_version 82372 (0.0008) [2023-10-08 03:14:03,197][52060] Updated weights for policy 0, policy_version 81350 (0.0008) [2023-10-08 03:14:03,293][52059] Updated weights for policy 1, policy_version 82382 (0.0007) [2023-10-08 03:14:03,561][52060] Updated weights for policy 0, policy_version 81360 (0.0010) [2023-10-08 03:14:03,656][52059] Updated weights for policy 1, policy_version 82392 (0.0007) [2023-10-08 03:14:03,928][52060] Updated weights for policy 0, policy_version 81370 (0.0009) [2023-10-08 03:14:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 167706624. Throughput: 0: 1699.1, 1: 1709.0. Samples: 41930286. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 03:14:06,211][50642] Avg episode reward: [(0, '19.040'), (1, '23.640')] [2023-10-08 03:14:07,567][52059] Updated weights for policy 1, policy_version 82402 (0.0008) [2023-10-08 03:14:07,872][52060] Updated weights for policy 0, policy_version 81380 (0.0007) [2023-10-08 03:14:07,932][52059] Updated weights for policy 1, policy_version 82412 (0.0010) [2023-10-08 03:14:08,233][52060] Updated weights for policy 0, policy_version 81390 (0.0007) [2023-10-08 03:14:08,294][52059] Updated weights for policy 1, policy_version 82422 (0.0009) [2023-10-08 03:14:08,596][52060] Updated weights for policy 0, policy_version 81400 (0.0007) [2023-10-08 03:14:08,656][52059] Updated weights for policy 1, policy_version 82432 (0.0008) [2023-10-08 03:14:11,210][50642] Fps is (10 sec: 13107.7, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 167772160. Throughput: 0: 1699.5, 1: 1718.8. Samples: 41951346. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 03:14:11,211][50642] Avg episode reward: [(0, '19.930'), (1, '23.650')] [2023-10-08 03:14:12,455][52060] Updated weights for policy 0, policy_version 81410 (0.0008) [2023-10-08 03:14:12,624][52059] Updated weights for policy 1, policy_version 82442 (0.0008) [2023-10-08 03:14:12,821][52060] Updated weights for policy 0, policy_version 81420 (0.0008) [2023-10-08 03:14:12,983][52059] Updated weights for policy 1, policy_version 82452 (0.0007) [2023-10-08 03:14:13,191][52060] Updated weights for policy 0, policy_version 81430 (0.0007) [2023-10-08 03:14:13,347][52059] Updated weights for policy 1, policy_version 82462 (0.0008) [2023-10-08 03:14:13,563][52060] Updated weights for policy 0, policy_version 81440 (0.0009) [2023-10-08 03:14:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 167837696. Throughput: 0: 1713.9, 1: 1740.4. Samples: 41972482. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 03:14:16,211][50642] Avg episode reward: [(0, '22.920'), (1, '25.530')] [2023-10-08 03:14:17,274][52059] Updated weights for policy 1, policy_version 82472 (0.0009) [2023-10-08 03:14:17,565][52060] Updated weights for policy 0, policy_version 81450 (0.0008) [2023-10-08 03:14:17,635][52059] Updated weights for policy 1, policy_version 82482 (0.0007) [2023-10-08 03:14:17,928][52060] Updated weights for policy 0, policy_version 81460 (0.0008) [2023-10-08 03:14:17,991][52059] Updated weights for policy 1, policy_version 82492 (0.0007) [2023-10-08 03:14:18,299][52060] Updated weights for policy 0, policy_version 81470 (0.0007) [2023-10-08 03:14:21,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 167903232. Throughput: 0: 1685.2, 1: 1713.3. Samples: 41981984. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 03:14:21,211][50642] Avg episode reward: [(0, '22.530'), (1, '25.560')] [2023-10-08 03:14:21,840][52059] Updated weights for policy 1, policy_version 82502 (0.0009) [2023-10-08 03:14:22,210][52059] Updated weights for policy 1, policy_version 82512 (0.0007) [2023-10-08 03:14:22,244][52060] Updated weights for policy 0, policy_version 81480 (0.0010) [2023-10-08 03:14:22,573][52059] Updated weights for policy 1, policy_version 82522 (0.0008) [2023-10-08 03:14:22,624][52060] Updated weights for policy 0, policy_version 81490 (0.0010) [2023-10-08 03:14:22,992][52060] Updated weights for policy 0, policy_version 81500 (0.0007) [2023-10-08 03:14:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 167968768. Throughput: 0: 1712.9, 1: 1739.0. Samples: 42003524. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 03:14:26,211][50642] Avg episode reward: [(0, '19.350'), (1, '23.980')] [2023-10-08 03:14:26,299][52059] Updated weights for policy 1, policy_version 82532 (0.0008) [2023-10-08 03:14:26,675][52059] Updated weights for policy 1, policy_version 82542 (0.0008) [2023-10-08 03:14:27,023][52059] Updated weights for policy 1, policy_version 82552 (0.0008) [2023-10-08 03:14:27,056][52060] Updated weights for policy 0, policy_version 81510 (0.0007) [2023-10-08 03:14:27,429][52060] Updated weights for policy 0, policy_version 81520 (0.0009) [2023-10-08 03:14:27,808][52060] Updated weights for policy 0, policy_version 81530 (0.0010) [2023-10-08 03:14:30,991][52059] Updated weights for policy 1, policy_version 82562 (0.0007) [2023-10-08 03:14:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 168034304. Throughput: 0: 1720.6, 1: 1754.3. Samples: 42024906. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) [2023-10-08 03:14:31,211][50642] Avg episode reward: [(0, '19.340'), (1, '23.230')] [2023-10-08 03:14:31,379][52059] Updated weights for policy 1, policy_version 82572 (0.0009) [2023-10-08 03:14:31,735][52059] Updated weights for policy 1, policy_version 82582 (0.0008) [2023-10-08 03:14:31,786][52060] Updated weights for policy 0, policy_version 81540 (0.0008) [2023-10-08 03:14:32,103][52059] Updated weights for policy 1, policy_version 82592 (0.0008) [2023-10-08 03:14:32,181][52060] Updated weights for policy 0, policy_version 81550 (0.0008) [2023-10-08 03:14:32,561][52060] Updated weights for policy 0, policy_version 81560 (0.0009) [2023-10-08 03:14:35,971][52059] Updated weights for policy 1, policy_version 82602 (0.0008) [2023-10-08 03:14:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 168099840. Throughput: 0: 1701.2, 1: 1729.0. Samples: 42034276. Policy #0 lag: (min: 31.0, avg: 31.2, max: 39.0) [2023-10-08 03:14:36,211][50642] Avg episode reward: [(0, '21.070'), (1, '26.200')] [2023-10-08 03:14:36,328][52059] Updated weights for policy 1, policy_version 82612 (0.0007) [2023-10-08 03:14:36,547][52060] Updated weights for policy 0, policy_version 81570 (0.0007) [2023-10-08 03:14:36,699][52059] Updated weights for policy 1, policy_version 82622 (0.0007) [2023-10-08 03:14:36,914][52060] Updated weights for policy 0, policy_version 81580 (0.0007) [2023-10-08 03:14:37,287][52060] Updated weights for policy 0, policy_version 81590 (0.0007) [2023-10-08 03:14:37,652][52060] Updated weights for policy 0, policy_version 81600 (0.0008) [2023-10-08 03:14:40,825][52059] Updated weights for policy 1, policy_version 82632 (0.0008) [2023-10-08 03:14:41,196][52059] Updated weights for policy 1, policy_version 82642 (0.0007) [2023-10-08 03:14:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 168165376. Throughput: 0: 1720.8, 1: 1753.1. Samples: 42055256. Policy #0 lag: (min: 31.0, avg: 31.2, max: 39.0) [2023-10-08 03:14:41,211][50642] Avg episode reward: [(0, '21.970'), (1, '24.350')] [2023-10-08 03:14:41,533][52060] Updated weights for policy 0, policy_version 81610 (0.0008) [2023-10-08 03:14:41,559][52059] Updated weights for policy 1, policy_version 82652 (0.0008) [2023-10-08 03:14:41,903][52060] Updated weights for policy 0, policy_version 81620 (0.0008) [2023-10-08 03:14:42,282][52060] Updated weights for policy 0, policy_version 81630 (0.0007) [2023-10-08 03:14:45,433][52059] Updated weights for policy 1, policy_version 82662 (0.0009) [2023-10-08 03:14:45,798][52059] Updated weights for policy 1, policy_version 82672 (0.0009) [2023-10-08 03:14:46,164][52059] Updated weights for policy 1, policy_version 82682 (0.0009) [2023-10-08 03:14:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 168230912. Throughput: 0: 1717.7, 1: 1736.1. Samples: 42075794. Policy #0 lag: (min: 31.0, avg: 31.2, max: 39.0) [2023-10-08 03:14:46,211][50642] Avg episode reward: [(0, '18.890'), (1, '24.280')] [2023-10-08 03:14:46,376][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000082688_84672512.pth... [2023-10-08 03:14:46,416][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000081056_83001344.pth [2023-10-08 03:14:46,511][52060] Updated weights for policy 0, policy_version 81640 (0.0009) [2023-10-08 03:14:46,874][52060] Updated weights for policy 0, policy_version 81650 (0.0007) [2023-10-08 03:14:47,250][52060] Updated weights for policy 0, policy_version 81660 (0.0007) [2023-10-08 03:14:47,392][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000081664_83623936.pth... [2023-10-08 03:14:47,432][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000080032_81952768.pth [2023-10-08 03:14:50,040][52059] Updated weights for policy 1, policy_version 82692 (0.0008) [2023-10-08 03:14:50,404][52059] Updated weights for policy 1, policy_version 82702 (0.0008) [2023-10-08 03:14:50,778][52059] Updated weights for policy 1, policy_version 82712 (0.0009) [2023-10-08 03:14:51,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 168329216. Throughput: 0: 1708.3, 1: 1747.1. Samples: 42085780. Policy #0 lag: (min: 31.0, avg: 31.2, max: 39.0) [2023-10-08 03:14:51,211][50642] Avg episode reward: [(0, '20.110'), (1, '21.720')] [2023-10-08 03:14:51,244][52060] Updated weights for policy 0, policy_version 81670 (0.0009) [2023-10-08 03:14:51,614][52060] Updated weights for policy 0, policy_version 81680 (0.0007) [2023-10-08 03:14:51,978][52060] Updated weights for policy 0, policy_version 81690 (0.0007) [2023-10-08 03:14:54,745][52059] Updated weights for policy 1, policy_version 82722 (0.0008) [2023-10-08 03:14:55,109][52059] Updated weights for policy 1, policy_version 82732 (0.0007) [2023-10-08 03:14:55,469][52059] Updated weights for policy 1, policy_version 82742 (0.0008) [2023-10-08 03:14:55,828][52059] Updated weights for policy 1, policy_version 82752 (0.0007) [2023-10-08 03:14:55,956][52060] Updated weights for policy 0, policy_version 81700 (0.0007) [2023-10-08 03:14:56,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 168394752. Throughput: 0: 1714.0, 1: 1745.3. Samples: 42107014. Policy #0 lag: (min: 31.0, avg: 31.2, max: 39.0) [2023-10-08 03:14:56,211][50642] Avg episode reward: [(0, '21.630'), (1, '25.900')] [2023-10-08 03:14:56,316][52060] Updated weights for policy 0, policy_version 81710 (0.0008) [2023-10-08 03:14:56,680][52060] Updated weights for policy 0, policy_version 81720 (0.0009) [2023-10-08 03:14:59,691][52059] Updated weights for policy 1, policy_version 82762 (0.0012) [2023-10-08 03:15:00,061][52059] Updated weights for policy 1, policy_version 82772 (0.0009) [2023-10-08 03:15:00,419][52059] Updated weights for policy 1, policy_version 82782 (0.0007) [2023-10-08 03:15:00,621][52060] Updated weights for policy 0, policy_version 81730 (0.0008) [2023-10-08 03:15:00,986][52060] Updated weights for policy 0, policy_version 81740 (0.0007) [2023-10-08 03:15:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 168460288. Throughput: 0: 1709.9, 1: 1729.8. Samples: 42127270. Policy #0 lag: (min: 31.0, avg: 31.2, max: 39.0) [2023-10-08 03:15:01,211][50642] Avg episode reward: [(0, '20.000'), (1, '24.770')] [2023-10-08 03:15:01,354][52060] Updated weights for policy 0, policy_version 81750 (0.0008) [2023-10-08 03:15:01,719][52060] Updated weights for policy 0, policy_version 81760 (0.0009) [2023-10-08 03:15:04,426][52059] Updated weights for policy 1, policy_version 82792 (0.0008) [2023-10-08 03:15:04,788][52059] Updated weights for policy 1, policy_version 82802 (0.0007) [2023-10-08 03:15:05,150][52059] Updated weights for policy 1, policy_version 82812 (0.0008) [2023-10-08 03:15:05,684][52060] Updated weights for policy 0, policy_version 81770 (0.0008) [2023-10-08 03:15:06,057][52060] Updated weights for policy 0, policy_version 81780 (0.0010) [2023-10-08 03:15:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 168525824. Throughput: 0: 1716.4, 1: 1752.9. Samples: 42138106. Policy #0 lag: (min: 31.0, avg: 31.2, max: 39.0) [2023-10-08 03:15:06,211][50642] Avg episode reward: [(0, '19.150'), (1, '23.900')] [2023-10-08 03:15:06,435][52060] Updated weights for policy 0, policy_version 81790 (0.0008) [2023-10-08 03:15:09,023][52059] Updated weights for policy 1, policy_version 82822 (0.0009) [2023-10-08 03:15:09,386][52059] Updated weights for policy 1, policy_version 82832 (0.0009) [2023-10-08 03:15:09,749][52059] Updated weights for policy 1, policy_version 82842 (0.0007) [2023-10-08 03:15:10,479][52060] Updated weights for policy 0, policy_version 81800 (0.0008) [2023-10-08 03:15:10,849][52060] Updated weights for policy 0, policy_version 81810 (0.0009) [2023-10-08 03:15:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 168591360. Throughput: 0: 1714.2, 1: 1727.2. Samples: 42158386. Policy #0 lag: (min: 31.0, avg: 31.2, max: 39.0) [2023-10-08 03:15:11,211][50642] Avg episode reward: [(0, '20.880'), (1, '23.460')] [2023-10-08 03:15:11,213][52060] Updated weights for policy 0, policy_version 81820 (0.0009) [2023-10-08 03:15:13,533][52059] Updated weights for policy 1, policy_version 82852 (0.0007) [2023-10-08 03:15:13,906][52059] Updated weights for policy 1, policy_version 82862 (0.0009) [2023-10-08 03:15:14,259][52059] Updated weights for policy 1, policy_version 82872 (0.0010) [2023-10-08 03:15:15,067][52060] Updated weights for policy 0, policy_version 81830 (0.0008) [2023-10-08 03:15:15,447][52060] Updated weights for policy 0, policy_version 81840 (0.0010) [2023-10-08 03:15:15,804][52060] Updated weights for policy 0, policy_version 81850 (0.0010) [2023-10-08 03:15:16,210][50642] Fps is (10 sec: 16384.4, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 168689664. Throughput: 0: 1692.9, 1: 1726.1. Samples: 42178758. Policy #0 lag: (min: 31.0, avg: 31.2, max: 39.0) [2023-10-08 03:15:16,211][50642] Avg episode reward: [(0, '21.790'), (1, '22.580')] [2023-10-08 03:15:18,193][52059] Updated weights for policy 1, policy_version 82882 (0.0010) [2023-10-08 03:15:18,561][52059] Updated weights for policy 1, policy_version 82892 (0.0009) [2023-10-08 03:15:18,926][52059] Updated weights for policy 1, policy_version 82902 (0.0009) [2023-10-08 03:15:19,282][52059] Updated weights for policy 1, policy_version 82912 (0.0010) [2023-10-08 03:15:19,793][52060] Updated weights for policy 0, policy_version 81860 (0.0007) [2023-10-08 03:15:20,174][52060] Updated weights for policy 0, policy_version 81870 (0.0007) [2023-10-08 03:15:20,540][52060] Updated weights for policy 0, policy_version 81880 (0.0010) [2023-10-08 03:15:21,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 168755200. Throughput: 0: 1716.1, 1: 1742.2. Samples: 42189900. Policy #0 lag: (min: 31.0, avg: 31.2, max: 39.0) [2023-10-08 03:15:21,211][50642] Avg episode reward: [(0, '19.660'), (1, '27.610')] [2023-10-08 03:15:23,148][52059] Updated weights for policy 1, policy_version 82922 (0.0007) [2023-10-08 03:15:23,505][52059] Updated weights for policy 1, policy_version 82932 (0.0007) [2023-10-08 03:15:23,867][52059] Updated weights for policy 1, policy_version 82942 (0.0008) [2023-10-08 03:15:24,554][52060] Updated weights for policy 0, policy_version 81890 (0.0011) [2023-10-08 03:15:24,929][52060] Updated weights for policy 0, policy_version 81900 (0.0007) [2023-10-08 03:15:25,293][52060] Updated weights for policy 0, policy_version 81910 (0.0008) [2023-10-08 03:15:25,662][52060] Updated weights for policy 0, policy_version 81920 (0.0007) [2023-10-08 03:15:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 168820736. Throughput: 0: 1708.4, 1: 1734.5. Samples: 42210186. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-08 03:15:26,211][50642] Avg episode reward: [(0, '18.320'), (1, '27.890')] [2023-10-08 03:15:27,808][52059] Updated weights for policy 1, policy_version 82952 (0.0008) [2023-10-08 03:15:28,168][52059] Updated weights for policy 1, policy_version 82962 (0.0008) [2023-10-08 03:15:28,530][52059] Updated weights for policy 1, policy_version 82972 (0.0007) [2023-10-08 03:15:29,678][52060] Updated weights for policy 0, policy_version 81930 (0.0007) [2023-10-08 03:15:30,051][52060] Updated weights for policy 0, policy_version 81940 (0.0007) [2023-10-08 03:15:30,412][52060] Updated weights for policy 0, policy_version 81950 (0.0007) [2023-10-08 03:15:31,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 168886272. Throughput: 0: 1687.5, 1: 1756.1. Samples: 42230754. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-08 03:15:31,211][50642] Avg episode reward: [(0, '20.360'), (1, '23.440')] [2023-10-08 03:15:32,450][52059] Updated weights for policy 1, policy_version 82982 (0.0008) [2023-10-08 03:15:32,799][52059] Updated weights for policy 1, policy_version 82992 (0.0008) [2023-10-08 03:15:33,177][52059] Updated weights for policy 1, policy_version 83002 (0.0010) [2023-10-08 03:15:34,446][52060] Updated weights for policy 0, policy_version 81960 (0.0007) [2023-10-08 03:15:34,813][52060] Updated weights for policy 0, policy_version 81970 (0.0008) [2023-10-08 03:15:35,182][52060] Updated weights for policy 0, policy_version 81980 (0.0009) [2023-10-08 03:15:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 168951808. Throughput: 0: 1721.2, 1: 1740.1. Samples: 42241540. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-08 03:15:36,211][50642] Avg episode reward: [(0, '19.660'), (1, '24.110')] [2023-10-08 03:15:37,113][52059] Updated weights for policy 1, policy_version 83012 (0.0011) [2023-10-08 03:15:37,478][52059] Updated weights for policy 1, policy_version 83022 (0.0010) [2023-10-08 03:15:37,840][52059] Updated weights for policy 1, policy_version 83032 (0.0010) [2023-10-08 03:15:39,008][52060] Updated weights for policy 0, policy_version 81990 (0.0009) [2023-10-08 03:15:39,367][52060] Updated weights for policy 0, policy_version 82000 (0.0009) [2023-10-08 03:15:39,741][52060] Updated weights for policy 0, policy_version 82010 (0.0008) [2023-10-08 03:15:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 169017344. Throughput: 0: 1700.3, 1: 1740.3. Samples: 42261838. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-08 03:15:41,211][50642] Avg episode reward: [(0, '19.240'), (1, '25.160')] [2023-10-08 03:15:41,677][52059] Updated weights for policy 1, policy_version 83042 (0.0010) [2023-10-08 03:15:42,042][52059] Updated weights for policy 1, policy_version 83052 (0.0010) [2023-10-08 03:15:42,400][52059] Updated weights for policy 1, policy_version 83062 (0.0007) [2023-10-08 03:15:42,762][52059] Updated weights for policy 1, policy_version 83072 (0.0007) [2023-10-08 03:15:43,795][52060] Updated weights for policy 0, policy_version 82020 (0.0009) [2023-10-08 03:15:44,164][52060] Updated weights for policy 0, policy_version 82030 (0.0010) [2023-10-08 03:15:44,538][52060] Updated weights for policy 0, policy_version 82040 (0.0010) [2023-10-08 03:15:46,211][50642] Fps is (10 sec: 13106.5, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 169082880. Throughput: 0: 1699.6, 1: 1756.7. Samples: 42282804. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-08 03:15:46,212][50642] Avg episode reward: [(0, '18.250'), (1, '26.120')] [2023-10-08 03:15:46,801][52059] Updated weights for policy 1, policy_version 83082 (0.0009) [2023-10-08 03:15:47,169][52059] Updated weights for policy 1, policy_version 83092 (0.0008) [2023-10-08 03:15:47,536][52059] Updated weights for policy 1, policy_version 83102 (0.0007) [2023-10-08 03:15:48,526][52060] Updated weights for policy 0, policy_version 82050 (0.0007) [2023-10-08 03:15:48,907][52060] Updated weights for policy 0, policy_version 82060 (0.0008) [2023-10-08 03:15:49,270][52060] Updated weights for policy 0, policy_version 82070 (0.0010) [2023-10-08 03:15:49,644][52060] Updated weights for policy 0, policy_version 82080 (0.0011) [2023-10-08 03:15:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 169148416. Throughput: 0: 1716.5, 1: 1730.9. Samples: 42293234. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-08 03:15:51,211][50642] Avg episode reward: [(0, '20.720'), (1, '23.150')] [2023-10-08 03:15:51,480][52059] Updated weights for policy 1, policy_version 83112 (0.0009) [2023-10-08 03:15:51,848][52059] Updated weights for policy 1, policy_version 83122 (0.0009) [2023-10-08 03:15:52,213][52059] Updated weights for policy 1, policy_version 83132 (0.0008) [2023-10-08 03:15:53,559][52060] Updated weights for policy 0, policy_version 82090 (0.0008) [2023-10-08 03:15:53,930][52060] Updated weights for policy 0, policy_version 82100 (0.0008) [2023-10-08 03:15:54,303][52060] Updated weights for policy 0, policy_version 82110 (0.0009) [2023-10-08 03:15:56,015][52059] Updated weights for policy 1, policy_version 83142 (0.0008) [2023-10-08 03:15:56,210][50642] Fps is (10 sec: 13107.9, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 169213952. Throughput: 0: 1695.5, 1: 1756.6. Samples: 42313730. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-08 03:15:56,211][50642] Avg episode reward: [(0, '20.030'), (1, '23.800')] [2023-10-08 03:15:56,385][52059] Updated weights for policy 1, policy_version 83152 (0.0010) [2023-10-08 03:15:56,762][52059] Updated weights for policy 1, policy_version 83162 (0.0008) [2023-10-08 03:15:58,207][52060] Updated weights for policy 0, policy_version 82120 (0.0008) [2023-10-08 03:15:58,582][52060] Updated weights for policy 0, policy_version 82130 (0.0007) [2023-10-08 03:15:58,946][52060] Updated weights for policy 0, policy_version 82140 (0.0007) [2023-10-08 03:16:00,527][52059] Updated weights for policy 1, policy_version 83172 (0.0008) [2023-10-08 03:16:00,896][52059] Updated weights for policy 1, policy_version 83182 (0.0007) [2023-10-08 03:16:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 169279488. Throughput: 0: 1714.8, 1: 1752.3. Samples: 42334778. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-08 03:16:01,211][50642] Avg episode reward: [(0, '17.120'), (1, '22.280')] [2023-10-08 03:16:01,252][52059] Updated weights for policy 1, policy_version 83192 (0.0009) [2023-10-08 03:16:02,838][52060] Updated weights for policy 0, policy_version 82150 (0.0007) [2023-10-08 03:16:03,209][52060] Updated weights for policy 0, policy_version 82160 (0.0007) [2023-10-08 03:16:03,580][52060] Updated weights for policy 0, policy_version 82170 (0.0008) [2023-10-08 03:16:05,137][52059] Updated weights for policy 1, policy_version 83202 (0.0008) [2023-10-08 03:16:05,563][52059] Updated weights for policy 1, policy_version 83212 (0.0009) [2023-10-08 03:16:05,926][52059] Updated weights for policy 1, policy_version 83222 (0.0007) [2023-10-08 03:16:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 169345024. Throughput: 0: 1695.0, 1: 1754.3. Samples: 42345118. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-08 03:16:06,211][50642] Avg episode reward: [(0, '17.830'), (1, '26.540')] [2023-10-08 03:16:06,291][52059] Updated weights for policy 1, policy_version 83232 (0.0007) [2023-10-08 03:16:07,564][52060] Updated weights for policy 0, policy_version 82180 (0.0008) [2023-10-08 03:16:07,929][52060] Updated weights for policy 0, policy_version 82190 (0.0008) [2023-10-08 03:16:08,296][52060] Updated weights for policy 0, policy_version 82200 (0.0008) [2023-10-08 03:16:10,281][52059] Updated weights for policy 1, policy_version 83242 (0.0010) [2023-10-08 03:16:10,653][52059] Updated weights for policy 1, policy_version 83252 (0.0009) [2023-10-08 03:16:11,018][52059] Updated weights for policy 1, policy_version 83262 (0.0011) [2023-10-08 03:16:11,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 169443328. Throughput: 0: 1709.8, 1: 1758.8. Samples: 42366276. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-08 03:16:11,211][50642] Avg episode reward: [(0, '20.120'), (1, '26.820')] [2023-10-08 03:16:12,136][52060] Updated weights for policy 0, policy_version 82210 (0.0009) [2023-10-08 03:16:12,499][52060] Updated weights for policy 0, policy_version 82220 (0.0007) [2023-10-08 03:16:12,866][52060] Updated weights for policy 0, policy_version 82230 (0.0007) [2023-10-08 03:16:13,227][52060] Updated weights for policy 0, policy_version 82240 (0.0007) [2023-10-08 03:16:14,946][52059] Updated weights for policy 1, policy_version 83272 (0.0009) [2023-10-08 03:16:15,316][52059] Updated weights for policy 1, policy_version 83282 (0.0007) [2023-10-08 03:16:15,675][52059] Updated weights for policy 1, policy_version 83292 (0.0009) [2023-10-08 03:16:16,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 169508864. Throughput: 0: 1735.7, 1: 1726.9. Samples: 42386570. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) [2023-10-08 03:16:16,211][50642] Avg episode reward: [(0, '17.060'), (1, '23.330')] [2023-10-08 03:16:17,193][52060] Updated weights for policy 0, policy_version 82250 (0.0007) [2023-10-08 03:16:17,562][52060] Updated weights for policy 0, policy_version 82260 (0.0011) [2023-10-08 03:16:17,926][52060] Updated weights for policy 0, policy_version 82270 (0.0008) [2023-10-08 03:16:19,412][52059] Updated weights for policy 1, policy_version 83302 (0.0008) [2023-10-08 03:16:19,781][52059] Updated weights for policy 1, policy_version 83312 (0.0007) [2023-10-08 03:16:20,145][52059] Updated weights for policy 1, policy_version 83322 (0.0008) [2023-10-08 03:16:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 169574400. Throughput: 0: 1704.0, 1: 1760.8. Samples: 42397458. Policy #0 lag: (min: 9.0, avg: 17.1, max: 41.0) [2023-10-08 03:16:21,211][50642] Avg episode reward: [(0, '17.120'), (1, '23.900')] [2023-10-08 03:16:21,932][52060] Updated weights for policy 0, policy_version 82280 (0.0010) [2023-10-08 03:16:22,302][52060] Updated weights for policy 0, policy_version 82290 (0.0007) [2023-10-08 03:16:22,667][52060] Updated weights for policy 0, policy_version 82300 (0.0007) [2023-10-08 03:16:24,154][52059] Updated weights for policy 1, policy_version 83332 (0.0010) [2023-10-08 03:16:24,517][52059] Updated weights for policy 1, policy_version 83342 (0.0010) [2023-10-08 03:16:24,881][52059] Updated weights for policy 1, policy_version 83352 (0.0010) [2023-10-08 03:16:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 169639936. Throughput: 0: 1731.7, 1: 1738.4. Samples: 42417992. Policy #0 lag: (min: 9.0, avg: 17.1, max: 41.0) [2023-10-08 03:16:26,211][50642] Avg episode reward: [(0, '17.730'), (1, '25.530')] [2023-10-08 03:16:26,509][52060] Updated weights for policy 0, policy_version 82310 (0.0007) [2023-10-08 03:16:26,878][52060] Updated weights for policy 0, policy_version 82320 (0.0008) [2023-10-08 03:16:27,237][52060] Updated weights for policy 0, policy_version 82330 (0.0011) [2023-10-08 03:16:28,950][52059] Updated weights for policy 1, policy_version 83362 (0.0008) [2023-10-08 03:16:29,313][52059] Updated weights for policy 1, policy_version 83372 (0.0009) [2023-10-08 03:16:29,674][52059] Updated weights for policy 1, policy_version 83382 (0.0008) [2023-10-08 03:16:30,036][52059] Updated weights for policy 1, policy_version 83392 (0.0008) [2023-10-08 03:16:31,136][52060] Updated weights for policy 0, policy_version 82340 (0.0008) [2023-10-08 03:16:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 169705472. Throughput: 0: 1739.7, 1: 1734.4. Samples: 42439136. Policy #0 lag: (min: 9.0, avg: 17.1, max: 41.0) [2023-10-08 03:16:31,211][50642] Avg episode reward: [(0, '23.410'), (1, '26.730')] [2023-10-08 03:16:31,502][52060] Updated weights for policy 0, policy_version 82350 (0.0007) [2023-10-08 03:16:31,870][52060] Updated weights for policy 0, policy_version 82360 (0.0009) [2023-10-08 03:16:33,810][52059] Updated weights for policy 1, policy_version 83402 (0.0010) [2023-10-08 03:16:34,158][52059] Updated weights for policy 1, policy_version 83412 (0.0009) [2023-10-08 03:16:34,524][52059] Updated weights for policy 1, policy_version 83422 (0.0007) [2023-10-08 03:16:35,801][52060] Updated weights for policy 0, policy_version 82370 (0.0008) [2023-10-08 03:16:36,156][52060] Updated weights for policy 0, policy_version 82380 (0.0007) [2023-10-08 03:16:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 169771008. Throughput: 0: 1719.9, 1: 1754.8. Samples: 42449600. Policy #0 lag: (min: 9.0, avg: 17.1, max: 41.0) [2023-10-08 03:16:36,211][50642] Avg episode reward: [(0, '18.710'), (1, '24.040')] [2023-10-08 03:16:36,523][52060] Updated weights for policy 0, policy_version 82390 (0.0010) [2023-10-08 03:16:36,895][52060] Updated weights for policy 0, policy_version 82400 (0.0008) [2023-10-08 03:16:38,446][52059] Updated weights for policy 1, policy_version 83432 (0.0009) [2023-10-08 03:16:38,801][52059] Updated weights for policy 1, policy_version 83442 (0.0007) [2023-10-08 03:16:39,166][52059] Updated weights for policy 1, policy_version 83452 (0.0007) [2023-10-08 03:16:40,998][52060] Updated weights for policy 0, policy_version 82410 (0.0009) [2023-10-08 03:16:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 169836544. Throughput: 0: 1740.4, 1: 1737.2. Samples: 42470224. Policy #0 lag: (min: 9.0, avg: 17.1, max: 41.0) [2023-10-08 03:16:41,211][50642] Avg episode reward: [(0, '19.000'), (1, '24.370')] [2023-10-08 03:16:41,357][52060] Updated weights for policy 0, policy_version 82420 (0.0012) [2023-10-08 03:16:41,723][52060] Updated weights for policy 0, policy_version 82430 (0.0011) [2023-10-08 03:16:43,060][52059] Updated weights for policy 1, policy_version 83462 (0.0007) [2023-10-08 03:16:43,432][52059] Updated weights for policy 1, policy_version 83472 (0.0008) [2023-10-08 03:16:43,800][52059] Updated weights for policy 1, policy_version 83482 (0.0010) [2023-10-08 03:16:45,652][52060] Updated weights for policy 0, policy_version 82440 (0.0008) [2023-10-08 03:16:46,017][52060] Updated weights for policy 0, policy_version 82450 (0.0007) [2023-10-08 03:16:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 169902080. Throughput: 0: 1734.2, 1: 1743.2. Samples: 42491260. Policy #0 lag: (min: 9.0, avg: 17.1, max: 41.0) [2023-10-08 03:16:46,211][50642] Avg episode reward: [(0, '20.400'), (1, '23.730')] [2023-10-08 03:16:46,220][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000083488_85491712.pth... [2023-10-08 03:16:46,251][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000081856_83820544.pth [2023-10-08 03:16:46,389][52060] Updated weights for policy 0, policy_version 82460 (0.0008) [2023-10-08 03:16:46,538][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000082464_84443136.pth... [2023-10-08 03:16:46,567][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000080832_82771968.pth [2023-10-08 03:16:47,591][52059] Updated weights for policy 1, policy_version 83492 (0.0010) [2023-10-08 03:16:47,960][52059] Updated weights for policy 1, policy_version 83502 (0.0010) [2023-10-08 03:16:48,320][52059] Updated weights for policy 1, policy_version 83512 (0.0011) [2023-10-08 03:16:50,438][52060] Updated weights for policy 0, policy_version 82470 (0.0008) [2023-10-08 03:16:50,813][52060] Updated weights for policy 0, policy_version 82480 (0.0008) [2023-10-08 03:16:51,186][52060] Updated weights for policy 0, policy_version 82490 (0.0009) [2023-10-08 03:16:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 169967616. Throughput: 0: 1737.5, 1: 1726.9. Samples: 42501016. Policy #0 lag: (min: 9.0, avg: 17.1, max: 41.0) [2023-10-08 03:16:51,211][50642] Avg episode reward: [(0, '23.690'), (1, '27.100')] [2023-10-08 03:16:52,193][52059] Updated weights for policy 1, policy_version 83522 (0.0008) [2023-10-08 03:16:52,563][52059] Updated weights for policy 1, policy_version 83532 (0.0008) [2023-10-08 03:16:52,931][52059] Updated weights for policy 1, policy_version 83542 (0.0008) [2023-10-08 03:16:53,291][52059] Updated weights for policy 1, policy_version 83552 (0.0009) [2023-10-08 03:16:55,287][52060] Updated weights for policy 0, policy_version 82500 (0.0008) [2023-10-08 03:16:55,676][52060] Updated weights for policy 0, policy_version 82510 (0.0010) [2023-10-08 03:16:56,040][52060] Updated weights for policy 0, policy_version 82520 (0.0011) [2023-10-08 03:16:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 170033152. Throughput: 0: 1733.9, 1: 1734.8. Samples: 42522368. Policy #0 lag: (min: 9.0, avg: 17.1, max: 41.0) [2023-10-08 03:16:56,211][50642] Avg episode reward: [(0, '16.950'), (1, '27.240')] [2023-10-08 03:16:57,529][52059] Updated weights for policy 1, policy_version 83562 (0.0007) [2023-10-08 03:16:57,892][52059] Updated weights for policy 1, policy_version 83572 (0.0007) [2023-10-08 03:16:58,257][52059] Updated weights for policy 1, policy_version 83582 (0.0008) [2023-10-08 03:17:00,046][52060] Updated weights for policy 0, policy_version 82530 (0.0009) [2023-10-08 03:17:00,410][52060] Updated weights for policy 0, policy_version 82540 (0.0007) [2023-10-08 03:17:00,777][52060] Updated weights for policy 0, policy_version 82550 (0.0009) [2023-10-08 03:17:01,142][52060] Updated weights for policy 0, policy_version 82560 (0.0011) [2023-10-08 03:17:01,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 170131456. Throughput: 0: 1704.4, 1: 1760.2. Samples: 42542480. Policy #0 lag: (min: 9.0, avg: 17.1, max: 41.0) [2023-10-08 03:17:01,211][50642] Avg episode reward: [(0, '19.040'), (1, '25.440')] [2023-10-08 03:17:02,067][52059] Updated weights for policy 1, policy_version 83592 (0.0008) [2023-10-08 03:17:02,427][52059] Updated weights for policy 1, policy_version 83602 (0.0010) [2023-10-08 03:17:02,789][52059] Updated weights for policy 1, policy_version 83612 (0.0010) [2023-10-08 03:17:05,069][52060] Updated weights for policy 0, policy_version 82570 (0.0008) [2023-10-08 03:17:05,454][52060] Updated weights for policy 0, policy_version 82580 (0.0008) [2023-10-08 03:17:05,817][52060] Updated weights for policy 0, policy_version 82590 (0.0007) [2023-10-08 03:17:06,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 170196992. Throughput: 0: 1727.9, 1: 1725.6. Samples: 42552862. Policy #0 lag: (min: 9.0, avg: 17.1, max: 41.0) [2023-10-08 03:17:06,211][50642] Avg episode reward: [(0, '19.460'), (1, '24.460')] [2023-10-08 03:17:06,702][52059] Updated weights for policy 1, policy_version 83622 (0.0008) [2023-10-08 03:17:07,072][52059] Updated weights for policy 1, policy_version 83632 (0.0008) [2023-10-08 03:17:07,428][52059] Updated weights for policy 1, policy_version 83642 (0.0007) [2023-10-08 03:17:09,874][52060] Updated weights for policy 0, policy_version 82600 (0.0007) [2023-10-08 03:17:10,241][52060] Updated weights for policy 0, policy_version 82610 (0.0008) [2023-10-08 03:17:10,615][52060] Updated weights for policy 0, policy_version 82620 (0.0007) [2023-10-08 03:17:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 170262528. Throughput: 0: 1716.2, 1: 1745.9. Samples: 42573786. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:17:11,211][50642] Avg episode reward: [(0, '22.290'), (1, '26.970')] [2023-10-08 03:17:11,397][52059] Updated weights for policy 1, policy_version 83652 (0.0007) [2023-10-08 03:17:11,754][52059] Updated weights for policy 1, policy_version 83662 (0.0009) [2023-10-08 03:17:12,115][52059] Updated weights for policy 1, policy_version 83672 (0.0007) [2023-10-08 03:17:14,522][52060] Updated weights for policy 0, policy_version 82630 (0.0011) [2023-10-08 03:17:14,885][52060] Updated weights for policy 0, policy_version 82640 (0.0007) [2023-10-08 03:17:15,257][52060] Updated weights for policy 0, policy_version 82650 (0.0007) [2023-10-08 03:17:16,015][52059] Updated weights for policy 1, policy_version 83682 (0.0007) [2023-10-08 03:17:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 170328064. Throughput: 0: 1690.1, 1: 1755.9. Samples: 42594204. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:17:16,211][50642] Avg episode reward: [(0, '17.500'), (1, '25.060')] [2023-10-08 03:17:16,375][52059] Updated weights for policy 1, policy_version 83692 (0.0009) [2023-10-08 03:17:16,736][52059] Updated weights for policy 1, policy_version 83702 (0.0012) [2023-10-08 03:17:17,101][52059] Updated weights for policy 1, policy_version 83712 (0.0010) [2023-10-08 03:17:19,062][52060] Updated weights for policy 0, policy_version 82660 (0.0009) [2023-10-08 03:17:19,429][52060] Updated weights for policy 0, policy_version 82670 (0.0008) [2023-10-08 03:17:19,802][52060] Updated weights for policy 0, policy_version 82680 (0.0008) [2023-10-08 03:17:20,806][52059] Updated weights for policy 1, policy_version 83722 (0.0008) [2023-10-08 03:17:21,172][52059] Updated weights for policy 1, policy_version 83732 (0.0007) [2023-10-08 03:17:21,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 170393600. Throughput: 0: 1716.9, 1: 1737.7. Samples: 42605056. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:17:21,211][50642] Avg episode reward: [(0, '17.860'), (1, '27.520')] [2023-10-08 03:17:21,537][52059] Updated weights for policy 1, policy_version 83742 (0.0008) [2023-10-08 03:17:23,817][52060] Updated weights for policy 0, policy_version 82690 (0.0007) [2023-10-08 03:17:24,180][52060] Updated weights for policy 0, policy_version 82700 (0.0007) [2023-10-08 03:17:24,553][52060] Updated weights for policy 0, policy_version 82710 (0.0009) [2023-10-08 03:17:24,925][52060] Updated weights for policy 0, policy_version 82720 (0.0008) [2023-10-08 03:17:25,492][52059] Updated weights for policy 1, policy_version 83752 (0.0008) [2023-10-08 03:17:25,855][52059] Updated weights for policy 1, policy_version 83762 (0.0007) [2023-10-08 03:17:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 170459136. Throughput: 0: 1691.4, 1: 1751.0. Samples: 42625134. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:17:26,211][50642] Avg episode reward: [(0, '20.810'), (1, '25.880')] [2023-10-08 03:17:26,226][52059] Updated weights for policy 1, policy_version 83772 (0.0007) [2023-10-08 03:17:28,938][52060] Updated weights for policy 0, policy_version 82730 (0.0008) [2023-10-08 03:17:29,303][52060] Updated weights for policy 0, policy_version 82740 (0.0008) [2023-10-08 03:17:29,668][52060] Updated weights for policy 0, policy_version 82750 (0.0008) [2023-10-08 03:17:30,009][52059] Updated weights for policy 1, policy_version 83782 (0.0007) [2023-10-08 03:17:30,378][52059] Updated weights for policy 1, policy_version 83792 (0.0007) [2023-10-08 03:17:30,750][52059] Updated weights for policy 1, policy_version 83802 (0.0008) [2023-10-08 03:17:31,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 170557440. Throughput: 0: 1695.7, 1: 1725.6. Samples: 42645220. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:17:31,211][50642] Avg episode reward: [(0, '21.370'), (1, '24.230')] [2023-10-08 03:17:33,621][52060] Updated weights for policy 0, policy_version 82760 (0.0009) [2023-10-08 03:17:33,979][52060] Updated weights for policy 0, policy_version 82770 (0.0010) [2023-10-08 03:17:34,359][52060] Updated weights for policy 0, policy_version 82780 (0.0009) [2023-10-08 03:17:34,673][52059] Updated weights for policy 1, policy_version 83812 (0.0009) [2023-10-08 03:17:35,032][52059] Updated weights for policy 1, policy_version 83822 (0.0007) [2023-10-08 03:17:35,390][52059] Updated weights for policy 1, policy_version 83832 (0.0007) [2023-10-08 03:17:36,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 170622976. Throughput: 0: 1707.4, 1: 1747.1. Samples: 42656468. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:17:36,211][50642] Avg episode reward: [(0, '18.650'), (1, '25.250')] [2023-10-08 03:17:38,389][52060] Updated weights for policy 0, policy_version 82790 (0.0008) [2023-10-08 03:17:38,769][52060] Updated weights for policy 0, policy_version 82800 (0.0008) [2023-10-08 03:17:39,133][52060] Updated weights for policy 0, policy_version 82810 (0.0009) [2023-10-08 03:17:39,453][52059] Updated weights for policy 1, policy_version 83842 (0.0009) [2023-10-08 03:17:39,810][52059] Updated weights for policy 1, policy_version 83852 (0.0010) [2023-10-08 03:17:40,181][52059] Updated weights for policy 1, policy_version 83862 (0.0007) [2023-10-08 03:17:40,547][52059] Updated weights for policy 1, policy_version 83872 (0.0008) [2023-10-08 03:17:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13884.8). Total num frames: 170688512. Throughput: 0: 1693.8, 1: 1734.4. Samples: 42676640. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:17:41,211][50642] Avg episode reward: [(0, '18.630'), (1, '26.330')] [2023-10-08 03:17:43,215][52060] Updated weights for policy 0, policy_version 82820 (0.0008) [2023-10-08 03:17:43,604][52060] Updated weights for policy 0, policy_version 82830 (0.0008) [2023-10-08 03:17:43,969][52060] Updated weights for policy 0, policy_version 82840 (0.0011) [2023-10-08 03:17:44,488][52059] Updated weights for policy 1, policy_version 83882 (0.0008) [2023-10-08 03:17:44,846][52059] Updated weights for policy 1, policy_version 83892 (0.0010) [2023-10-08 03:17:45,213][52059] Updated weights for policy 1, policy_version 83902 (0.0007) [2023-10-08 03:17:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 170754048. Throughput: 0: 1718.0, 1: 1717.9. Samples: 42697098. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:17:46,211][50642] Avg episode reward: [(0, '24.500'), (1, '24.820')] [2023-10-08 03:17:47,992][52060] Updated weights for policy 0, policy_version 82850 (0.0007) [2023-10-08 03:17:48,364][52060] Updated weights for policy 0, policy_version 82860 (0.0010) [2023-10-08 03:17:48,730][52060] Updated weights for policy 0, policy_version 82870 (0.0009) [2023-10-08 03:17:49,020][52059] Updated weights for policy 1, policy_version 83912 (0.0008) [2023-10-08 03:17:49,092][52060] Updated weights for policy 0, policy_version 82880 (0.0008) [2023-10-08 03:17:49,382][52059] Updated weights for policy 1, policy_version 83922 (0.0008) [2023-10-08 03:17:49,748][52059] Updated weights for policy 1, policy_version 83932 (0.0007) [2023-10-08 03:17:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 170819584. Throughput: 0: 1697.8, 1: 1747.0. Samples: 42707880. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:17:51,211][50642] Avg episode reward: [(0, '19.770'), (1, '25.590')] [2023-10-08 03:17:53,024][52060] Updated weights for policy 0, policy_version 82890 (0.0007) [2023-10-08 03:17:53,398][52060] Updated weights for policy 0, policy_version 82900 (0.0009) [2023-10-08 03:17:53,669][52059] Updated weights for policy 1, policy_version 83942 (0.0010) [2023-10-08 03:17:53,765][52060] Updated weights for policy 0, policy_version 82910 (0.0008) [2023-10-08 03:17:54,041][52059] Updated weights for policy 1, policy_version 83952 (0.0009) [2023-10-08 03:17:54,411][52059] Updated weights for policy 1, policy_version 83962 (0.0010) [2023-10-08 03:17:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 170885120. Throughput: 0: 1696.9, 1: 1725.4. Samples: 42727792. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:17:56,211][50642] Avg episode reward: [(0, '18.630'), (1, '26.480')] [2023-10-08 03:17:57,643][52060] Updated weights for policy 0, policy_version 82920 (0.0010) [2023-10-08 03:17:58,012][52060] Updated weights for policy 0, policy_version 82930 (0.0008) [2023-10-08 03:17:58,120][52059] Updated weights for policy 1, policy_version 83972 (0.0008) [2023-10-08 03:17:58,379][52060] Updated weights for policy 0, policy_version 82940 (0.0008) [2023-10-08 03:17:58,491][52059] Updated weights for policy 1, policy_version 83982 (0.0008) [2023-10-08 03:17:58,849][52059] Updated weights for policy 1, policy_version 83992 (0.0009) [2023-10-08 03:18:01,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13884.7). Total num frames: 170950656. Throughput: 0: 1717.2, 1: 1722.6. Samples: 42748992. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:18:01,211][50642] Avg episode reward: [(0, '18.590'), (1, '25.610')] [2023-10-08 03:18:02,346][52060] Updated weights for policy 0, policy_version 82950 (0.0011) [2023-10-08 03:18:02,715][52060] Updated weights for policy 0, policy_version 82960 (0.0009) [2023-10-08 03:18:02,910][52059] Updated weights for policy 1, policy_version 84002 (0.0008) [2023-10-08 03:18:03,083][52060] Updated weights for policy 0, policy_version 82970 (0.0007) [2023-10-08 03:18:03,270][52059] Updated weights for policy 1, policy_version 84012 (0.0008) [2023-10-08 03:18:03,641][52059] Updated weights for policy 1, policy_version 84022 (0.0008) [2023-10-08 03:18:04,002][52059] Updated weights for policy 1, policy_version 84032 (0.0008) [2023-10-08 03:18:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 171016192. Throughput: 0: 1684.7, 1: 1725.6. Samples: 42758522. Policy #0 lag: (min: 9.0, avg: 12.9, max: 41.0) [2023-10-08 03:18:06,211][50642] Avg episode reward: [(0, '23.450'), (1, '27.110')] [2023-10-08 03:18:07,006][52060] Updated weights for policy 0, policy_version 82980 (0.0008) [2023-10-08 03:18:07,382][52060] Updated weights for policy 0, policy_version 82990 (0.0008) [2023-10-08 03:18:07,748][52060] Updated weights for policy 0, policy_version 83000 (0.0007) [2023-10-08 03:18:07,967][52059] Updated weights for policy 1, policy_version 84042 (0.0008) [2023-10-08 03:18:08,333][52059] Updated weights for policy 1, policy_version 84052 (0.0009) [2023-10-08 03:18:08,684][52059] Updated weights for policy 1, policy_version 84062 (0.0008) [2023-10-08 03:18:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13884.7). Total num frames: 171081728. Throughput: 0: 1712.7, 1: 1720.4. Samples: 42779624. Policy #0 lag: (min: 9.0, avg: 12.9, max: 41.0) [2023-10-08 03:18:11,211][50642] Avg episode reward: [(0, '19.280'), (1, '26.260')] [2023-10-08 03:18:11,928][52060] Updated weights for policy 0, policy_version 83010 (0.0008) [2023-10-08 03:18:12,288][52060] Updated weights for policy 0, policy_version 83020 (0.0009) [2023-10-08 03:18:12,653][52060] Updated weights for policy 0, policy_version 83030 (0.0007) [2023-10-08 03:18:12,735][52059] Updated weights for policy 1, policy_version 84072 (0.0008) [2023-10-08 03:18:13,020][52060] Updated weights for policy 0, policy_version 83040 (0.0009) [2023-10-08 03:18:13,098][52059] Updated weights for policy 1, policy_version 84082 (0.0008) [2023-10-08 03:18:13,456][52059] Updated weights for policy 1, policy_version 84092 (0.0008) [2023-10-08 03:18:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 171147264. Throughput: 0: 1714.9, 1: 1741.5. Samples: 42800758. Policy #0 lag: (min: 9.0, avg: 12.9, max: 41.0) [2023-10-08 03:18:16,211][50642] Avg episode reward: [(0, '18.000'), (1, '25.050')] [2023-10-08 03:18:16,988][52060] Updated weights for policy 0, policy_version 83050 (0.0010) [2023-10-08 03:18:17,365][52060] Updated weights for policy 0, policy_version 83060 (0.0008) [2023-10-08 03:18:17,383][52059] Updated weights for policy 1, policy_version 84102 (0.0008) [2023-10-08 03:18:17,730][52060] Updated weights for policy 0, policy_version 83070 (0.0007) [2023-10-08 03:18:17,745][52059] Updated weights for policy 1, policy_version 84112 (0.0009) [2023-10-08 03:18:18,102][52059] Updated weights for policy 1, policy_version 84122 (0.0008) [2023-10-08 03:18:21,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 171212800. Throughput: 0: 1695.7, 1: 1723.8. Samples: 42810348. Policy #0 lag: (min: 9.0, avg: 12.9, max: 41.0) [2023-10-08 03:18:21,211][50642] Avg episode reward: [(0, '19.350'), (1, '25.970')] [2023-10-08 03:18:21,925][52060] Updated weights for policy 0, policy_version 83080 (0.0009) [2023-10-08 03:18:22,129][52059] Updated weights for policy 1, policy_version 84132 (0.0008) [2023-10-08 03:18:22,294][52060] Updated weights for policy 0, policy_version 83090 (0.0009) [2023-10-08 03:18:22,490][52059] Updated weights for policy 1, policy_version 84142 (0.0007) [2023-10-08 03:18:22,660][52060] Updated weights for policy 0, policy_version 83100 (0.0009) [2023-10-08 03:18:22,862][52059] Updated weights for policy 1, policy_version 84152 (0.0007) [2023-10-08 03:18:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 171278336. Throughput: 0: 1713.5, 1: 1733.2. Samples: 42831740. Policy #0 lag: (min: 9.0, avg: 12.9, max: 41.0) [2023-10-08 03:18:26,211][50642] Avg episode reward: [(0, '21.690'), (1, '24.860')] [2023-10-08 03:18:26,640][52060] Updated weights for policy 0, policy_version 83110 (0.0010) [2023-10-08 03:18:26,843][52059] Updated weights for policy 1, policy_version 84162 (0.0008) [2023-10-08 03:18:27,016][52060] Updated weights for policy 0, policy_version 83120 (0.0007) [2023-10-08 03:18:27,201][52059] Updated weights for policy 1, policy_version 84172 (0.0008) [2023-10-08 03:18:27,390][52060] Updated weights for policy 0, policy_version 83130 (0.0007) [2023-10-08 03:18:27,570][52059] Updated weights for policy 1, policy_version 84182 (0.0010) [2023-10-08 03:18:27,934][52059] Updated weights for policy 1, policy_version 84192 (0.0008) [2023-10-08 03:18:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 171343872. Throughput: 0: 1708.2, 1: 1750.0. Samples: 42852718. Policy #0 lag: (min: 9.0, avg: 12.9, max: 41.0) [2023-10-08 03:18:31,211][50642] Avg episode reward: [(0, '19.420'), (1, '23.910')] [2023-10-08 03:18:31,510][52060] Updated weights for policy 0, policy_version 83140 (0.0008) [2023-10-08 03:18:31,806][52059] Updated weights for policy 1, policy_version 84202 (0.0009) [2023-10-08 03:18:31,878][52060] Updated weights for policy 0, policy_version 83150 (0.0009) [2023-10-08 03:18:32,164][52059] Updated weights for policy 1, policy_version 84212 (0.0008) [2023-10-08 03:18:32,232][52060] Updated weights for policy 0, policy_version 83160 (0.0007) [2023-10-08 03:18:32,535][52059] Updated weights for policy 1, policy_version 84222 (0.0010) [2023-10-08 03:18:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 171409408. Throughput: 0: 1703.1, 1: 1721.1. Samples: 42861968. Policy #0 lag: (min: 9.0, avg: 12.9, max: 41.0) [2023-10-08 03:18:36,211][50642] Avg episode reward: [(0, '19.060'), (1, '24.960')] [2023-10-08 03:18:36,226][52060] Updated weights for policy 0, policy_version 83170 (0.0007) [2023-10-08 03:18:36,604][52060] Updated weights for policy 0, policy_version 83180 (0.0009) [2023-10-08 03:18:36,604][52059] Updated weights for policy 1, policy_version 84232 (0.0008) [2023-10-08 03:18:36,963][52060] Updated weights for policy 0, policy_version 83190 (0.0007) [2023-10-08 03:18:36,966][52059] Updated weights for policy 1, policy_version 84242 (0.0008) [2023-10-08 03:18:37,338][52060] Updated weights for policy 0, policy_version 83200 (0.0007) [2023-10-08 03:18:37,338][52059] Updated weights for policy 1, policy_version 84252 (0.0009) [2023-10-08 03:18:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 171474944. Throughput: 0: 1708.1, 1: 1742.8. Samples: 42883080. Policy #0 lag: (min: 9.0, avg: 12.9, max: 41.0) [2023-10-08 03:18:41,211][50642] Avg episode reward: [(0, '19.940'), (1, '23.530')] [2023-10-08 03:18:41,249][52059] Updated weights for policy 1, policy_version 84262 (0.0008) [2023-10-08 03:18:41,332][52060] Updated weights for policy 0, policy_version 83210 (0.0009) [2023-10-08 03:18:41,617][52059] Updated weights for policy 1, policy_version 84272 (0.0007) [2023-10-08 03:18:41,697][52060] Updated weights for policy 0, policy_version 83220 (0.0010) [2023-10-08 03:18:41,973][52059] Updated weights for policy 1, policy_version 84282 (0.0007) [2023-10-08 03:18:42,063][52060] Updated weights for policy 0, policy_version 83230 (0.0008) [2023-10-08 03:18:45,853][52059] Updated weights for policy 1, policy_version 84292 (0.0007) [2023-10-08 03:18:46,001][52060] Updated weights for policy 0, policy_version 83240 (0.0009) [2023-10-08 03:18:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 171540480. Throughput: 0: 1706.0, 1: 1739.5. Samples: 42904036. Policy #0 lag: (min: 9.0, avg: 12.9, max: 41.0) [2023-10-08 03:18:46,211][50642] Avg episode reward: [(0, '20.690'), (1, '25.600')] [2023-10-08 03:18:46,219][52059] Updated weights for policy 1, policy_version 84302 (0.0008) [2023-10-08 03:18:46,374][52060] Updated weights for policy 0, policy_version 83250 (0.0008) [2023-10-08 03:18:46,583][52059] Updated weights for policy 1, policy_version 84312 (0.0008) [2023-10-08 03:18:46,731][52060] Updated weights for policy 0, policy_version 83260 (0.0007) [2023-10-08 03:18:46,867][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000084320_86343680.pth... [2023-10-08 03:18:46,876][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000083264_85262336.pth... [2023-10-08 03:18:46,902][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000082688_84672512.pth [2023-10-08 03:18:46,906][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000081664_83623936.pth [2023-10-08 03:18:50,382][52059] Updated weights for policy 1, policy_version 84322 (0.0008) [2023-10-08 03:18:50,747][52059] Updated weights for policy 1, policy_version 84332 (0.0007) [2023-10-08 03:18:50,800][52060] Updated weights for policy 0, policy_version 83270 (0.0007) [2023-10-08 03:18:51,110][52059] Updated weights for policy 1, policy_version 84342 (0.0007) [2023-10-08 03:18:51,165][52060] Updated weights for policy 0, policy_version 83280 (0.0007) [2023-10-08 03:18:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 171606016. Throughput: 0: 1710.1, 1: 1737.8. Samples: 42913678. Policy #0 lag: (min: 9.0, avg: 12.9, max: 41.0) [2023-10-08 03:18:51,211][50642] Avg episode reward: [(0, '17.800'), (1, '24.120')] [2023-10-08 03:18:51,463][52059] Updated weights for policy 1, policy_version 84352 (0.0009) [2023-10-08 03:18:51,537][52060] Updated weights for policy 0, policy_version 83290 (0.0007) [2023-10-08 03:18:55,352][52059] Updated weights for policy 1, policy_version 84362 (0.0008) [2023-10-08 03:18:55,449][52060] Updated weights for policy 0, policy_version 83300 (0.0007) [2023-10-08 03:18:55,712][52059] Updated weights for policy 1, policy_version 84372 (0.0008) [2023-10-08 03:18:55,816][52060] Updated weights for policy 0, policy_version 83310 (0.0008) [2023-10-08 03:18:56,081][52059] Updated weights for policy 1, policy_version 84382 (0.0009) [2023-10-08 03:18:56,186][52060] Updated weights for policy 0, policy_version 83320 (0.0007) [2023-10-08 03:18:56,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 171704320. Throughput: 0: 1710.1, 1: 1751.8. Samples: 42935408. Policy #0 lag: (min: 9.0, avg: 12.9, max: 41.0) [2023-10-08 03:18:56,211][50642] Avg episode reward: [(0, '19.390'), (1, '24.150')] [2023-10-08 03:19:00,022][52059] Updated weights for policy 1, policy_version 84392 (0.0007) [2023-10-08 03:19:00,063][52060] Updated weights for policy 0, policy_version 83330 (0.0008) [2023-10-08 03:19:00,383][52059] Updated weights for policy 1, policy_version 84402 (0.0008) [2023-10-08 03:19:00,436][52060] Updated weights for policy 0, policy_version 83340 (0.0009) [2023-10-08 03:19:00,743][52059] Updated weights for policy 1, policy_version 84412 (0.0008) [2023-10-08 03:19:00,803][52060] Updated weights for policy 0, policy_version 83350 (0.0009) [2023-10-08 03:19:01,164][52060] Updated weights for policy 0, policy_version 83360 (0.0009) [2023-10-08 03:19:01,210][50642] Fps is (10 sec: 19661.0, 60 sec: 14199.4, 300 sec: 13884.8). Total num frames: 171802624. Throughput: 0: 1691.9, 1: 1725.6. Samples: 42954546. Policy #0 lag: (min: 31.0, avg: 32.8, max: 60.0) [2023-10-08 03:19:01,211][50642] Avg episode reward: [(0, '21.910'), (1, '26.100')] [2023-10-08 03:19:04,694][52059] Updated weights for policy 1, policy_version 84422 (0.0009) [2023-10-08 03:19:05,008][52060] Updated weights for policy 0, policy_version 83370 (0.0008) [2023-10-08 03:19:05,064][52059] Updated weights for policy 1, policy_version 84432 (0.0009) [2023-10-08 03:19:05,378][52060] Updated weights for policy 0, policy_version 83380 (0.0008) [2023-10-08 03:19:05,425][52059] Updated weights for policy 1, policy_version 84442 (0.0009) [2023-10-08 03:19:05,750][52060] Updated weights for policy 0, policy_version 83390 (0.0008) [2023-10-08 03:19:06,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 171868160. Throughput: 0: 1714.8, 1: 1747.7. Samples: 42966162. Policy #0 lag: (min: 31.0, avg: 32.8, max: 60.0) [2023-10-08 03:19:06,211][50642] Avg episode reward: [(0, '19.600'), (1, '26.270')] [2023-10-08 03:19:09,163][52059] Updated weights for policy 1, policy_version 84452 (0.0008) [2023-10-08 03:19:09,524][52059] Updated weights for policy 1, policy_version 84462 (0.0009) [2023-10-08 03:19:09,698][52060] Updated weights for policy 0, policy_version 83400 (0.0008) [2023-10-08 03:19:09,892][52059] Updated weights for policy 1, policy_version 84472 (0.0008) [2023-10-08 03:19:10,068][52060] Updated weights for policy 0, policy_version 83410 (0.0010) [2023-10-08 03:19:10,438][52060] Updated weights for policy 0, policy_version 83420 (0.0008) [2023-10-08 03:19:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 171933696. Throughput: 0: 1702.0, 1: 1733.3. Samples: 42986332. Policy #0 lag: (min: 31.0, avg: 32.8, max: 60.0) [2023-10-08 03:19:11,211][50642] Avg episode reward: [(0, '18.000'), (1, '26.540')] [2023-10-08 03:19:13,840][52059] Updated weights for policy 1, policy_version 84482 (0.0008) [2023-10-08 03:19:14,203][52059] Updated weights for policy 1, policy_version 84492 (0.0010) [2023-10-08 03:19:14,511][52060] Updated weights for policy 0, policy_version 83430 (0.0009) [2023-10-08 03:19:14,567][52059] Updated weights for policy 1, policy_version 84502 (0.0007) [2023-10-08 03:19:14,877][52060] Updated weights for policy 0, policy_version 83440 (0.0008) [2023-10-08 03:19:14,929][52059] Updated weights for policy 1, policy_version 84512 (0.0009) [2023-10-08 03:19:15,243][52060] Updated weights for policy 0, policy_version 83450 (0.0010) [2023-10-08 03:19:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 171999232. Throughput: 0: 1684.3, 1: 1728.2. Samples: 43006282. Policy #0 lag: (min: 31.0, avg: 32.8, max: 60.0) [2023-10-08 03:19:16,211][50642] Avg episode reward: [(0, '19.380'), (1, '26.050')] [2023-10-08 03:19:18,888][52059] Updated weights for policy 1, policy_version 84522 (0.0010) [2023-10-08 03:19:19,254][52059] Updated weights for policy 1, policy_version 84532 (0.0009) [2023-10-08 03:19:19,353][52060] Updated weights for policy 0, policy_version 83460 (0.0007) [2023-10-08 03:19:19,621][52059] Updated weights for policy 1, policy_version 84542 (0.0007) [2023-10-08 03:19:19,741][52060] Updated weights for policy 0, policy_version 83470 (0.0009) [2023-10-08 03:19:20,106][52060] Updated weights for policy 0, policy_version 83480 (0.0008) [2023-10-08 03:19:21,211][50642] Fps is (10 sec: 13106.7, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 172064768. Throughput: 0: 1715.8, 1: 1747.9. Samples: 43017836. Policy #0 lag: (min: 31.0, avg: 32.8, max: 60.0) [2023-10-08 03:19:21,212][50642] Avg episode reward: [(0, '21.260'), (1, '25.220')] [2023-10-08 03:19:23,398][52059] Updated weights for policy 1, policy_version 84552 (0.0008) [2023-10-08 03:19:23,767][52059] Updated weights for policy 1, policy_version 84562 (0.0007) [2023-10-08 03:19:23,953][52060] Updated weights for policy 0, policy_version 83490 (0.0008) [2023-10-08 03:19:24,131][52059] Updated weights for policy 1, policy_version 84572 (0.0009) [2023-10-08 03:19:24,319][52060] Updated weights for policy 0, policy_version 83500 (0.0010) [2023-10-08 03:19:24,691][52060] Updated weights for policy 0, policy_version 83510 (0.0010) [2023-10-08 03:19:25,058][52060] Updated weights for policy 0, policy_version 83520 (0.0010) [2023-10-08 03:19:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 172130304. Throughput: 0: 1693.0, 1: 1729.2. Samples: 43037080. Policy #0 lag: (min: 31.0, avg: 32.8, max: 60.0) [2023-10-08 03:19:26,211][50642] Avg episode reward: [(0, '20.220'), (1, '26.710')] [2023-10-08 03:19:28,076][52059] Updated weights for policy 1, policy_version 84582 (0.0010) [2023-10-08 03:19:28,439][52059] Updated weights for policy 1, policy_version 84592 (0.0008) [2023-10-08 03:19:28,796][52059] Updated weights for policy 1, policy_version 84602 (0.0007) [2023-10-08 03:19:29,052][52060] Updated weights for policy 0, policy_version 83530 (0.0008) [2023-10-08 03:19:29,414][52060] Updated weights for policy 0, policy_version 83540 (0.0007) [2023-10-08 03:19:29,785][52060] Updated weights for policy 0, policy_version 83550 (0.0007) [2023-10-08 03:19:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 172195840. Throughput: 0: 1690.8, 1: 1731.4. Samples: 43058034. Policy #0 lag: (min: 31.0, avg: 32.8, max: 60.0) [2023-10-08 03:19:31,211][50642] Avg episode reward: [(0, '19.430'), (1, '27.260')] [2023-10-08 03:19:32,933][52059] Updated weights for policy 1, policy_version 84612 (0.0007) [2023-10-08 03:19:33,294][52059] Updated weights for policy 1, policy_version 84622 (0.0007) [2023-10-08 03:19:33,650][52059] Updated weights for policy 1, policy_version 84632 (0.0007) [2023-10-08 03:19:33,755][52060] Updated weights for policy 0, policy_version 83560 (0.0008) [2023-10-08 03:19:34,122][52060] Updated weights for policy 0, policy_version 83570 (0.0008) [2023-10-08 03:19:34,502][52060] Updated weights for policy 0, policy_version 83580 (0.0008) [2023-10-08 03:19:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 172261376. Throughput: 0: 1711.5, 1: 1728.7. Samples: 43068484. Policy #0 lag: (min: 31.0, avg: 32.8, max: 60.0) [2023-10-08 03:19:36,211][50642] Avg episode reward: [(0, '21.600'), (1, '24.040')] [2023-10-08 03:19:37,318][52059] Updated weights for policy 1, policy_version 84642 (0.0007) [2023-10-08 03:19:37,693][52059] Updated weights for policy 1, policy_version 84652 (0.0009) [2023-10-08 03:19:38,060][52059] Updated weights for policy 1, policy_version 84662 (0.0008) [2023-10-08 03:19:38,395][52060] Updated weights for policy 0, policy_version 83590 (0.0009) [2023-10-08 03:19:38,423][52059] Updated weights for policy 1, policy_version 84672 (0.0007) [2023-10-08 03:19:38,758][52060] Updated weights for policy 0, policy_version 83600 (0.0008) [2023-10-08 03:19:39,114][52060] Updated weights for policy 0, policy_version 83610 (0.0009) [2023-10-08 03:19:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 172326912. Throughput: 0: 1693.4, 1: 1724.4. Samples: 43089212. Policy #0 lag: (min: 31.0, avg: 32.8, max: 60.0) [2023-10-08 03:19:41,211][50642] Avg episode reward: [(0, '21.860'), (1, '22.710')] [2023-10-08 03:19:42,508][52059] Updated weights for policy 1, policy_version 84682 (0.0009) [2023-10-08 03:19:42,878][52059] Updated weights for policy 1, policy_version 84692 (0.0008) [2023-10-08 03:19:43,236][52060] Updated weights for policy 0, policy_version 83620 (0.0010) [2023-10-08 03:19:43,240][52059] Updated weights for policy 1, policy_version 84702 (0.0008) [2023-10-08 03:19:43,608][52060] Updated weights for policy 0, policy_version 83630 (0.0007) [2023-10-08 03:19:43,974][52060] Updated weights for policy 0, policy_version 83640 (0.0007) [2023-10-08 03:19:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 172392448. Throughput: 0: 1713.1, 1: 1759.2. Samples: 43110798. Policy #0 lag: (min: 31.0, avg: 32.8, max: 60.0) [2023-10-08 03:19:46,211][50642] Avg episode reward: [(0, '20.390'), (1, '25.090')] [2023-10-08 03:19:47,046][52059] Updated weights for policy 1, policy_version 84712 (0.0008) [2023-10-08 03:19:47,406][52059] Updated weights for policy 1, policy_version 84722 (0.0007) [2023-10-08 03:19:47,770][52059] Updated weights for policy 1, policy_version 84732 (0.0008) [2023-10-08 03:19:47,953][52060] Updated weights for policy 0, policy_version 83650 (0.0008) [2023-10-08 03:19:48,320][52060] Updated weights for policy 0, policy_version 83660 (0.0009) [2023-10-08 03:19:48,684][52060] Updated weights for policy 0, policy_version 83670 (0.0007) [2023-10-08 03:19:49,050][52060] Updated weights for policy 0, policy_version 83680 (0.0008) [2023-10-08 03:19:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 172457984. Throughput: 0: 1699.2, 1: 1732.0. Samples: 43120568. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 03:19:51,211][50642] Avg episode reward: [(0, '19.320'), (1, '26.750')] [2023-10-08 03:19:51,695][52059] Updated weights for policy 1, policy_version 84742 (0.0009) [2023-10-08 03:19:52,059][52059] Updated weights for policy 1, policy_version 84752 (0.0009) [2023-10-08 03:19:52,422][52059] Updated weights for policy 1, policy_version 84762 (0.0008) [2023-10-08 03:19:53,047][52060] Updated weights for policy 0, policy_version 83690 (0.0008) [2023-10-08 03:19:53,413][52060] Updated weights for policy 0, policy_version 83700 (0.0009) [2023-10-08 03:19:53,787][52060] Updated weights for policy 0, policy_version 83710 (0.0009) [2023-10-08 03:19:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 172523520. Throughput: 0: 1703.5, 1: 1747.9. Samples: 43141644. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 03:19:56,211][50642] Avg episode reward: [(0, '22.690'), (1, '25.910')] [2023-10-08 03:19:56,375][52059] Updated weights for policy 1, policy_version 84772 (0.0007) [2023-10-08 03:19:56,741][52059] Updated weights for policy 1, policy_version 84782 (0.0009) [2023-10-08 03:19:57,099][52059] Updated weights for policy 1, policy_version 84792 (0.0008) [2023-10-08 03:19:57,849][52060] Updated weights for policy 0, policy_version 83720 (0.0007) [2023-10-08 03:19:58,215][52060] Updated weights for policy 0, policy_version 83730 (0.0008) [2023-10-08 03:19:58,583][52060] Updated weights for policy 0, policy_version 83740 (0.0008) [2023-10-08 03:20:00,884][52059] Updated weights for policy 1, policy_version 84802 (0.0010) [2023-10-08 03:20:01,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 172589056. Throughput: 0: 1724.4, 1: 1754.0. Samples: 43162806. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 03:20:01,211][50642] Avg episode reward: [(0, '21.200'), (1, '21.800')] [2023-10-08 03:20:01,241][52059] Updated weights for policy 1, policy_version 84812 (0.0007) [2023-10-08 03:20:01,614][52059] Updated weights for policy 1, policy_version 84822 (0.0007) [2023-10-08 03:20:01,970][52059] Updated weights for policy 1, policy_version 84832 (0.0008) [2023-10-08 03:20:02,569][52060] Updated weights for policy 0, policy_version 83750 (0.0009) [2023-10-08 03:20:02,938][52060] Updated weights for policy 0, policy_version 83760 (0.0010) [2023-10-08 03:20:03,297][52060] Updated weights for policy 0, policy_version 83770 (0.0011) [2023-10-08 03:20:05,817][52059] Updated weights for policy 1, policy_version 84842 (0.0009) [2023-10-08 03:20:06,186][52059] Updated weights for policy 1, policy_version 84852 (0.0009) [2023-10-08 03:20:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 172654592. Throughput: 0: 1692.0, 1: 1739.1. Samples: 43172232. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 03:20:06,211][50642] Avg episode reward: [(0, '19.810'), (1, '22.690')] [2023-10-08 03:20:06,557][52059] Updated weights for policy 1, policy_version 84862 (0.0008) [2023-10-08 03:20:07,228][52060] Updated weights for policy 0, policy_version 83780 (0.0009) [2023-10-08 03:20:07,621][52060] Updated weights for policy 0, policy_version 83790 (0.0008) [2023-10-08 03:20:07,994][52060] Updated weights for policy 0, policy_version 83800 (0.0008) [2023-10-08 03:20:10,392][52059] Updated weights for policy 1, policy_version 84872 (0.0008) [2023-10-08 03:20:10,753][52059] Updated weights for policy 1, policy_version 84882 (0.0010) [2023-10-08 03:20:11,125][52059] Updated weights for policy 1, policy_version 84892 (0.0009) [2023-10-08 03:20:11,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 172720128. Throughput: 0: 1719.2, 1: 1762.6. Samples: 43193760. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 03:20:11,211][50642] Avg episode reward: [(0, '21.250'), (1, '23.740')] [2023-10-08 03:20:11,788][52060] Updated weights for policy 0, policy_version 83810 (0.0009) [2023-10-08 03:20:12,159][52060] Updated weights for policy 0, policy_version 83820 (0.0007) [2023-10-08 03:20:12,533][52060] Updated weights for policy 0, policy_version 83830 (0.0007) [2023-10-08 03:20:12,900][52060] Updated weights for policy 0, policy_version 83840 (0.0008) [2023-10-08 03:20:15,134][52059] Updated weights for policy 1, policy_version 84902 (0.0008) [2023-10-08 03:20:15,495][52059] Updated weights for policy 1, policy_version 84912 (0.0008) [2023-10-08 03:20:15,863][52059] Updated weights for policy 1, policy_version 84922 (0.0007) [2023-10-08 03:20:16,210][50642] Fps is (10 sec: 16384.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 172818432. Throughput: 0: 1733.7, 1: 1736.0. Samples: 43214168. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 03:20:16,211][50642] Avg episode reward: [(0, '20.530'), (1, '30.140')] [2023-10-08 03:20:16,219][51710] Saving new best policy, reward=30.140! [2023-10-08 03:20:16,648][52060] Updated weights for policy 0, policy_version 83850 (0.0011) [2023-10-08 03:20:17,011][52060] Updated weights for policy 0, policy_version 83860 (0.0007) [2023-10-08 03:20:17,383][52060] Updated weights for policy 0, policy_version 83870 (0.0011) [2023-10-08 03:20:19,839][52059] Updated weights for policy 1, policy_version 84932 (0.0007) [2023-10-08 03:20:20,201][52059] Updated weights for policy 1, policy_version 84942 (0.0009) [2023-10-08 03:20:20,562][52059] Updated weights for policy 1, policy_version 84952 (0.0010) [2023-10-08 03:20:21,210][50642] Fps is (10 sec: 16384.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 172883968. Throughput: 0: 1709.3, 1: 1757.8. Samples: 43224506. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 03:20:21,211][50642] Avg episode reward: [(0, '16.750'), (1, '26.030')] [2023-10-08 03:20:21,349][52060] Updated weights for policy 0, policy_version 83880 (0.0009) [2023-10-08 03:20:21,721][52060] Updated weights for policy 0, policy_version 83890 (0.0008) [2023-10-08 03:20:22,081][52060] Updated weights for policy 0, policy_version 83900 (0.0007) [2023-10-08 03:20:24,478][52059] Updated weights for policy 1, policy_version 84962 (0.0009) [2023-10-08 03:20:24,830][52059] Updated weights for policy 1, policy_version 84972 (0.0009) [2023-10-08 03:20:25,197][52059] Updated weights for policy 1, policy_version 84982 (0.0008) [2023-10-08 03:20:25,564][52059] Updated weights for policy 1, policy_version 84992 (0.0010) [2023-10-08 03:20:26,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 172949504. Throughput: 0: 1730.6, 1: 1744.0. Samples: 43245566. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 03:20:26,211][50642] Avg episode reward: [(0, '17.170'), (1, '23.140')] [2023-10-08 03:20:26,220][52060] Updated weights for policy 0, policy_version 83910 (0.0010) [2023-10-08 03:20:26,579][52060] Updated weights for policy 0, policy_version 83920 (0.0011) [2023-10-08 03:20:26,949][52060] Updated weights for policy 0, policy_version 83930 (0.0011) [2023-10-08 03:20:29,434][52059] Updated weights for policy 1, policy_version 85002 (0.0009) [2023-10-08 03:20:29,803][52059] Updated weights for policy 1, policy_version 85012 (0.0007) [2023-10-08 03:20:30,170][52059] Updated weights for policy 1, policy_version 85022 (0.0007) [2023-10-08 03:20:31,013][52060] Updated weights for policy 0, policy_version 83940 (0.0010) [2023-10-08 03:20:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 173015040. Throughput: 0: 1730.3, 1: 1717.5. Samples: 43265946. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 03:20:31,211][50642] Avg episode reward: [(0, '18.480'), (1, '24.090')] [2023-10-08 03:20:31,380][52060] Updated weights for policy 0, policy_version 83950 (0.0008) [2023-10-08 03:20:31,755][52060] Updated weights for policy 0, policy_version 83960 (0.0010) [2023-10-08 03:20:34,085][52059] Updated weights for policy 1, policy_version 85032 (0.0008) [2023-10-08 03:20:34,441][52059] Updated weights for policy 1, policy_version 85042 (0.0009) [2023-10-08 03:20:34,807][52059] Updated weights for policy 1, policy_version 85052 (0.0007) [2023-10-08 03:20:35,557][52060] Updated weights for policy 0, policy_version 83970 (0.0009) [2023-10-08 03:20:35,929][52060] Updated weights for policy 0, policy_version 83980 (0.0010) [2023-10-08 03:20:36,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 173080576. Throughput: 0: 1721.0, 1: 1746.1. Samples: 43276590. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 03:20:36,211][50642] Avg episode reward: [(0, '22.130'), (1, '26.780')] [2023-10-08 03:20:36,297][52060] Updated weights for policy 0, policy_version 83990 (0.0009) [2023-10-08 03:20:36,667][52060] Updated weights for policy 0, policy_version 84000 (0.0008) [2023-10-08 03:20:38,788][52059] Updated weights for policy 1, policy_version 85062 (0.0008) [2023-10-08 03:20:39,160][52059] Updated weights for policy 1, policy_version 85072 (0.0009) [2023-10-08 03:20:39,520][52059] Updated weights for policy 1, policy_version 85082 (0.0008) [2023-10-08 03:20:40,619][52060] Updated weights for policy 0, policy_version 84010 (0.0010) [2023-10-08 03:20:40,984][52060] Updated weights for policy 0, policy_version 84020 (0.0007) [2023-10-08 03:20:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 173146112. Throughput: 0: 1731.6, 1: 1716.8. Samples: 43296822. Policy #0 lag: (min: 15.0, avg: 16.6, max: 42.0) [2023-10-08 03:20:41,211][50642] Avg episode reward: [(0, '16.350'), (1, '26.300')] [2023-10-08 03:20:41,349][52060] Updated weights for policy 0, policy_version 84030 (0.0011) [2023-10-08 03:20:43,400][52059] Updated weights for policy 1, policy_version 85092 (0.0009) [2023-10-08 03:20:43,767][52059] Updated weights for policy 1, policy_version 85102 (0.0009) [2023-10-08 03:20:44,123][52059] Updated weights for policy 1, policy_version 85112 (0.0011) [2023-10-08 03:20:45,332][52060] Updated weights for policy 0, policy_version 84040 (0.0008) [2023-10-08 03:20:45,698][52060] Updated weights for policy 0, policy_version 84050 (0.0008) [2023-10-08 03:20:46,057][52060] Updated weights for policy 0, policy_version 84060 (0.0007) [2023-10-08 03:20:46,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 173244416. Throughput: 0: 1714.7, 1: 1720.4. Samples: 43317386. Policy #0 lag: (min: 22.0, avg: 29.7, max: 54.0) [2023-10-08 03:20:46,211][50642] Avg episode reward: [(0, '17.790'), (1, '22.500')] [2023-10-08 03:20:46,220][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000085120_87162880.pth... [2023-10-08 03:20:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000084064_86081536.pth... [2023-10-08 03:20:46,258][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000083488_85491712.pth [2023-10-08 03:20:46,261][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000082464_84443136.pth [2023-10-08 03:20:48,144][52059] Updated weights for policy 1, policy_version 85122 (0.0009) [2023-10-08 03:20:48,515][52059] Updated weights for policy 1, policy_version 85132 (0.0008) [2023-10-08 03:20:48,873][52059] Updated weights for policy 1, policy_version 85142 (0.0010) [2023-10-08 03:20:49,241][52059] Updated weights for policy 1, policy_version 85152 (0.0010) [2023-10-08 03:20:50,012][52060] Updated weights for policy 0, policy_version 84070 (0.0007) [2023-10-08 03:20:50,384][52060] Updated weights for policy 0, policy_version 84080 (0.0009) [2023-10-08 03:20:50,752][52060] Updated weights for policy 0, policy_version 84090 (0.0010) [2023-10-08 03:20:51,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 173309952. Throughput: 0: 1735.7, 1: 1728.1. Samples: 43328104. Policy #0 lag: (min: 22.0, avg: 29.7, max: 54.0) [2023-10-08 03:20:51,211][50642] Avg episode reward: [(0, '18.650'), (1, '22.680')] [2023-10-08 03:20:53,247][52059] Updated weights for policy 1, policy_version 85162 (0.0008) [2023-10-08 03:20:53,613][52059] Updated weights for policy 1, policy_version 85172 (0.0010) [2023-10-08 03:20:53,973][52059] Updated weights for policy 1, policy_version 85182 (0.0007) [2023-10-08 03:20:54,758][52060] Updated weights for policy 0, policy_version 84100 (0.0009) [2023-10-08 03:20:55,140][52060] Updated weights for policy 0, policy_version 84110 (0.0008) [2023-10-08 03:20:55,507][52060] Updated weights for policy 0, policy_version 84120 (0.0010) [2023-10-08 03:20:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 173375488. Throughput: 0: 1728.8, 1: 1713.6. Samples: 43348664. Policy #0 lag: (min: 22.0, avg: 29.7, max: 54.0) [2023-10-08 03:20:56,211][50642] Avg episode reward: [(0, '22.330'), (1, '25.870')] [2023-10-08 03:20:57,961][52059] Updated weights for policy 1, policy_version 85192 (0.0010) [2023-10-08 03:20:58,339][52059] Updated weights for policy 1, policy_version 85202 (0.0008) [2023-10-08 03:20:58,693][52059] Updated weights for policy 1, policy_version 85212 (0.0007) [2023-10-08 03:20:59,523][52060] Updated weights for policy 0, policy_version 84130 (0.0010) [2023-10-08 03:20:59,895][52060] Updated weights for policy 0, policy_version 84140 (0.0009) [2023-10-08 03:21:00,269][52060] Updated weights for policy 0, policy_version 84150 (0.0011) [2023-10-08 03:21:00,631][52060] Updated weights for policy 0, policy_version 84160 (0.0008) [2023-10-08 03:21:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 173441024. Throughput: 0: 1693.7, 1: 1736.6. Samples: 43368532. Policy #0 lag: (min: 22.0, avg: 29.7, max: 54.0) [2023-10-08 03:21:01,211][50642] Avg episode reward: [(0, '18.610'), (1, '23.770')] [2023-10-08 03:21:02,631][52059] Updated weights for policy 1, policy_version 85222 (0.0009) [2023-10-08 03:21:02,998][52059] Updated weights for policy 1, policy_version 85232 (0.0007) [2023-10-08 03:21:03,358][52059] Updated weights for policy 1, policy_version 85242 (0.0007) [2023-10-08 03:21:04,784][52060] Updated weights for policy 0, policy_version 84170 (0.0012) [2023-10-08 03:21:05,144][52060] Updated weights for policy 0, policy_version 84180 (0.0009) [2023-10-08 03:21:05,511][52060] Updated weights for policy 0, policy_version 84190 (0.0010) [2023-10-08 03:21:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 173506560. Throughput: 0: 1723.7, 1: 1714.8. Samples: 43379240. Policy #0 lag: (min: 22.0, avg: 29.7, max: 54.0) [2023-10-08 03:21:06,211][50642] Avg episode reward: [(0, '19.750'), (1, '20.990')] [2023-10-08 03:21:07,169][52059] Updated weights for policy 1, policy_version 85252 (0.0007) [2023-10-08 03:21:07,534][52059] Updated weights for policy 1, policy_version 85262 (0.0007) [2023-10-08 03:21:07,899][52059] Updated weights for policy 1, policy_version 85272 (0.0009) [2023-10-08 03:21:09,378][52060] Updated weights for policy 0, policy_version 84200 (0.0009) [2023-10-08 03:21:09,746][52060] Updated weights for policy 0, policy_version 84210 (0.0008) [2023-10-08 03:21:10,113][52060] Updated weights for policy 0, policy_version 84220 (0.0007) [2023-10-08 03:21:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 173572096. Throughput: 0: 1704.1, 1: 1724.8. Samples: 43399864. Policy #0 lag: (min: 22.0, avg: 29.7, max: 54.0) [2023-10-08 03:21:11,211][50642] Avg episode reward: [(0, '20.690'), (1, '14.340')] [2023-10-08 03:21:11,911][52059] Updated weights for policy 1, policy_version 85282 (0.0009) [2023-10-08 03:21:12,282][52059] Updated weights for policy 1, policy_version 85292 (0.0009) [2023-10-08 03:21:12,644][52059] Updated weights for policy 1, policy_version 85302 (0.0007) [2023-10-08 03:21:13,011][52059] Updated weights for policy 1, policy_version 85312 (0.0007) [2023-10-08 03:21:14,021][52060] Updated weights for policy 0, policy_version 84230 (0.0010) [2023-10-08 03:21:14,385][52060] Updated weights for policy 0, policy_version 84240 (0.0010) [2023-10-08 03:21:14,759][52060] Updated weights for policy 0, policy_version 84250 (0.0009) [2023-10-08 03:21:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 173637632. Throughput: 0: 1693.6, 1: 1746.2. Samples: 43420740. Policy #0 lag: (min: 22.0, avg: 29.7, max: 54.0) [2023-10-08 03:21:16,211][50642] Avg episode reward: [(0, '20.540'), (1, '13.560')] [2023-10-08 03:21:16,862][52059] Updated weights for policy 1, policy_version 85322 (0.0008) [2023-10-08 03:21:17,219][52059] Updated weights for policy 1, policy_version 85332 (0.0007) [2023-10-08 03:21:17,589][52059] Updated weights for policy 1, policy_version 85342 (0.0008) [2023-10-08 03:21:18,679][52060] Updated weights for policy 0, policy_version 84260 (0.0007) [2023-10-08 03:21:19,057][52060] Updated weights for policy 0, policy_version 84270 (0.0009) [2023-10-08 03:21:19,427][52060] Updated weights for policy 0, policy_version 84280 (0.0009) [2023-10-08 03:21:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 173703168. Throughput: 0: 1715.8, 1: 1718.2. Samples: 43431122. Policy #0 lag: (min: 22.0, avg: 29.7, max: 54.0) [2023-10-08 03:21:21,211][50642] Avg episode reward: [(0, '20.630'), (1, '14.430')] [2023-10-08 03:21:21,567][52059] Updated weights for policy 1, policy_version 85352 (0.0010) [2023-10-08 03:21:21,936][52059] Updated weights for policy 1, policy_version 85362 (0.0008) [2023-10-08 03:21:22,291][52059] Updated weights for policy 1, policy_version 85372 (0.0010) [2023-10-08 03:21:23,373][52060] Updated weights for policy 0, policy_version 84290 (0.0010) [2023-10-08 03:21:23,744][52060] Updated weights for policy 0, policy_version 84300 (0.0010) [2023-10-08 03:21:24,116][52060] Updated weights for policy 0, policy_version 84310 (0.0007) [2023-10-08 03:21:24,484][52060] Updated weights for policy 0, policy_version 84320 (0.0008) [2023-10-08 03:21:26,122][52059] Updated weights for policy 1, policy_version 85382 (0.0008) [2023-10-08 03:21:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 173768704. Throughput: 0: 1693.6, 1: 1750.7. Samples: 43451818. Policy #0 lag: (min: 22.0, avg: 29.7, max: 54.0) [2023-10-08 03:21:26,211][50642] Avg episode reward: [(0, '19.720'), (1, '20.290')] [2023-10-08 03:21:26,490][52059] Updated weights for policy 1, policy_version 85392 (0.0008) [2023-10-08 03:21:26,851][52059] Updated weights for policy 1, policy_version 85402 (0.0008) [2023-10-08 03:21:28,386][52060] Updated weights for policy 0, policy_version 84330 (0.0008) [2023-10-08 03:21:28,760][52060] Updated weights for policy 0, policy_version 84340 (0.0009) [2023-10-08 03:21:29,140][52060] Updated weights for policy 0, policy_version 84350 (0.0010) [2023-10-08 03:21:30,716][52059] Updated weights for policy 1, policy_version 85412 (0.0010) [2023-10-08 03:21:31,074][52059] Updated weights for policy 1, policy_version 85422 (0.0010) [2023-10-08 03:21:31,211][50642] Fps is (10 sec: 13106.7, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 173834240. Throughput: 0: 1720.1, 1: 1740.3. Samples: 43473106. Policy #0 lag: (min: 22.0, avg: 29.7, max: 54.0) [2023-10-08 03:21:31,212][50642] Avg episode reward: [(0, '21.040'), (1, '20.160')] [2023-10-08 03:21:31,433][52059] Updated weights for policy 1, policy_version 85432 (0.0011) [2023-10-08 03:21:32,929][52060] Updated weights for policy 0, policy_version 84360 (0.0008) [2023-10-08 03:21:33,302][52060] Updated weights for policy 0, policy_version 84370 (0.0007) [2023-10-08 03:21:33,675][52060] Updated weights for policy 0, policy_version 84380 (0.0007) [2023-10-08 03:21:35,345][52059] Updated weights for policy 1, policy_version 85442 (0.0009) [2023-10-08 03:21:35,708][52059] Updated weights for policy 1, policy_version 85452 (0.0008) [2023-10-08 03:21:36,077][52059] Updated weights for policy 1, policy_version 85462 (0.0007) [2023-10-08 03:21:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 173899776. Throughput: 0: 1702.4, 1: 1737.8. Samples: 43482916. Policy #0 lag: (min: 24.0, avg: 45.3, max: 56.0) [2023-10-08 03:21:36,211][50642] Avg episode reward: [(0, '20.010'), (1, '19.390')] [2023-10-08 03:21:36,434][52059] Updated weights for policy 1, policy_version 85472 (0.0007) [2023-10-08 03:21:37,816][52060] Updated weights for policy 0, policy_version 84390 (0.0009) [2023-10-08 03:21:38,183][52060] Updated weights for policy 0, policy_version 84400 (0.0008) [2023-10-08 03:21:38,550][52060] Updated weights for policy 0, policy_version 84410 (0.0007) [2023-10-08 03:21:40,430][52059] Updated weights for policy 1, policy_version 85482 (0.0010) [2023-10-08 03:21:40,794][52059] Updated weights for policy 1, policy_version 85492 (0.0008) [2023-10-08 03:21:41,162][52059] Updated weights for policy 1, policy_version 85502 (0.0009) [2023-10-08 03:21:41,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 173965312. Throughput: 0: 1701.6, 1: 1751.4. Samples: 43504050. Policy #0 lag: (min: 24.0, avg: 45.3, max: 56.0) [2023-10-08 03:21:41,211][50642] Avg episode reward: [(0, '20.590'), (1, '21.470')] [2023-10-08 03:21:42,644][52060] Updated weights for policy 0, policy_version 84420 (0.0009) [2023-10-08 03:21:43,028][52060] Updated weights for policy 0, policy_version 84430 (0.0010) [2023-10-08 03:21:43,404][52060] Updated weights for policy 0, policy_version 84440 (0.0008) [2023-10-08 03:21:45,098][52059] Updated weights for policy 1, policy_version 85512 (0.0009) [2023-10-08 03:21:45,482][52059] Updated weights for policy 1, policy_version 85522 (0.0008) [2023-10-08 03:21:45,840][52059] Updated weights for policy 1, policy_version 85532 (0.0008) [2023-10-08 03:21:46,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 174063616. Throughput: 0: 1729.4, 1: 1731.1. Samples: 43524254. Policy #0 lag: (min: 24.0, avg: 45.3, max: 56.0) [2023-10-08 03:21:46,211][50642] Avg episode reward: [(0, '20.250'), (1, '24.730')] [2023-10-08 03:21:47,321][52060] Updated weights for policy 0, policy_version 84450 (0.0007) [2023-10-08 03:21:47,697][52060] Updated weights for policy 0, policy_version 84460 (0.0009) [2023-10-08 03:21:48,059][52060] Updated weights for policy 0, policy_version 84470 (0.0008) [2023-10-08 03:21:48,439][52060] Updated weights for policy 0, policy_version 84480 (0.0008) [2023-10-08 03:21:49,554][52059] Updated weights for policy 1, policy_version 85542 (0.0010) [2023-10-08 03:21:49,921][52059] Updated weights for policy 1, policy_version 85552 (0.0011) [2023-10-08 03:21:50,289][52059] Updated weights for policy 1, policy_version 85562 (0.0009) [2023-10-08 03:21:51,210][50642] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 174129152. Throughput: 0: 1701.0, 1: 1756.9. Samples: 43534846. Policy #0 lag: (min: 24.0, avg: 45.3, max: 56.0) [2023-10-08 03:21:51,211][50642] Avg episode reward: [(0, '22.790'), (1, '25.990')] [2023-10-08 03:21:52,634][52060] Updated weights for policy 0, policy_version 84490 (0.0010) [2023-10-08 03:21:52,998][52060] Updated weights for policy 0, policy_version 84500 (0.0010) [2023-10-08 03:21:53,361][52060] Updated weights for policy 0, policy_version 84510 (0.0010) [2023-10-08 03:21:54,275][52059] Updated weights for policy 1, policy_version 85572 (0.0008) [2023-10-08 03:21:54,635][52059] Updated weights for policy 1, policy_version 85582 (0.0007) [2023-10-08 03:21:54,991][52059] Updated weights for policy 1, policy_version 85592 (0.0008) [2023-10-08 03:21:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 174194688. Throughput: 0: 1710.3, 1: 1741.3. Samples: 43555186. Policy #0 lag: (min: 24.0, avg: 45.3, max: 56.0) [2023-10-08 03:21:56,211][50642] Avg episode reward: [(0, '20.280'), (1, '20.670')] [2023-10-08 03:21:57,442][52060] Updated weights for policy 0, policy_version 84520 (0.0010) [2023-10-08 03:21:57,812][52060] Updated weights for policy 0, policy_version 84530 (0.0009) [2023-10-08 03:21:58,170][52060] Updated weights for policy 0, policy_version 84540 (0.0009) [2023-10-08 03:21:58,847][52059] Updated weights for policy 1, policy_version 85602 (0.0008) [2023-10-08 03:21:59,209][52059] Updated weights for policy 1, policy_version 85612 (0.0009) [2023-10-08 03:21:59,572][52059] Updated weights for policy 1, policy_version 85622 (0.0007) [2023-10-08 03:21:59,942][52059] Updated weights for policy 1, policy_version 85632 (0.0008) [2023-10-08 03:22:01,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 174260224. Throughput: 0: 1721.6, 1: 1733.1. Samples: 43576198. Policy #0 lag: (min: 24.0, avg: 45.3, max: 56.0) [2023-10-08 03:22:01,211][50642] Avg episode reward: [(0, '19.410'), (1, '20.460')] [2023-10-08 03:22:01,962][52060] Updated weights for policy 0, policy_version 84550 (0.0010) [2023-10-08 03:22:02,339][52060] Updated weights for policy 0, policy_version 84560 (0.0008) [2023-10-08 03:22:02,707][52060] Updated weights for policy 0, policy_version 84570 (0.0007) [2023-10-08 03:22:03,804][52059] Updated weights for policy 1, policy_version 85642 (0.0009) [2023-10-08 03:22:04,162][52059] Updated weights for policy 1, policy_version 85652 (0.0010) [2023-10-08 03:22:04,534][52059] Updated weights for policy 1, policy_version 85662 (0.0008) [2023-10-08 03:22:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 174325760. Throughput: 0: 1699.3, 1: 1750.9. Samples: 43586382. Policy #0 lag: (min: 24.0, avg: 45.3, max: 56.0) [2023-10-08 03:22:06,211][50642] Avg episode reward: [(0, '21.110'), (1, '22.730')] [2023-10-08 03:22:06,804][52060] Updated weights for policy 0, policy_version 84580 (0.0007) [2023-10-08 03:22:07,163][52060] Updated weights for policy 0, policy_version 84590 (0.0007) [2023-10-08 03:22:07,544][52060] Updated weights for policy 0, policy_version 84600 (0.0008) [2023-10-08 03:22:08,477][52059] Updated weights for policy 1, policy_version 85672 (0.0007) [2023-10-08 03:22:08,845][52059] Updated weights for policy 1, policy_version 85682 (0.0008) [2023-10-08 03:22:09,203][52059] Updated weights for policy 1, policy_version 85692 (0.0008) [2023-10-08 03:22:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 174391296. Throughput: 0: 1724.8, 1: 1724.7. Samples: 43607048. Policy #0 lag: (min: 24.0, avg: 45.3, max: 56.0) [2023-10-08 03:22:11,211][50642] Avg episode reward: [(0, '21.960'), (1, '24.650')] [2023-10-08 03:22:11,368][52060] Updated weights for policy 0, policy_version 84610 (0.0008) [2023-10-08 03:22:11,742][52060] Updated weights for policy 0, policy_version 84620 (0.0007) [2023-10-08 03:22:12,111][52060] Updated weights for policy 0, policy_version 84630 (0.0007) [2023-10-08 03:22:12,473][52060] Updated weights for policy 0, policy_version 84640 (0.0007) [2023-10-08 03:22:13,142][52059] Updated weights for policy 1, policy_version 85702 (0.0009) [2023-10-08 03:22:13,500][52059] Updated weights for policy 1, policy_version 85712 (0.0010) [2023-10-08 03:22:13,861][52059] Updated weights for policy 1, policy_version 85722 (0.0009) [2023-10-08 03:22:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 174456832. Throughput: 0: 1720.6, 1: 1733.6. Samples: 43628544. Policy #0 lag: (min: 24.0, avg: 45.3, max: 56.0) [2023-10-08 03:22:16,211][50642] Avg episode reward: [(0, '21.590'), (1, '20.090')] [2023-10-08 03:22:16,406][52060] Updated weights for policy 0, policy_version 84650 (0.0008) [2023-10-08 03:22:16,774][52060] Updated weights for policy 0, policy_version 84660 (0.0008) [2023-10-08 03:22:17,145][52060] Updated weights for policy 0, policy_version 84670 (0.0007) [2023-10-08 03:22:17,763][52059] Updated weights for policy 1, policy_version 85732 (0.0008) [2023-10-08 03:22:18,128][52059] Updated weights for policy 1, policy_version 85742 (0.0007) [2023-10-08 03:22:18,496][52059] Updated weights for policy 1, policy_version 85752 (0.0007) [2023-10-08 03:22:20,973][52060] Updated weights for policy 0, policy_version 84680 (0.0008) [2023-10-08 03:22:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 174522368. Throughput: 0: 1721.9, 1: 1730.2. Samples: 43638258. Policy #0 lag: (min: 24.0, avg: 45.3, max: 56.0) [2023-10-08 03:22:21,211][50642] Avg episode reward: [(0, '20.070'), (1, '21.580')] [2023-10-08 03:22:21,351][52060] Updated weights for policy 0, policy_version 84690 (0.0010) [2023-10-08 03:22:21,716][52060] Updated weights for policy 0, policy_version 84700 (0.0010) [2023-10-08 03:22:22,372][52059] Updated weights for policy 1, policy_version 85762 (0.0009) [2023-10-08 03:22:22,740][52059] Updated weights for policy 1, policy_version 85772 (0.0009) [2023-10-08 03:22:23,118][52059] Updated weights for policy 1, policy_version 85782 (0.0008) [2023-10-08 03:22:23,476][52059] Updated weights for policy 1, policy_version 85792 (0.0009) [2023-10-08 03:22:25,784][52060] Updated weights for policy 0, policy_version 84710 (0.0011) [2023-10-08 03:22:26,156][52060] Updated weights for policy 0, policy_version 84720 (0.0007) [2023-10-08 03:22:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 174587904. Throughput: 0: 1728.1, 1: 1726.8. Samples: 43659522. Policy #0 lag: (min: 24.0, avg: 45.3, max: 56.0) [2023-10-08 03:22:26,211][50642] Avg episode reward: [(0, '20.530'), (1, '23.090')] [2023-10-08 03:22:26,528][52060] Updated weights for policy 0, policy_version 84730 (0.0008) [2023-10-08 03:22:27,585][52059] Updated weights for policy 1, policy_version 85802 (0.0010) [2023-10-08 03:22:27,953][52059] Updated weights for policy 1, policy_version 85812 (0.0008) [2023-10-08 03:22:28,317][52059] Updated weights for policy 1, policy_version 85822 (0.0007) [2023-10-08 03:22:30,535][52060] Updated weights for policy 0, policy_version 84740 (0.0007) [2023-10-08 03:22:30,930][52060] Updated weights for policy 0, policy_version 84750 (0.0009) [2023-10-08 03:22:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 174653440. Throughput: 0: 1714.9, 1: 1750.4. Samples: 43680194. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:22:31,211][50642] Avg episode reward: [(0, '20.720'), (1, '25.430')] [2023-10-08 03:22:31,296][52060] Updated weights for policy 0, policy_version 84760 (0.0010) [2023-10-08 03:22:32,291][52059] Updated weights for policy 1, policy_version 85832 (0.0007) [2023-10-08 03:22:32,668][52059] Updated weights for policy 1, policy_version 85842 (0.0007) [2023-10-08 03:22:33,025][52059] Updated weights for policy 1, policy_version 85852 (0.0009) [2023-10-08 03:22:35,223][52060] Updated weights for policy 0, policy_version 84770 (0.0011) [2023-10-08 03:22:35,586][52060] Updated weights for policy 0, policy_version 84780 (0.0009) [2023-10-08 03:22:35,963][52060] Updated weights for policy 0, policy_version 84790 (0.0008) [2023-10-08 03:22:36,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 174718976. Throughput: 0: 1722.3, 1: 1720.0. Samples: 43689752. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:22:36,211][50642] Avg episode reward: [(0, '20.690'), (1, '21.650')] [2023-10-08 03:22:36,323][52060] Updated weights for policy 0, policy_version 84800 (0.0009) [2023-10-08 03:22:36,846][52059] Updated weights for policy 1, policy_version 85862 (0.0008) [2023-10-08 03:22:37,209][52059] Updated weights for policy 1, policy_version 85872 (0.0009) [2023-10-08 03:22:37,571][52059] Updated weights for policy 1, policy_version 85882 (0.0009) [2023-10-08 03:22:40,247][52060] Updated weights for policy 0, policy_version 84810 (0.0008) [2023-10-08 03:22:40,620][52060] Updated weights for policy 0, policy_version 84820 (0.0009) [2023-10-08 03:22:40,981][52060] Updated weights for policy 0, policy_version 84830 (0.0008) [2023-10-08 03:22:41,210][50642] Fps is (10 sec: 16383.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 174817280. Throughput: 0: 1734.3, 1: 1737.4. Samples: 43711414. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:22:41,211][50642] Avg episode reward: [(0, '20.710'), (1, '21.460')] [2023-10-08 03:22:41,426][52059] Updated weights for policy 1, policy_version 85892 (0.0007) [2023-10-08 03:22:41,798][52059] Updated weights for policy 1, policy_version 85902 (0.0009) [2023-10-08 03:22:42,158][52059] Updated weights for policy 1, policy_version 85912 (0.0008) [2023-10-08 03:22:45,085][52060] Updated weights for policy 0, policy_version 84840 (0.0008) [2023-10-08 03:22:45,439][52060] Updated weights for policy 0, policy_version 84850 (0.0007) [2023-10-08 03:22:45,805][52060] Updated weights for policy 0, policy_version 84860 (0.0007) [2023-10-08 03:22:46,154][52059] Updated weights for policy 1, policy_version 85922 (0.0009) [2023-10-08 03:22:46,210][50642] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 174882816. Throughput: 0: 1707.1, 1: 1749.6. Samples: 43731750. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:22:46,211][50642] Avg episode reward: [(0, '18.800'), (1, '22.530')] [2023-10-08 03:22:46,221][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000084864_86900736.pth... [2023-10-08 03:22:46,250][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000083264_85262336.pth [2023-10-08 03:22:46,513][52059] Updated weights for policy 1, policy_version 85932 (0.0008) [2023-10-08 03:22:46,876][52059] Updated weights for policy 1, policy_version 85942 (0.0008) [2023-10-08 03:22:47,238][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000085952_88014848.pth... [2023-10-08 03:22:47,242][52059] Updated weights for policy 1, policy_version 85952 (0.0010) [2023-10-08 03:22:47,278][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000084320_86343680.pth [2023-10-08 03:22:49,779][52060] Updated weights for policy 0, policy_version 84870 (0.0009) [2023-10-08 03:22:50,152][52060] Updated weights for policy 0, policy_version 84880 (0.0008) [2023-10-08 03:22:50,510][52060] Updated weights for policy 0, policy_version 84890 (0.0009) [2023-10-08 03:22:51,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 174948352. Throughput: 0: 1730.6, 1: 1731.1. Samples: 43742158. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:22:51,211][50642] Avg episode reward: [(0, '18.140'), (1, '24.360')] [2023-10-08 03:22:51,259][52059] Updated weights for policy 1, policy_version 85962 (0.0009) [2023-10-08 03:22:51,630][52059] Updated weights for policy 1, policy_version 85972 (0.0010) [2023-10-08 03:22:52,003][52059] Updated weights for policy 1, policy_version 85982 (0.0009) [2023-10-08 03:22:54,462][52060] Updated weights for policy 0, policy_version 84900 (0.0009) [2023-10-08 03:22:54,824][52060] Updated weights for policy 0, policy_version 84910 (0.0011) [2023-10-08 03:22:55,191][52060] Updated weights for policy 0, policy_version 84920 (0.0007) [2023-10-08 03:22:55,739][52059] Updated weights for policy 1, policy_version 85992 (0.0007) [2023-10-08 03:22:56,103][52059] Updated weights for policy 1, policy_version 86002 (0.0007) [2023-10-08 03:22:56,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 175013888. Throughput: 0: 1713.4, 1: 1754.9. Samples: 43763122. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:22:56,211][50642] Avg episode reward: [(0, '19.280'), (1, '22.100')] [2023-10-08 03:22:56,469][52059] Updated weights for policy 1, policy_version 86012 (0.0009) [2023-10-08 03:22:59,048][52060] Updated weights for policy 0, policy_version 84930 (0.0009) [2023-10-08 03:22:59,418][52060] Updated weights for policy 0, policy_version 84940 (0.0008) [2023-10-08 03:22:59,793][52060] Updated weights for policy 0, policy_version 84950 (0.0010) [2023-10-08 03:23:00,161][52060] Updated weights for policy 0, policy_version 84960 (0.0010) [2023-10-08 03:23:00,505][52059] Updated weights for policy 1, policy_version 86022 (0.0010) [2023-10-08 03:23:00,873][52059] Updated weights for policy 1, policy_version 86032 (0.0009) [2023-10-08 03:23:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 175079424. Throughput: 0: 1694.8, 1: 1738.4. Samples: 43783036. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:23:01,211][50642] Avg episode reward: [(0, '20.160'), (1, '21.840')] [2023-10-08 03:23:01,239][52059] Updated weights for policy 1, policy_version 86042 (0.0008) [2023-10-08 03:23:04,023][52060] Updated weights for policy 0, policy_version 84970 (0.0011) [2023-10-08 03:23:04,393][52060] Updated weights for policy 0, policy_version 84980 (0.0009) [2023-10-08 03:23:04,759][52060] Updated weights for policy 0, policy_version 84990 (0.0008) [2023-10-08 03:23:05,106][52059] Updated weights for policy 1, policy_version 86052 (0.0007) [2023-10-08 03:23:05,465][52059] Updated weights for policy 1, policy_version 86062 (0.0007) [2023-10-08 03:23:05,833][52059] Updated weights for policy 1, policy_version 86072 (0.0007) [2023-10-08 03:23:06,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 175177728. Throughput: 0: 1719.0, 1: 1746.5. Samples: 43794204. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:23:06,211][50642] Avg episode reward: [(0, '19.490'), (1, '21.220')] [2023-10-08 03:23:08,751][52060] Updated weights for policy 0, policy_version 85000 (0.0007) [2023-10-08 03:23:09,118][52060] Updated weights for policy 0, policy_version 85010 (0.0009) [2023-10-08 03:23:09,488][52060] Updated weights for policy 0, policy_version 85020 (0.0010) [2023-10-08 03:23:09,820][52059] Updated weights for policy 1, policy_version 86082 (0.0007) [2023-10-08 03:23:10,191][52059] Updated weights for policy 1, policy_version 86092 (0.0008) [2023-10-08 03:23:10,549][52059] Updated weights for policy 1, policy_version 86102 (0.0010) [2023-10-08 03:23:10,920][52059] Updated weights for policy 1, policy_version 86112 (0.0009) [2023-10-08 03:23:11,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 175243264. Throughput: 0: 1692.1, 1: 1747.0. Samples: 43814280. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:23:11,211][50642] Avg episode reward: [(0, '18.860'), (1, '24.730')] [2023-10-08 03:23:13,489][52060] Updated weights for policy 0, policy_version 85030 (0.0009) [2023-10-08 03:23:13,852][52060] Updated weights for policy 0, policy_version 85040 (0.0007) [2023-10-08 03:23:14,227][52060] Updated weights for policy 0, policy_version 85050 (0.0010) [2023-10-08 03:23:14,878][52059] Updated weights for policy 1, policy_version 86122 (0.0008) [2023-10-08 03:23:15,248][52059] Updated weights for policy 1, policy_version 86132 (0.0008) [2023-10-08 03:23:15,617][52059] Updated weights for policy 1, policy_version 86142 (0.0008) [2023-10-08 03:23:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 175308800. Throughput: 0: 1709.4, 1: 1717.2. Samples: 43834392. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:23:16,211][50642] Avg episode reward: [(0, '21.180'), (1, '23.970')] [2023-10-08 03:23:18,275][52060] Updated weights for policy 0, policy_version 85060 (0.0008) [2023-10-08 03:23:18,649][52060] Updated weights for policy 0, policy_version 85070 (0.0008) [2023-10-08 03:23:19,025][52060] Updated weights for policy 0, policy_version 85080 (0.0009) [2023-10-08 03:23:19,528][52059] Updated weights for policy 1, policy_version 86152 (0.0007) [2023-10-08 03:23:19,908][52059] Updated weights for policy 1, policy_version 86162 (0.0008) [2023-10-08 03:23:20,269][52059] Updated weights for policy 1, policy_version 86172 (0.0009) [2023-10-08 03:23:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 175374336. Throughput: 0: 1709.8, 1: 1750.3. Samples: 43845456. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) [2023-10-08 03:23:21,211][50642] Avg episode reward: [(0, '19.510'), (1, '19.870')] [2023-10-08 03:23:22,970][52060] Updated weights for policy 0, policy_version 85090 (0.0008) [2023-10-08 03:23:23,347][52060] Updated weights for policy 0, policy_version 85100 (0.0010) [2023-10-08 03:23:23,714][52060] Updated weights for policy 0, policy_version 85110 (0.0009) [2023-10-08 03:23:24,079][52060] Updated weights for policy 0, policy_version 85120 (0.0007) [2023-10-08 03:23:24,134][52059] Updated weights for policy 1, policy_version 86182 (0.0008) [2023-10-08 03:23:24,489][52059] Updated weights for policy 1, policy_version 86192 (0.0009) [2023-10-08 03:23:24,859][52059] Updated weights for policy 1, policy_version 86202 (0.0008) [2023-10-08 03:23:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 175439872. Throughput: 0: 1692.2, 1: 1718.7. Samples: 43864902. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) [2023-10-08 03:23:26,211][50642] Avg episode reward: [(0, '19.370'), (1, '21.620')] [2023-10-08 03:23:27,993][52060] Updated weights for policy 0, policy_version 85130 (0.0011) [2023-10-08 03:23:28,360][52060] Updated weights for policy 0, policy_version 85140 (0.0010) [2023-10-08 03:23:28,720][52060] Updated weights for policy 0, policy_version 85150 (0.0011) [2023-10-08 03:23:29,039][52059] Updated weights for policy 1, policy_version 86212 (0.0010) [2023-10-08 03:23:29,395][52059] Updated weights for policy 1, policy_version 86222 (0.0008) [2023-10-08 03:23:29,766][52059] Updated weights for policy 1, policy_version 86232 (0.0009) [2023-10-08 03:23:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 175505408. Throughput: 0: 1724.1, 1: 1702.9. Samples: 43885966. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) [2023-10-08 03:23:31,211][50642] Avg episode reward: [(0, '20.120'), (1, '24.550')] [2023-10-08 03:23:32,686][52060] Updated weights for policy 0, policy_version 85160 (0.0008) [2023-10-08 03:23:33,058][52060] Updated weights for policy 0, policy_version 85170 (0.0007) [2023-10-08 03:23:33,428][52060] Updated weights for policy 0, policy_version 85180 (0.0008) [2023-10-08 03:23:33,669][52059] Updated weights for policy 1, policy_version 86242 (0.0011) [2023-10-08 03:23:34,027][52059] Updated weights for policy 1, policy_version 86252 (0.0009) [2023-10-08 03:23:34,388][52059] Updated weights for policy 1, policy_version 86262 (0.0010) [2023-10-08 03:23:34,750][52059] Updated weights for policy 1, policy_version 86272 (0.0008) [2023-10-08 03:23:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 175570944. Throughput: 0: 1696.4, 1: 1727.3. Samples: 43896224. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) [2023-10-08 03:23:36,211][50642] Avg episode reward: [(0, '22.100'), (1, '25.920')] [2023-10-08 03:23:37,332][52060] Updated weights for policy 0, policy_version 85190 (0.0008) [2023-10-08 03:23:37,690][52060] Updated weights for policy 0, policy_version 85200 (0.0010) [2023-10-08 03:23:38,063][52060] Updated weights for policy 0, policy_version 85210 (0.0011) [2023-10-08 03:23:38,611][52059] Updated weights for policy 1, policy_version 86282 (0.0011) [2023-10-08 03:23:38,980][52059] Updated weights for policy 1, policy_version 86292 (0.0009) [2023-10-08 03:23:39,347][52059] Updated weights for policy 1, policy_version 86302 (0.0008) [2023-10-08 03:23:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13884.7). Total num frames: 175636480. Throughput: 0: 1713.8, 1: 1703.8. Samples: 43916916. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) [2023-10-08 03:23:41,211][50642] Avg episode reward: [(0, '21.460'), (1, '20.890')] [2023-10-08 03:23:41,794][52060] Updated weights for policy 0, policy_version 85220 (0.0010) [2023-10-08 03:23:42,159][52060] Updated weights for policy 0, policy_version 85230 (0.0010) [2023-10-08 03:23:42,534][52060] Updated weights for policy 0, policy_version 85240 (0.0008) [2023-10-08 03:23:43,395][52059] Updated weights for policy 1, policy_version 86312 (0.0008) [2023-10-08 03:23:43,768][52059] Updated weights for policy 1, policy_version 86322 (0.0009) [2023-10-08 03:23:44,123][52059] Updated weights for policy 1, policy_version 86332 (0.0007) [2023-10-08 03:23:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13884.8). Total num frames: 175702016. Throughput: 0: 1732.1, 1: 1715.8. Samples: 43938194. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) [2023-10-08 03:23:46,211][50642] Avg episode reward: [(0, '21.160'), (1, '19.510')] [2023-10-08 03:23:46,514][52060] Updated weights for policy 0, policy_version 85250 (0.0008) [2023-10-08 03:23:46,877][52060] Updated weights for policy 0, policy_version 85260 (0.0010) [2023-10-08 03:23:47,239][52060] Updated weights for policy 0, policy_version 85270 (0.0010) [2023-10-08 03:23:47,601][52060] Updated weights for policy 0, policy_version 85280 (0.0007) [2023-10-08 03:23:48,107][52059] Updated weights for policy 1, policy_version 86342 (0.0007) [2023-10-08 03:23:48,479][52059] Updated weights for policy 1, policy_version 86352 (0.0007) [2023-10-08 03:23:48,838][52059] Updated weights for policy 1, policy_version 86362 (0.0009) [2023-10-08 03:23:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 175767552. Throughput: 0: 1704.5, 1: 1710.6. Samples: 43947884. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) [2023-10-08 03:23:51,211][50642] Avg episode reward: [(0, '22.890'), (1, '22.000')] [2023-10-08 03:23:51,759][52060] Updated weights for policy 0, policy_version 85290 (0.0010) [2023-10-08 03:23:52,127][52060] Updated weights for policy 0, policy_version 85300 (0.0010) [2023-10-08 03:23:52,505][52060] Updated weights for policy 0, policy_version 85310 (0.0009) [2023-10-08 03:23:52,796][52059] Updated weights for policy 1, policy_version 86372 (0.0008) [2023-10-08 03:23:53,155][52059] Updated weights for policy 1, policy_version 86382 (0.0009) [2023-10-08 03:23:53,517][52059] Updated weights for policy 1, policy_version 86392 (0.0008) [2023-10-08 03:23:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 175833088. Throughput: 0: 1728.7, 1: 1706.5. Samples: 43968864. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) [2023-10-08 03:23:56,211][50642] Avg episode reward: [(0, '21.370'), (1, '25.950')] [2023-10-08 03:23:56,654][52060] Updated weights for policy 0, policy_version 85320 (0.0008) [2023-10-08 03:23:57,022][52060] Updated weights for policy 0, policy_version 85330 (0.0009) [2023-10-08 03:23:57,322][52059] Updated weights for policy 1, policy_version 86402 (0.0007) [2023-10-08 03:23:57,394][52060] Updated weights for policy 0, policy_version 85340 (0.0008) [2023-10-08 03:23:57,685][52059] Updated weights for policy 1, policy_version 86412 (0.0010) [2023-10-08 03:23:58,045][52059] Updated weights for policy 1, policy_version 86422 (0.0009) [2023-10-08 03:23:58,409][52059] Updated weights for policy 1, policy_version 86432 (0.0008) [2023-10-08 03:24:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 175898624. Throughput: 0: 1722.1, 1: 1739.0. Samples: 43990142. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) [2023-10-08 03:24:01,211][50642] Avg episode reward: [(0, '20.510'), (1, '22.920')] [2023-10-08 03:24:01,405][52060] Updated weights for policy 0, policy_version 85350 (0.0008) [2023-10-08 03:24:01,772][52060] Updated weights for policy 0, policy_version 85360 (0.0007) [2023-10-08 03:24:02,135][52060] Updated weights for policy 0, policy_version 85370 (0.0007) [2023-10-08 03:24:02,238][52059] Updated weights for policy 1, policy_version 86442 (0.0007) [2023-10-08 03:24:02,599][52059] Updated weights for policy 1, policy_version 86452 (0.0008) [2023-10-08 03:24:02,966][52059] Updated weights for policy 1, policy_version 86462 (0.0009) [2023-10-08 03:24:06,076][52060] Updated weights for policy 0, policy_version 85380 (0.0009) [2023-10-08 03:24:06,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 175964160. Throughput: 0: 1717.9, 1: 1709.6. Samples: 43999692. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) [2023-10-08 03:24:06,211][50642] Avg episode reward: [(0, '21.310'), (1, '21.050')] [2023-10-08 03:24:06,473][52060] Updated weights for policy 0, policy_version 85390 (0.0010) [2023-10-08 03:24:06,839][52060] Updated weights for policy 0, policy_version 85400 (0.0007) [2023-10-08 03:24:06,842][52059] Updated weights for policy 1, policy_version 86472 (0.0008) [2023-10-08 03:24:07,213][52059] Updated weights for policy 1, policy_version 86482 (0.0010) [2023-10-08 03:24:07,585][52059] Updated weights for policy 1, policy_version 86492 (0.0010) [2023-10-08 03:24:10,811][52060] Updated weights for policy 0, policy_version 85410 (0.0007) [2023-10-08 03:24:11,183][52060] Updated weights for policy 0, policy_version 85420 (0.0008) [2023-10-08 03:24:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 176029696. Throughput: 0: 1728.7, 1: 1740.0. Samples: 44020994. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) [2023-10-08 03:24:11,211][50642] Avg episode reward: [(0, '22.160'), (1, '22.240')] [2023-10-08 03:24:11,465][52059] Updated weights for policy 1, policy_version 86502 (0.0008) [2023-10-08 03:24:11,557][52060] Updated weights for policy 0, policy_version 85430 (0.0008) [2023-10-08 03:24:11,834][52059] Updated weights for policy 1, policy_version 86512 (0.0007) [2023-10-08 03:24:11,919][52060] Updated weights for policy 0, policy_version 85440 (0.0007) [2023-10-08 03:24:12,194][52059] Updated weights for policy 1, policy_version 86522 (0.0010) [2023-10-08 03:24:15,707][52060] Updated weights for policy 0, policy_version 85450 (0.0011) [2023-10-08 03:24:16,076][52060] Updated weights for policy 0, policy_version 85460 (0.0009) [2023-10-08 03:24:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 176095232. Throughput: 0: 1714.9, 1: 1747.2. Samples: 44041760. Policy #0 lag: (min: 31.0, avg: 35.4, max: 63.0) [2023-10-08 03:24:16,211][50642] Avg episode reward: [(0, '21.230'), (1, '25.470')] [2023-10-08 03:24:16,309][52059] Updated weights for policy 1, policy_version 86532 (0.0008) [2023-10-08 03:24:16,443][52060] Updated weights for policy 0, policy_version 85470 (0.0007) [2023-10-08 03:24:16,668][52059] Updated weights for policy 1, policy_version 86542 (0.0009) [2023-10-08 03:24:17,038][52059] Updated weights for policy 1, policy_version 86552 (0.0010) [2023-10-08 03:24:20,331][52060] Updated weights for policy 0, policy_version 85480 (0.0007) [2023-10-08 03:24:20,691][52060] Updated weights for policy 0, policy_version 85490 (0.0007) [2023-10-08 03:24:21,075][52060] Updated weights for policy 0, policy_version 85500 (0.0009) [2023-10-08 03:24:21,200][52059] Updated weights for policy 1, policy_version 86562 (0.0008) [2023-10-08 03:24:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 176160768. Throughput: 0: 1734.7, 1: 1724.6. Samples: 44051892. Policy #0 lag: (min: 30.0, avg: 35.1, max: 62.0) [2023-10-08 03:24:21,211][50642] Avg episode reward: [(0, '19.840'), (1, '25.390')] [2023-10-08 03:24:21,564][52059] Updated weights for policy 1, policy_version 86572 (0.0011) [2023-10-08 03:24:21,926][52059] Updated weights for policy 1, policy_version 86582 (0.0009) [2023-10-08 03:24:22,290][52059] Updated weights for policy 1, policy_version 86592 (0.0008) [2023-10-08 03:24:24,990][52060] Updated weights for policy 0, policy_version 85510 (0.0008) [2023-10-08 03:24:25,360][52060] Updated weights for policy 0, policy_version 85520 (0.0008) [2023-10-08 03:24:25,730][52060] Updated weights for policy 0, policy_version 85530 (0.0010) [2023-10-08 03:24:26,126][52059] Updated weights for policy 1, policy_version 86602 (0.0007) [2023-10-08 03:24:26,210][50642] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 176259072. Throughput: 0: 1726.7, 1: 1743.5. Samples: 44073072. Policy #0 lag: (min: 30.0, avg: 35.1, max: 62.0) [2023-10-08 03:24:26,211][50642] Avg episode reward: [(0, '20.710'), (1, '23.310')] [2023-10-08 03:24:26,497][52059] Updated weights for policy 1, policy_version 86612 (0.0007) [2023-10-08 03:24:26,856][52059] Updated weights for policy 1, policy_version 86622 (0.0009) [2023-10-08 03:24:29,738][52060] Updated weights for policy 0, policy_version 85540 (0.0009) [2023-10-08 03:24:30,114][52060] Updated weights for policy 0, policy_version 85550 (0.0009) [2023-10-08 03:24:30,479][52060] Updated weights for policy 0, policy_version 85560 (0.0008) [2023-10-08 03:24:30,710][52059] Updated weights for policy 1, policy_version 86632 (0.0007) [2023-10-08 03:24:31,076][52059] Updated weights for policy 1, policy_version 86642 (0.0011) [2023-10-08 03:24:31,211][50642] Fps is (10 sec: 16383.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 176324608. Throughput: 0: 1694.3, 1: 1741.6. Samples: 44092810. Policy #0 lag: (min: 30.0, avg: 35.1, max: 62.0) [2023-10-08 03:24:31,212][50642] Avg episode reward: [(0, '21.490'), (1, '21.800')] [2023-10-08 03:24:31,443][52059] Updated weights for policy 1, policy_version 86652 (0.0009) [2023-10-08 03:24:34,508][52060] Updated weights for policy 0, policy_version 85570 (0.0008) [2023-10-08 03:24:34,882][52060] Updated weights for policy 0, policy_version 85580 (0.0007) [2023-10-08 03:24:35,252][52060] Updated weights for policy 0, policy_version 85590 (0.0007) [2023-10-08 03:24:35,310][52059] Updated weights for policy 1, policy_version 86662 (0.0008) [2023-10-08 03:24:35,617][52060] Updated weights for policy 0, policy_version 85600 (0.0008) [2023-10-08 03:24:35,672][52059] Updated weights for policy 1, policy_version 86672 (0.0009) [2023-10-08 03:24:36,047][52059] Updated weights for policy 1, policy_version 86682 (0.0010) [2023-10-08 03:24:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 176390144. Throughput: 0: 1724.0, 1: 1741.8. Samples: 44103844. Policy #0 lag: (min: 30.0, avg: 35.1, max: 62.0) [2023-10-08 03:24:36,211][50642] Avg episode reward: [(0, '21.530'), (1, '22.090')] [2023-10-08 03:24:39,601][52060] Updated weights for policy 0, policy_version 85610 (0.0008) [2023-10-08 03:24:39,947][52059] Updated weights for policy 1, policy_version 86692 (0.0009) [2023-10-08 03:24:39,975][52060] Updated weights for policy 0, policy_version 85620 (0.0007) [2023-10-08 03:24:40,298][52059] Updated weights for policy 1, policy_version 86702 (0.0009) [2023-10-08 03:24:40,332][52060] Updated weights for policy 0, policy_version 85630 (0.0007) [2023-10-08 03:24:40,667][52059] Updated weights for policy 1, policy_version 86712 (0.0009) [2023-10-08 03:24:41,210][50642] Fps is (10 sec: 16384.5, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 176488448. Throughput: 0: 1710.4, 1: 1745.3. Samples: 44124372. Policy #0 lag: (min: 30.0, avg: 35.1, max: 62.0) [2023-10-08 03:24:41,211][50642] Avg episode reward: [(0, '20.460'), (1, '27.080')] [2023-10-08 03:24:44,304][52060] Updated weights for policy 0, policy_version 85640 (0.0007) [2023-10-08 03:24:44,661][52059] Updated weights for policy 1, policy_version 86722 (0.0010) [2023-10-08 03:24:44,679][52060] Updated weights for policy 0, policy_version 85650 (0.0011) [2023-10-08 03:24:45,020][52059] Updated weights for policy 1, policy_version 86732 (0.0008) [2023-10-08 03:24:45,043][52060] Updated weights for policy 0, policy_version 85660 (0.0008) [2023-10-08 03:24:45,372][52059] Updated weights for policy 1, policy_version 86742 (0.0010) [2023-10-08 03:24:45,736][52059] Updated weights for policy 1, policy_version 86752 (0.0008) [2023-10-08 03:24:46,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.4, 300 sec: 13884.8). Total num frames: 176553984. Throughput: 0: 1697.8, 1: 1714.1. Samples: 44143678. Policy #0 lag: (min: 30.0, avg: 35.1, max: 62.0) [2023-10-08 03:24:46,211][50642] Avg episode reward: [(0, '19.470'), (1, '22.720')] [2023-10-08 03:24:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000085664_87719936.pth... [2023-10-08 03:24:46,220][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000086752_88834048.pth... [2023-10-08 03:24:46,252][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000085120_87162880.pth [2023-10-08 03:24:46,262][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000084064_86081536.pth [2023-10-08 03:24:49,077][52060] Updated weights for policy 0, policy_version 85670 (0.0008) [2023-10-08 03:24:49,440][52060] Updated weights for policy 0, policy_version 85680 (0.0008) [2023-10-08 03:24:49,727][52059] Updated weights for policy 1, policy_version 86762 (0.0008) [2023-10-08 03:24:49,803][52060] Updated weights for policy 0, policy_version 85690 (0.0008) [2023-10-08 03:24:50,083][52059] Updated weights for policy 1, policy_version 86772 (0.0008) [2023-10-08 03:24:50,443][52059] Updated weights for policy 1, policy_version 86782 (0.0007) [2023-10-08 03:24:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 176619520. Throughput: 0: 1726.7, 1: 1740.1. Samples: 44155698. Policy #0 lag: (min: 30.0, avg: 35.1, max: 62.0) [2023-10-08 03:24:51,211][50642] Avg episode reward: [(0, '21.590'), (1, '22.260')] [2023-10-08 03:24:53,823][52060] Updated weights for policy 0, policy_version 85700 (0.0010) [2023-10-08 03:24:54,188][52060] Updated weights for policy 0, policy_version 85710 (0.0007) [2023-10-08 03:24:54,551][52060] Updated weights for policy 0, policy_version 85720 (0.0007) [2023-10-08 03:24:54,645][52059] Updated weights for policy 1, policy_version 86792 (0.0007) [2023-10-08 03:24:55,021][52059] Updated weights for policy 1, policy_version 86802 (0.0010) [2023-10-08 03:24:55,384][52059] Updated weights for policy 1, policy_version 86812 (0.0008) [2023-10-08 03:24:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 176685056. Throughput: 0: 1701.8, 1: 1723.8. Samples: 44175146. Policy #0 lag: (min: 30.0, avg: 35.1, max: 62.0) [2023-10-08 03:24:56,211][50642] Avg episode reward: [(0, '22.790'), (1, '23.210')] [2023-10-08 03:24:58,666][52060] Updated weights for policy 0, policy_version 85730 (0.0007) [2023-10-08 03:24:59,025][52060] Updated weights for policy 0, policy_version 85740 (0.0007) [2023-10-08 03:24:59,336][52059] Updated weights for policy 1, policy_version 86822 (0.0007) [2023-10-08 03:24:59,393][52060] Updated weights for policy 0, policy_version 85750 (0.0008) [2023-10-08 03:24:59,691][52059] Updated weights for policy 1, policy_version 86832 (0.0007) [2023-10-08 03:24:59,759][52060] Updated weights for policy 0, policy_version 85760 (0.0008) [2023-10-08 03:25:00,060][52059] Updated weights for policy 1, policy_version 86842 (0.0007) [2023-10-08 03:25:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 176750592. Throughput: 0: 1704.1, 1: 1711.0. Samples: 44195440. Policy #0 lag: (min: 30.0, avg: 35.1, max: 62.0) [2023-10-08 03:25:01,211][50642] Avg episode reward: [(0, '21.230'), (1, '26.100')] [2023-10-08 03:25:03,718][52060] Updated weights for policy 0, policy_version 85770 (0.0008) [2023-10-08 03:25:04,060][52059] Updated weights for policy 1, policy_version 86852 (0.0010) [2023-10-08 03:25:04,086][52060] Updated weights for policy 0, policy_version 85780 (0.0008) [2023-10-08 03:25:04,426][52059] Updated weights for policy 1, policy_version 86862 (0.0008) [2023-10-08 03:25:04,461][52060] Updated weights for policy 0, policy_version 85790 (0.0008) [2023-10-08 03:25:04,790][52059] Updated weights for policy 1, policy_version 86872 (0.0007) [2023-10-08 03:25:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 176816128. Throughput: 0: 1707.0, 1: 1738.9. Samples: 44206960. Policy #0 lag: (min: 30.0, avg: 35.1, max: 62.0) [2023-10-08 03:25:06,211][50642] Avg episode reward: [(0, '18.740'), (1, '27.980')] [2023-10-08 03:25:08,449][52060] Updated weights for policy 0, policy_version 85800 (0.0008) [2023-10-08 03:25:08,821][52060] Updated weights for policy 0, policy_version 85810 (0.0009) [2023-10-08 03:25:08,845][52059] Updated weights for policy 1, policy_version 86882 (0.0009) [2023-10-08 03:25:09,186][52060] Updated weights for policy 0, policy_version 85820 (0.0009) [2023-10-08 03:25:09,211][52059] Updated weights for policy 1, policy_version 86892 (0.0008) [2023-10-08 03:25:09,577][52059] Updated weights for policy 1, policy_version 86902 (0.0008) [2023-10-08 03:25:09,933][52059] Updated weights for policy 1, policy_version 86912 (0.0007) [2023-10-08 03:25:11,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 176881664. Throughput: 0: 1693.5, 1: 1710.1. Samples: 44226232. Policy #0 lag: (min: 30.0, avg: 35.1, max: 62.0) [2023-10-08 03:25:11,211][50642] Avg episode reward: [(0, '19.590'), (1, '22.270')] [2023-10-08 03:25:13,153][52060] Updated weights for policy 0, policy_version 85830 (0.0008) [2023-10-08 03:25:13,529][52060] Updated weights for policy 0, policy_version 85840 (0.0007) [2023-10-08 03:25:13,738][52059] Updated weights for policy 1, policy_version 86922 (0.0008) [2023-10-08 03:25:13,895][52060] Updated weights for policy 0, policy_version 85850 (0.0008) [2023-10-08 03:25:14,102][52059] Updated weights for policy 1, policy_version 86932 (0.0007) [2023-10-08 03:25:14,461][52059] Updated weights for policy 1, policy_version 86942 (0.0009) [2023-10-08 03:25:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 176947200. Throughput: 0: 1723.1, 1: 1715.7. Samples: 44247554. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 03:25:16,211][50642] Avg episode reward: [(0, '22.000'), (1, '22.470')] [2023-10-08 03:25:17,955][52060] Updated weights for policy 0, policy_version 85860 (0.0008) [2023-10-08 03:25:18,318][52060] Updated weights for policy 0, policy_version 85870 (0.0007) [2023-10-08 03:25:18,359][52059] Updated weights for policy 1, policy_version 86952 (0.0009) [2023-10-08 03:25:18,694][52060] Updated weights for policy 0, policy_version 85880 (0.0008) [2023-10-08 03:25:18,718][52059] Updated weights for policy 1, policy_version 86962 (0.0007) [2023-10-08 03:25:19,075][52059] Updated weights for policy 1, policy_version 86972 (0.0008) [2023-10-08 03:25:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 177012736. Throughput: 0: 1696.2, 1: 1714.8. Samples: 44257342. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 03:25:21,211][50642] Avg episode reward: [(0, '20.560'), (1, '25.230')] [2023-10-08 03:25:22,625][52060] Updated weights for policy 0, policy_version 85890 (0.0008) [2023-10-08 03:25:22,993][52060] Updated weights for policy 0, policy_version 85900 (0.0009) [2023-10-08 03:25:23,273][52059] Updated weights for policy 1, policy_version 86982 (0.0008) [2023-10-08 03:25:23,359][52060] Updated weights for policy 0, policy_version 85910 (0.0007) [2023-10-08 03:25:23,632][52059] Updated weights for policy 1, policy_version 86992 (0.0008) [2023-10-08 03:25:23,724][52060] Updated weights for policy 0, policy_version 85920 (0.0008) [2023-10-08 03:25:24,003][52059] Updated weights for policy 1, policy_version 87002 (0.0008) [2023-10-08 03:25:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 177078272. Throughput: 0: 1710.2, 1: 1700.8. Samples: 44277868. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 03:25:26,211][50642] Avg episode reward: [(0, '20.520'), (1, '26.230')] [2023-10-08 03:25:27,713][52060] Updated weights for policy 0, policy_version 85930 (0.0008) [2023-10-08 03:25:27,791][52059] Updated weights for policy 1, policy_version 87012 (0.0008) [2023-10-08 03:25:28,085][52060] Updated weights for policy 0, policy_version 85940 (0.0008) [2023-10-08 03:25:28,158][52059] Updated weights for policy 1, policy_version 87022 (0.0007) [2023-10-08 03:25:28,451][52060] Updated weights for policy 0, policy_version 85950 (0.0007) [2023-10-08 03:25:28,523][52059] Updated weights for policy 1, policy_version 87032 (0.0008) [2023-10-08 03:25:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 177143808. Throughput: 0: 1720.1, 1: 1728.2. Samples: 44298852. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 03:25:31,211][50642] Avg episode reward: [(0, '19.700'), (1, '23.280')] [2023-10-08 03:25:32,316][52059] Updated weights for policy 1, policy_version 87042 (0.0007) [2023-10-08 03:25:32,390][52060] Updated weights for policy 0, policy_version 85960 (0.0008) [2023-10-08 03:25:32,689][52059] Updated weights for policy 1, policy_version 87052 (0.0007) [2023-10-08 03:25:32,760][52060] Updated weights for policy 0, policy_version 85970 (0.0009) [2023-10-08 03:25:33,063][52059] Updated weights for policy 1, policy_version 87062 (0.0007) [2023-10-08 03:25:33,149][52060] Updated weights for policy 0, policy_version 85980 (0.0008) [2023-10-08 03:25:33,441][52059] Updated weights for policy 1, policy_version 87072 (0.0009) [2023-10-08 03:25:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 177209344. Throughput: 0: 1686.5, 1: 1700.6. Samples: 44308120. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 03:25:36,211][50642] Avg episode reward: [(0, '21.320'), (1, '21.390')] [2023-10-08 03:25:37,111][52060] Updated weights for policy 0, policy_version 85990 (0.0009) [2023-10-08 03:25:37,228][52059] Updated weights for policy 1, policy_version 87082 (0.0007) [2023-10-08 03:25:37,485][52060] Updated weights for policy 0, policy_version 86000 (0.0009) [2023-10-08 03:25:37,596][52059] Updated weights for policy 1, policy_version 87092 (0.0007) [2023-10-08 03:25:37,849][52060] Updated weights for policy 0, policy_version 86010 (0.0008) [2023-10-08 03:25:37,960][52059] Updated weights for policy 1, policy_version 87102 (0.0010) [2023-10-08 03:25:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 177274880. Throughput: 0: 1713.4, 1: 1719.1. Samples: 44329610. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 03:25:41,211][50642] Avg episode reward: [(0, '18.340'), (1, '25.240')] [2023-10-08 03:25:41,912][52060] Updated weights for policy 0, policy_version 86020 (0.0008) [2023-10-08 03:25:42,057][52059] Updated weights for policy 1, policy_version 87112 (0.0008) [2023-10-08 03:25:42,291][52060] Updated weights for policy 0, policy_version 86030 (0.0008) [2023-10-08 03:25:42,431][52059] Updated weights for policy 1, policy_version 87122 (0.0009) [2023-10-08 03:25:42,660][52060] Updated weights for policy 0, policy_version 86040 (0.0007) [2023-10-08 03:25:42,810][52059] Updated weights for policy 1, policy_version 87132 (0.0008) [2023-10-08 03:25:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 177340416. Throughput: 0: 1716.1, 1: 1727.7. Samples: 44350408. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 03:25:46,211][50642] Avg episode reward: [(0, '18.680'), (1, '24.420')] [2023-10-08 03:25:46,690][52060] Updated weights for policy 0, policy_version 86050 (0.0007) [2023-10-08 03:25:46,757][52059] Updated weights for policy 1, policy_version 87142 (0.0008) [2023-10-08 03:25:47,061][52060] Updated weights for policy 0, policy_version 86060 (0.0007) [2023-10-08 03:25:47,124][52059] Updated weights for policy 1, policy_version 87152 (0.0009) [2023-10-08 03:25:47,430][52060] Updated weights for policy 0, policy_version 86070 (0.0007) [2023-10-08 03:25:47,488][52059] Updated weights for policy 1, policy_version 87162 (0.0008) [2023-10-08 03:25:47,794][52060] Updated weights for policy 0, policy_version 86080 (0.0010) [2023-10-08 03:25:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 177405952. Throughput: 0: 1695.5, 1: 1699.0. Samples: 44359710. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 03:25:51,211][50642] Avg episode reward: [(0, '21.290'), (1, '23.360')] [2023-10-08 03:25:51,415][52059] Updated weights for policy 1, policy_version 87172 (0.0008) [2023-10-08 03:25:51,767][52059] Updated weights for policy 1, policy_version 87182 (0.0009) [2023-10-08 03:25:51,879][52060] Updated weights for policy 0, policy_version 86090 (0.0009) [2023-10-08 03:25:52,135][52059] Updated weights for policy 1, policy_version 87192 (0.0010) [2023-10-08 03:25:52,248][52060] Updated weights for policy 0, policy_version 86100 (0.0008) [2023-10-08 03:25:52,612][52060] Updated weights for policy 0, policy_version 86110 (0.0008) [2023-10-08 03:25:56,038][52059] Updated weights for policy 1, policy_version 87202 (0.0008) [2023-10-08 03:25:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 177471488. Throughput: 0: 1707.5, 1: 1725.2. Samples: 44380704. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 03:25:56,211][50642] Avg episode reward: [(0, '21.430'), (1, '23.330')] [2023-10-08 03:25:56,392][52059] Updated weights for policy 1, policy_version 87212 (0.0008) [2023-10-08 03:25:56,629][52060] Updated weights for policy 0, policy_version 86120 (0.0007) [2023-10-08 03:25:56,758][52059] Updated weights for policy 1, policy_version 87222 (0.0007) [2023-10-08 03:25:56,996][52060] Updated weights for policy 0, policy_version 86130 (0.0008) [2023-10-08 03:25:57,121][52059] Updated weights for policy 1, policy_version 87232 (0.0007) [2023-10-08 03:25:57,361][52060] Updated weights for policy 0, policy_version 86140 (0.0009) [2023-10-08 03:26:01,071][52059] Updated weights for policy 1, policy_version 87242 (0.0011) [2023-10-08 03:26:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 177537024. Throughput: 0: 1706.7, 1: 1721.9. Samples: 44401838. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 03:26:01,211][50642] Avg episode reward: [(0, '17.640'), (1, '24.800')] [2023-10-08 03:26:01,285][52060] Updated weights for policy 0, policy_version 86150 (0.0009) [2023-10-08 03:26:01,431][52059] Updated weights for policy 1, policy_version 87252 (0.0007) [2023-10-08 03:26:01,653][52060] Updated weights for policy 0, policy_version 86160 (0.0009) [2023-10-08 03:26:01,799][52059] Updated weights for policy 1, policy_version 87262 (0.0008) [2023-10-08 03:26:02,014][52060] Updated weights for policy 0, policy_version 86170 (0.0008) [2023-10-08 03:26:05,909][52059] Updated weights for policy 1, policy_version 87272 (0.0007) [2023-10-08 03:26:06,007][52060] Updated weights for policy 0, policy_version 86180 (0.0008) [2023-10-08 03:26:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 177602560. Throughput: 0: 1706.8, 1: 1719.0. Samples: 44411500. Policy #0 lag: (min: 20.0, avg: 20.0, max: 20.0) [2023-10-08 03:26:06,211][50642] Avg episode reward: [(0, '19.140'), (1, '25.630')] [2023-10-08 03:26:06,273][52059] Updated weights for policy 1, policy_version 87282 (0.0007) [2023-10-08 03:26:06,372][52060] Updated weights for policy 0, policy_version 86190 (0.0007) [2023-10-08 03:26:06,634][52059] Updated weights for policy 1, policy_version 87292 (0.0007) [2023-10-08 03:26:06,741][52060] Updated weights for policy 0, policy_version 86200 (0.0007) [2023-10-08 03:26:10,531][52059] Updated weights for policy 1, policy_version 87302 (0.0011) [2023-10-08 03:26:10,816][52060] Updated weights for policy 0, policy_version 86210 (0.0009) [2023-10-08 03:26:10,894][52059] Updated weights for policy 1, policy_version 87312 (0.0009) [2023-10-08 03:26:11,183][52060] Updated weights for policy 0, policy_version 86220 (0.0007) [2023-10-08 03:26:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 177668096. Throughput: 0: 1705.5, 1: 1732.0. Samples: 44432552. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) [2023-10-08 03:26:11,211][50642] Avg episode reward: [(0, '22.220'), (1, '22.000')] [2023-10-08 03:26:11,252][52059] Updated weights for policy 1, policy_version 87322 (0.0007) [2023-10-08 03:26:11,548][52060] Updated weights for policy 0, policy_version 86230 (0.0008) [2023-10-08 03:26:11,912][52060] Updated weights for policy 0, policy_version 86240 (0.0008) [2023-10-08 03:26:15,234][52059] Updated weights for policy 1, policy_version 87332 (0.0009) [2023-10-08 03:26:15,597][52059] Updated weights for policy 1, policy_version 87342 (0.0008) [2023-10-08 03:26:15,917][52060] Updated weights for policy 0, policy_version 86250 (0.0008) [2023-10-08 03:26:15,960][52059] Updated weights for policy 1, policy_version 87352 (0.0008) [2023-10-08 03:26:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 177733632. Throughput: 0: 1705.2, 1: 1712.3. Samples: 44452640. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) [2023-10-08 03:26:16,211][50642] Avg episode reward: [(0, '21.090'), (1, '21.770')] [2023-10-08 03:26:16,283][52060] Updated weights for policy 0, policy_version 86260 (0.0007) [2023-10-08 03:26:16,647][52060] Updated weights for policy 0, policy_version 86270 (0.0010) [2023-10-08 03:26:20,021][52059] Updated weights for policy 1, policy_version 87362 (0.0009) [2023-10-08 03:26:20,390][52059] Updated weights for policy 1, policy_version 87372 (0.0008) [2023-10-08 03:26:20,596][52060] Updated weights for policy 0, policy_version 86280 (0.0007) [2023-10-08 03:26:20,758][52059] Updated weights for policy 1, policy_version 87382 (0.0007) [2023-10-08 03:26:20,958][52060] Updated weights for policy 0, policy_version 86290 (0.0007) [2023-10-08 03:26:21,123][52059] Updated weights for policy 1, policy_version 87392 (0.0009) [2023-10-08 03:26:21,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 177831936. Throughput: 0: 1714.5, 1: 1732.2. Samples: 44463220. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) [2023-10-08 03:26:21,211][50642] Avg episode reward: [(0, '20.070'), (1, '24.040')] [2023-10-08 03:26:21,322][52060] Updated weights for policy 0, policy_version 86300 (0.0009) [2023-10-08 03:26:24,996][52059] Updated weights for policy 1, policy_version 87402 (0.0007) [2023-10-08 03:26:25,366][52059] Updated weights for policy 1, policy_version 87412 (0.0009) [2023-10-08 03:26:25,494][52060] Updated weights for policy 0, policy_version 86310 (0.0009) [2023-10-08 03:26:25,718][52059] Updated weights for policy 1, policy_version 87422 (0.0009) [2023-10-08 03:26:25,858][52060] Updated weights for policy 0, policy_version 86320 (0.0009) [2023-10-08 03:26:26,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 177897472. Throughput: 0: 1709.5, 1: 1724.0. Samples: 44484118. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) [2023-10-08 03:26:26,211][50642] Avg episode reward: [(0, '21.700'), (1, '25.160')] [2023-10-08 03:26:26,223][52060] Updated weights for policy 0, policy_version 86330 (0.0009) [2023-10-08 03:26:29,819][52059] Updated weights for policy 1, policy_version 87432 (0.0008) [2023-10-08 03:26:30,186][52059] Updated weights for policy 1, policy_version 87442 (0.0008) [2023-10-08 03:26:30,269][52060] Updated weights for policy 0, policy_version 86340 (0.0008) [2023-10-08 03:26:30,558][52059] Updated weights for policy 1, policy_version 87452 (0.0007) [2023-10-08 03:26:30,653][52060] Updated weights for policy 0, policy_version 86350 (0.0010) [2023-10-08 03:26:31,021][52060] Updated weights for policy 0, policy_version 86360 (0.0011) [2023-10-08 03:26:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 177963008. Throughput: 0: 1691.7, 1: 1705.7. Samples: 44503294. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) [2023-10-08 03:26:31,211][50642] Avg episode reward: [(0, '22.170'), (1, '22.740')] [2023-10-08 03:26:34,384][52059] Updated weights for policy 1, policy_version 87462 (0.0009) [2023-10-08 03:26:34,746][52059] Updated weights for policy 1, policy_version 87472 (0.0008) [2023-10-08 03:26:35,062][52060] Updated weights for policy 0, policy_version 86370 (0.0011) [2023-10-08 03:26:35,116][52059] Updated weights for policy 1, policy_version 87482 (0.0009) [2023-10-08 03:26:35,423][52060] Updated weights for policy 0, policy_version 86380 (0.0010) [2023-10-08 03:26:35,800][52060] Updated weights for policy 0, policy_version 86390 (0.0010) [2023-10-08 03:26:36,166][52060] Updated weights for policy 0, policy_version 86400 (0.0011) [2023-10-08 03:26:36,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 178061312. Throughput: 0: 1706.1, 1: 1738.1. Samples: 44514698. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) [2023-10-08 03:26:36,211][50642] Avg episode reward: [(0, '20.200'), (1, '21.880')] [2023-10-08 03:26:38,881][52059] Updated weights for policy 1, policy_version 87492 (0.0009) [2023-10-08 03:26:39,244][52059] Updated weights for policy 1, policy_version 87502 (0.0010) [2023-10-08 03:26:39,608][52059] Updated weights for policy 1, policy_version 87512 (0.0011) [2023-10-08 03:26:40,164][52060] Updated weights for policy 0, policy_version 86410 (0.0010) [2023-10-08 03:26:40,539][52060] Updated weights for policy 0, policy_version 86420 (0.0009) [2023-10-08 03:26:40,905][52060] Updated weights for policy 0, policy_version 86430 (0.0007) [2023-10-08 03:26:41,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 178126848. Throughput: 0: 1705.2, 1: 1720.0. Samples: 44534842. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) [2023-10-08 03:26:41,211][50642] Avg episode reward: [(0, '19.890'), (1, '23.230')] [2023-10-08 03:26:43,621][52059] Updated weights for policy 1, policy_version 87522 (0.0008) [2023-10-08 03:26:43,977][52059] Updated weights for policy 1, policy_version 87532 (0.0007) [2023-10-08 03:26:44,357][52059] Updated weights for policy 1, policy_version 87542 (0.0010) [2023-10-08 03:26:44,601][52060] Updated weights for policy 0, policy_version 86440 (0.0007) [2023-10-08 03:26:44,723][52059] Updated weights for policy 1, policy_version 87552 (0.0007) [2023-10-08 03:26:44,972][52060] Updated weights for policy 0, policy_version 86450 (0.0011) [2023-10-08 03:26:45,338][52060] Updated weights for policy 0, policy_version 86460 (0.0008) [2023-10-08 03:26:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 178192384. Throughput: 0: 1687.0, 1: 1719.2. Samples: 44555118. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) [2023-10-08 03:26:46,211][50642] Avg episode reward: [(0, '22.150'), (1, '26.040')] [2023-10-08 03:26:46,220][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000087552_89653248.pth... [2023-10-08 03:26:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000086464_88539136.pth... [2023-10-08 03:26:46,256][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000084864_86900736.pth [2023-10-08 03:26:46,260][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000085952_88014848.pth [2023-10-08 03:26:48,578][52059] Updated weights for policy 1, policy_version 87562 (0.0008) [2023-10-08 03:26:48,942][52059] Updated weights for policy 1, policy_version 87572 (0.0008) [2023-10-08 03:26:49,309][52059] Updated weights for policy 1, policy_version 87582 (0.0007) [2023-10-08 03:26:49,372][52060] Updated weights for policy 0, policy_version 86470 (0.0009) [2023-10-08 03:26:49,740][52060] Updated weights for policy 0, policy_version 86480 (0.0007) [2023-10-08 03:26:50,113][52060] Updated weights for policy 0, policy_version 86490 (0.0008) [2023-10-08 03:26:51,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 178257920. Throughput: 0: 1712.9, 1: 1728.0. Samples: 44566342. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) [2023-10-08 03:26:51,211][50642] Avg episode reward: [(0, '19.790'), (1, '27.620')] [2023-10-08 03:26:53,269][52059] Updated weights for policy 1, policy_version 87592 (0.0011) [2023-10-08 03:26:53,629][52059] Updated weights for policy 1, policy_version 87602 (0.0008) [2023-10-08 03:26:53,997][52059] Updated weights for policy 1, policy_version 87612 (0.0007) [2023-10-08 03:26:54,180][52060] Updated weights for policy 0, policy_version 86500 (0.0008) [2023-10-08 03:26:54,551][52060] Updated weights for policy 0, policy_version 86510 (0.0007) [2023-10-08 03:26:54,923][52060] Updated weights for policy 0, policy_version 86520 (0.0007) [2023-10-08 03:26:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 178323456. Throughput: 0: 1693.0, 1: 1715.8. Samples: 44585948. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) [2023-10-08 03:26:56,211][50642] Avg episode reward: [(0, '19.910'), (1, '24.430')] [2023-10-08 03:26:58,015][52059] Updated weights for policy 1, policy_version 87622 (0.0008) [2023-10-08 03:26:58,381][52059] Updated weights for policy 1, policy_version 87632 (0.0007) [2023-10-08 03:26:58,742][52059] Updated weights for policy 1, policy_version 87642 (0.0008) [2023-10-08 03:26:58,869][52060] Updated weights for policy 0, policy_version 86530 (0.0007) [2023-10-08 03:26:59,247][52060] Updated weights for policy 0, policy_version 86540 (0.0009) [2023-10-08 03:26:59,616][52060] Updated weights for policy 0, policy_version 86550 (0.0008) [2023-10-08 03:26:59,974][52060] Updated weights for policy 0, policy_version 86560 (0.0008) [2023-10-08 03:27:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 178388992. Throughput: 0: 1685.7, 1: 1738.5. Samples: 44606732. Policy #0 lag: (min: 31.0, avg: 33.3, max: 63.0) [2023-10-08 03:27:01,211][50642] Avg episode reward: [(0, '19.130'), (1, '23.950')] [2023-10-08 03:27:02,645][52059] Updated weights for policy 1, policy_version 87652 (0.0007) [2023-10-08 03:27:03,007][52059] Updated weights for policy 1, policy_version 87662 (0.0008) [2023-10-08 03:27:03,379][52059] Updated weights for policy 1, policy_version 87672 (0.0007) [2023-10-08 03:27:03,873][52060] Updated weights for policy 0, policy_version 86570 (0.0008) [2023-10-08 03:27:04,233][52060] Updated weights for policy 0, policy_version 86580 (0.0007) [2023-10-08 03:27:04,597][52060] Updated weights for policy 0, policy_version 86590 (0.0008) [2023-10-08 03:27:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 178454528. Throughput: 0: 1701.5, 1: 1721.3. Samples: 44617244. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) [2023-10-08 03:27:06,211][50642] Avg episode reward: [(0, '21.580'), (1, '25.140')] [2023-10-08 03:27:07,312][52059] Updated weights for policy 1, policy_version 87682 (0.0007) [2023-10-08 03:27:07,663][52059] Updated weights for policy 1, policy_version 87692 (0.0008) [2023-10-08 03:27:08,024][52059] Updated weights for policy 1, policy_version 87702 (0.0010) [2023-10-08 03:27:08,388][52059] Updated weights for policy 1, policy_version 87712 (0.0010) [2023-10-08 03:27:08,642][52060] Updated weights for policy 0, policy_version 86600 (0.0008) [2023-10-08 03:27:09,009][52060] Updated weights for policy 0, policy_version 86610 (0.0010) [2023-10-08 03:27:09,389][52060] Updated weights for policy 0, policy_version 86620 (0.0009) [2023-10-08 03:27:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 178520064. Throughput: 0: 1681.0, 1: 1725.5. Samples: 44637412. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) [2023-10-08 03:27:11,211][50642] Avg episode reward: [(0, '18.840'), (1, '25.790')] [2023-10-08 03:27:12,341][52059] Updated weights for policy 1, policy_version 87722 (0.0007) [2023-10-08 03:27:12,704][52059] Updated weights for policy 1, policy_version 87732 (0.0008) [2023-10-08 03:27:13,070][52059] Updated weights for policy 1, policy_version 87742 (0.0007) [2023-10-08 03:27:13,423][52060] Updated weights for policy 0, policy_version 86630 (0.0008) [2023-10-08 03:27:13,791][52060] Updated weights for policy 0, policy_version 86640 (0.0009) [2023-10-08 03:27:14,151][52060] Updated weights for policy 0, policy_version 86650 (0.0011) [2023-10-08 03:27:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 178585600. Throughput: 0: 1703.0, 1: 1756.4. Samples: 44658970. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) [2023-10-08 03:27:16,211][50642] Avg episode reward: [(0, '19.590'), (1, '24.140')] [2023-10-08 03:27:17,184][52059] Updated weights for policy 1, policy_version 87752 (0.0008) [2023-10-08 03:27:17,558][52059] Updated weights for policy 1, policy_version 87762 (0.0007) [2023-10-08 03:27:17,920][52059] Updated weights for policy 1, policy_version 87772 (0.0009) [2023-10-08 03:27:18,248][52060] Updated weights for policy 0, policy_version 86660 (0.0009) [2023-10-08 03:27:18,636][52060] Updated weights for policy 0, policy_version 86670 (0.0008) [2023-10-08 03:27:19,007][52060] Updated weights for policy 0, policy_version 86680 (0.0009) [2023-10-08 03:27:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 178651136. Throughput: 0: 1699.7, 1: 1718.0. Samples: 44668496. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) [2023-10-08 03:27:21,211][50642] Avg episode reward: [(0, '22.130'), (1, '23.410')] [2023-10-08 03:27:21,644][52059] Updated weights for policy 1, policy_version 87782 (0.0009) [2023-10-08 03:27:21,999][52059] Updated weights for policy 1, policy_version 87792 (0.0009) [2023-10-08 03:27:22,376][52059] Updated weights for policy 1, policy_version 87802 (0.0010) [2023-10-08 03:27:23,093][52060] Updated weights for policy 0, policy_version 86690 (0.0010) [2023-10-08 03:27:23,449][52060] Updated weights for policy 0, policy_version 86700 (0.0009) [2023-10-08 03:27:23,815][52060] Updated weights for policy 0, policy_version 86710 (0.0009) [2023-10-08 03:27:24,185][52060] Updated weights for policy 0, policy_version 86720 (0.0011) [2023-10-08 03:27:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 178716672. Throughput: 0: 1686.3, 1: 1741.3. Samples: 44689086. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) [2023-10-08 03:27:26,211][50642] Avg episode reward: [(0, '21.970'), (1, '24.940')] [2023-10-08 03:27:26,328][52059] Updated weights for policy 1, policy_version 87812 (0.0010) [2023-10-08 03:27:26,684][52059] Updated weights for policy 1, policy_version 87822 (0.0009) [2023-10-08 03:27:27,040][52059] Updated weights for policy 1, policy_version 87832 (0.0007) [2023-10-08 03:27:28,169][52060] Updated weights for policy 0, policy_version 86730 (0.0008) [2023-10-08 03:27:28,524][52060] Updated weights for policy 0, policy_version 86740 (0.0007) [2023-10-08 03:27:28,895][52060] Updated weights for policy 0, policy_version 86750 (0.0007) [2023-10-08 03:27:31,068][52059] Updated weights for policy 1, policy_version 87842 (0.0007) [2023-10-08 03:27:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 178782208. Throughput: 0: 1702.7, 1: 1743.7. Samples: 44710204. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) [2023-10-08 03:27:31,211][50642] Avg episode reward: [(0, '19.070'), (1, '24.420')] [2023-10-08 03:27:31,427][52059] Updated weights for policy 1, policy_version 87852 (0.0009) [2023-10-08 03:27:31,789][52059] Updated weights for policy 1, policy_version 87862 (0.0008) [2023-10-08 03:27:32,154][52059] Updated weights for policy 1, policy_version 87872 (0.0009) [2023-10-08 03:27:32,810][52060] Updated weights for policy 0, policy_version 86760 (0.0008) [2023-10-08 03:27:33,173][52060] Updated weights for policy 0, policy_version 86770 (0.0007) [2023-10-08 03:27:33,535][52060] Updated weights for policy 0, policy_version 86780 (0.0007) [2023-10-08 03:27:36,182][52059] Updated weights for policy 1, policy_version 87882 (0.0008) [2023-10-08 03:27:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 178847744. Throughput: 0: 1676.4, 1: 1731.2. Samples: 44719686. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) [2023-10-08 03:27:36,211][50642] Avg episode reward: [(0, '18.200'), (1, '24.460')] [2023-10-08 03:27:36,556][52059] Updated weights for policy 1, policy_version 87892 (0.0008) [2023-10-08 03:27:36,921][52059] Updated weights for policy 1, policy_version 87902 (0.0008) [2023-10-08 03:27:37,613][52060] Updated weights for policy 0, policy_version 86790 (0.0010) [2023-10-08 03:27:37,984][52060] Updated weights for policy 0, policy_version 86800 (0.0008) [2023-10-08 03:27:38,345][52060] Updated weights for policy 0, policy_version 86810 (0.0007) [2023-10-08 03:27:40,929][52059] Updated weights for policy 1, policy_version 87912 (0.0008) [2023-10-08 03:27:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 178913280. Throughput: 0: 1695.2, 1: 1746.3. Samples: 44740816. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) [2023-10-08 03:27:41,211][50642] Avg episode reward: [(0, '20.840'), (1, '23.710')] [2023-10-08 03:27:41,285][52059] Updated weights for policy 1, policy_version 87922 (0.0007) [2023-10-08 03:27:41,657][52059] Updated weights for policy 1, policy_version 87932 (0.0007) [2023-10-08 03:27:42,428][52060] Updated weights for policy 0, policy_version 86820 (0.0008) [2023-10-08 03:27:42,793][52060] Updated weights for policy 0, policy_version 86830 (0.0008) [2023-10-08 03:27:43,172][52060] Updated weights for policy 0, policy_version 86840 (0.0008) [2023-10-08 03:27:45,510][52059] Updated weights for policy 1, policy_version 87942 (0.0007) [2023-10-08 03:27:45,872][52059] Updated weights for policy 1, policy_version 87952 (0.0010) [2023-10-08 03:27:46,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 178978816. Throughput: 0: 1708.4, 1: 1728.5. Samples: 44761388. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) [2023-10-08 03:27:46,211][50642] Avg episode reward: [(0, '22.210'), (1, '24.580')] [2023-10-08 03:27:46,243][52059] Updated weights for policy 1, policy_version 87962 (0.0010) [2023-10-08 03:27:47,081][52060] Updated weights for policy 0, policy_version 86850 (0.0008) [2023-10-08 03:27:47,448][52060] Updated weights for policy 0, policy_version 86860 (0.0008) [2023-10-08 03:27:47,820][52060] Updated weights for policy 0, policy_version 86870 (0.0010) [2023-10-08 03:27:48,183][52060] Updated weights for policy 0, policy_version 86880 (0.0011) [2023-10-08 03:27:50,164][52059] Updated weights for policy 1, policy_version 87972 (0.0009) [2023-10-08 03:27:50,531][52059] Updated weights for policy 1, policy_version 87982 (0.0008) [2023-10-08 03:27:50,893][52059] Updated weights for policy 1, policy_version 87992 (0.0009) [2023-10-08 03:27:51,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 179077120. Throughput: 0: 1684.4, 1: 1740.6. Samples: 44771372. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) [2023-10-08 03:27:51,211][50642] Avg episode reward: [(0, '19.670'), (1, '24.870')] [2023-10-08 03:27:52,282][52060] Updated weights for policy 0, policy_version 86890 (0.0008) [2023-10-08 03:27:52,644][52060] Updated weights for policy 0, policy_version 86900 (0.0009) [2023-10-08 03:27:53,023][52060] Updated weights for policy 0, policy_version 86910 (0.0009) [2023-10-08 03:27:54,829][52059] Updated weights for policy 1, policy_version 88002 (0.0009) [2023-10-08 03:27:55,190][52059] Updated weights for policy 1, policy_version 88012 (0.0008) [2023-10-08 03:27:55,553][52059] Updated weights for policy 1, policy_version 88022 (0.0009) [2023-10-08 03:27:55,916][52059] Updated weights for policy 1, policy_version 88032 (0.0009) [2023-10-08 03:27:56,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 179142656. Throughput: 0: 1703.7, 1: 1742.2. Samples: 44792478. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) [2023-10-08 03:27:56,211][50642] Avg episode reward: [(0, '19.410'), (1, '23.710')] [2023-10-08 03:27:57,012][52060] Updated weights for policy 0, policy_version 86920 (0.0009) [2023-10-08 03:27:57,369][52060] Updated weights for policy 0, policy_version 86930 (0.0008) [2023-10-08 03:27:57,736][52060] Updated weights for policy 0, policy_version 86940 (0.0008) [2023-10-08 03:27:59,801][52059] Updated weights for policy 1, policy_version 88042 (0.0007) [2023-10-08 03:28:00,167][52059] Updated weights for policy 1, policy_version 88052 (0.0007) [2023-10-08 03:28:00,519][52059] Updated weights for policy 1, policy_version 88062 (0.0008) [2023-10-08 03:28:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 179208192. Throughput: 0: 1707.3, 1: 1715.0. Samples: 44812972. Policy #0 lag: (min: 31.0, avg: 34.7, max: 63.0) [2023-10-08 03:28:01,211][50642] Avg episode reward: [(0, '21.390'), (1, '22.970')] [2023-10-08 03:28:01,629][52060] Updated weights for policy 0, policy_version 86950 (0.0009) [2023-10-08 03:28:01,997][52060] Updated weights for policy 0, policy_version 86960 (0.0007) [2023-10-08 03:28:02,367][52060] Updated weights for policy 0, policy_version 86970 (0.0007) [2023-10-08 03:28:04,448][52059] Updated weights for policy 1, policy_version 88072 (0.0007) [2023-10-08 03:28:04,828][52059] Updated weights for policy 1, policy_version 88082 (0.0008) [2023-10-08 03:28:05,187][52059] Updated weights for policy 1, policy_version 88092 (0.0009) [2023-10-08 03:28:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 179273728. Throughput: 0: 1697.2, 1: 1754.1. Samples: 44823804. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:28:06,211][50642] Avg episode reward: [(0, '21.200'), (1, '23.970')] [2023-10-08 03:28:06,410][52060] Updated weights for policy 0, policy_version 86980 (0.0008) [2023-10-08 03:28:06,791][52060] Updated weights for policy 0, policy_version 86990 (0.0008) [2023-10-08 03:28:07,163][52060] Updated weights for policy 0, policy_version 87000 (0.0007) [2023-10-08 03:28:09,050][52059] Updated weights for policy 1, policy_version 88102 (0.0010) [2023-10-08 03:28:09,416][52059] Updated weights for policy 1, policy_version 88112 (0.0009) [2023-10-08 03:28:09,776][52059] Updated weights for policy 1, policy_version 88122 (0.0011) [2023-10-08 03:28:11,118][52060] Updated weights for policy 0, policy_version 87010 (0.0007) [2023-10-08 03:28:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 179339264. Throughput: 0: 1713.6, 1: 1726.6. Samples: 44843894. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:28:11,211][50642] Avg episode reward: [(0, '18.370'), (1, '25.260')] [2023-10-08 03:28:11,481][52060] Updated weights for policy 0, policy_version 87020 (0.0007) [2023-10-08 03:28:11,842][52060] Updated weights for policy 0, policy_version 87030 (0.0008) [2023-10-08 03:28:12,209][52060] Updated weights for policy 0, policy_version 87040 (0.0009) [2023-10-08 03:28:13,574][52059] Updated weights for policy 1, policy_version 88132 (0.0011) [2023-10-08 03:28:13,940][52059] Updated weights for policy 1, policy_version 88142 (0.0009) [2023-10-08 03:28:14,304][52059] Updated weights for policy 1, policy_version 88152 (0.0007) [2023-10-08 03:28:16,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 179404800. Throughput: 0: 1717.3, 1: 1726.9. Samples: 44865194. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:28:16,211][50642] Avg episode reward: [(0, '18.440'), (1, '27.770')] [2023-10-08 03:28:16,214][52060] Updated weights for policy 0, policy_version 87050 (0.0009) [2023-10-08 03:28:16,576][52060] Updated weights for policy 0, policy_version 87060 (0.0010) [2023-10-08 03:28:16,957][52060] Updated weights for policy 0, policy_version 87070 (0.0008) [2023-10-08 03:28:18,136][52059] Updated weights for policy 1, policy_version 88162 (0.0008) [2023-10-08 03:28:18,503][52059] Updated weights for policy 1, policy_version 88172 (0.0008) [2023-10-08 03:28:18,862][52059] Updated weights for policy 1, policy_version 88182 (0.0009) [2023-10-08 03:28:19,224][52059] Updated weights for policy 1, policy_version 88192 (0.0010) [2023-10-08 03:28:20,899][52060] Updated weights for policy 0, policy_version 87080 (0.0008) [2023-10-08 03:28:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 179470336. Throughput: 0: 1714.9, 1: 1743.0. Samples: 44875290. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:28:21,211][50642] Avg episode reward: [(0, '19.590'), (1, '24.860')] [2023-10-08 03:28:21,274][52060] Updated weights for policy 0, policy_version 87090 (0.0007) [2023-10-08 03:28:21,632][52060] Updated weights for policy 0, policy_version 87100 (0.0008) [2023-10-08 03:28:23,124][52059] Updated weights for policy 1, policy_version 88202 (0.0008) [2023-10-08 03:28:23,478][52059] Updated weights for policy 1, policy_version 88212 (0.0007) [2023-10-08 03:28:23,846][52059] Updated weights for policy 1, policy_version 88222 (0.0009) [2023-10-08 03:28:25,674][52060] Updated weights for policy 0, policy_version 87110 (0.0009) [2023-10-08 03:28:26,037][52060] Updated weights for policy 0, policy_version 87120 (0.0009) [2023-10-08 03:28:26,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 179535872. Throughput: 0: 1721.0, 1: 1731.4. Samples: 44896174. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:28:26,211][50642] Avg episode reward: [(0, '20.900'), (1, '23.220')] [2023-10-08 03:28:26,402][52060] Updated weights for policy 0, policy_version 87130 (0.0007) [2023-10-08 03:28:27,641][52059] Updated weights for policy 1, policy_version 88232 (0.0007) [2023-10-08 03:28:28,009][52059] Updated weights for policy 1, policy_version 88242 (0.0008) [2023-10-08 03:28:28,382][52059] Updated weights for policy 1, policy_version 88252 (0.0009) [2023-10-08 03:28:30,282][52060] Updated weights for policy 0, policy_version 87140 (0.0009) [2023-10-08 03:28:30,643][52060] Updated weights for policy 0, policy_version 87150 (0.0008) [2023-10-08 03:28:31,017][52060] Updated weights for policy 0, policy_version 87160 (0.0009) [2023-10-08 03:28:31,211][50642] Fps is (10 sec: 13106.6, 60 sec: 13653.2, 300 sec: 13662.6). Total num frames: 179601408. Throughput: 0: 1705.6, 1: 1753.5. Samples: 44917046. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:28:31,212][50642] Avg episode reward: [(0, '17.930'), (1, '25.490')] [2023-10-08 03:28:32,290][52059] Updated weights for policy 1, policy_version 88262 (0.0007) [2023-10-08 03:28:32,649][52059] Updated weights for policy 1, policy_version 88272 (0.0009) [2023-10-08 03:28:33,025][52059] Updated weights for policy 1, policy_version 88282 (0.0008) [2023-10-08 03:28:34,924][52060] Updated weights for policy 0, policy_version 87170 (0.0009) [2023-10-08 03:28:35,297][52060] Updated weights for policy 0, policy_version 87180 (0.0010) [2023-10-08 03:28:35,670][52060] Updated weights for policy 0, policy_version 87190 (0.0009) [2023-10-08 03:28:36,029][52060] Updated weights for policy 0, policy_version 87200 (0.0008) [2023-10-08 03:28:36,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 179699712. Throughput: 0: 1726.9, 1: 1738.3. Samples: 44927306. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:28:36,211][50642] Avg episode reward: [(0, '18.550'), (1, '27.170')] [2023-10-08 03:28:36,943][52059] Updated weights for policy 1, policy_version 88292 (0.0007) [2023-10-08 03:28:37,301][52059] Updated weights for policy 1, policy_version 88302 (0.0008) [2023-10-08 03:28:37,660][52059] Updated weights for policy 1, policy_version 88312 (0.0008) [2023-10-08 03:28:40,019][52060] Updated weights for policy 0, policy_version 87210 (0.0007) [2023-10-08 03:28:40,402][52060] Updated weights for policy 0, policy_version 87220 (0.0010) [2023-10-08 03:28:40,771][52060] Updated weights for policy 0, policy_version 87230 (0.0009) [2023-10-08 03:28:41,210][50642] Fps is (10 sec: 16384.8, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 179765248. Throughput: 0: 1728.4, 1: 1737.3. Samples: 44948432. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:28:41,211][50642] Avg episode reward: [(0, '21.570'), (1, '25.820')] [2023-10-08 03:28:41,763][52059] Updated weights for policy 1, policy_version 88322 (0.0008) [2023-10-08 03:28:42,125][52059] Updated weights for policy 1, policy_version 88332 (0.0009) [2023-10-08 03:28:42,488][52059] Updated weights for policy 1, policy_version 88342 (0.0008) [2023-10-08 03:28:42,851][52059] Updated weights for policy 1, policy_version 88352 (0.0009) [2023-10-08 03:28:44,675][52060] Updated weights for policy 0, policy_version 87240 (0.0009) [2023-10-08 03:28:45,038][52060] Updated weights for policy 0, policy_version 87250 (0.0008) [2023-10-08 03:28:45,407][52060] Updated weights for policy 0, policy_version 87260 (0.0010) [2023-10-08 03:28:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 179830784. Throughput: 0: 1698.9, 1: 1761.3. Samples: 44968680. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:28:46,211][50642] Avg episode reward: [(0, '20.420'), (1, '24.040')] [2023-10-08 03:28:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000088352_90472448.pth... [2023-10-08 03:28:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000087264_89358336.pth... [2023-10-08 03:28:46,259][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000085664_87719936.pth [2023-10-08 03:28:46,259][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000086752_88834048.pth [2023-10-08 03:28:46,771][52059] Updated weights for policy 1, policy_version 88362 (0.0008) [2023-10-08 03:28:47,143][52059] Updated weights for policy 1, policy_version 88372 (0.0008) [2023-10-08 03:28:47,509][52059] Updated weights for policy 1, policy_version 88382 (0.0008) [2023-10-08 03:28:49,423][52060] Updated weights for policy 0, policy_version 87270 (0.0009) [2023-10-08 03:28:49,800][52060] Updated weights for policy 0, policy_version 87280 (0.0008) [2023-10-08 03:28:50,172][52060] Updated weights for policy 0, policy_version 87290 (0.0008) [2023-10-08 03:28:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 179896320. Throughput: 0: 1728.7, 1: 1726.1. Samples: 44979268. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:28:51,211][50642] Avg episode reward: [(0, '18.860'), (1, '23.540')] [2023-10-08 03:28:51,471][52059] Updated weights for policy 1, policy_version 88392 (0.0010) [2023-10-08 03:28:51,842][52059] Updated weights for policy 1, policy_version 88402 (0.0011) [2023-10-08 03:28:52,209][52059] Updated weights for policy 1, policy_version 88412 (0.0011) [2023-10-08 03:28:54,204][52060] Updated weights for policy 0, policy_version 87300 (0.0008) [2023-10-08 03:28:54,588][52060] Updated weights for policy 0, policy_version 87310 (0.0008) [2023-10-08 03:28:54,954][52060] Updated weights for policy 0, policy_version 87320 (0.0007) [2023-10-08 03:28:56,136][52059] Updated weights for policy 1, policy_version 88422 (0.0008) [2023-10-08 03:28:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 179961856. Throughput: 0: 1706.1, 1: 1749.8. Samples: 44999412. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:28:56,211][50642] Avg episode reward: [(0, '17.940'), (1, '24.160')] [2023-10-08 03:28:56,498][52059] Updated weights for policy 1, policy_version 88432 (0.0008) [2023-10-08 03:28:56,867][52059] Updated weights for policy 1, policy_version 88442 (0.0008) [2023-10-08 03:28:58,963][52060] Updated weights for policy 0, policy_version 87330 (0.0009) [2023-10-08 03:28:59,336][52060] Updated weights for policy 0, policy_version 87340 (0.0007) [2023-10-08 03:28:59,698][52060] Updated weights for policy 0, policy_version 87350 (0.0008) [2023-10-08 03:29:00,072][52060] Updated weights for policy 0, policy_version 87360 (0.0007) [2023-10-08 03:29:00,802][52059] Updated weights for policy 1, policy_version 88452 (0.0008) [2023-10-08 03:29:01,167][52059] Updated weights for policy 1, policy_version 88462 (0.0009) [2023-10-08 03:29:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 180027392. Throughput: 0: 1696.9, 1: 1740.7. Samples: 45019888. Policy #0 lag: (min: 29.0, avg: 30.8, max: 54.0) [2023-10-08 03:29:01,211][50642] Avg episode reward: [(0, '22.840'), (1, '23.600')] [2023-10-08 03:29:01,529][52059] Updated weights for policy 1, policy_version 88472 (0.0009) [2023-10-08 03:29:04,003][52060] Updated weights for policy 0, policy_version 87370 (0.0010) [2023-10-08 03:29:04,379][52060] Updated weights for policy 0, policy_version 87380 (0.0007) [2023-10-08 03:29:04,753][52060] Updated weights for policy 0, policy_version 87390 (0.0008) [2023-10-08 03:29:05,525][52059] Updated weights for policy 1, policy_version 88482 (0.0010) [2023-10-08 03:29:05,887][52059] Updated weights for policy 1, policy_version 88492 (0.0011) [2023-10-08 03:29:06,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 180092928. Throughput: 0: 1718.5, 1: 1727.0. Samples: 45030338. Policy #0 lag: (min: 29.0, avg: 30.8, max: 54.0) [2023-10-08 03:29:06,211][50642] Avg episode reward: [(0, '20.290'), (1, '23.020')] [2023-10-08 03:29:06,252][52059] Updated weights for policy 1, policy_version 88502 (0.0007) [2023-10-08 03:29:06,615][52059] Updated weights for policy 1, policy_version 88512 (0.0007) [2023-10-08 03:29:08,727][52060] Updated weights for policy 0, policy_version 87400 (0.0008) [2023-10-08 03:29:09,097][52060] Updated weights for policy 0, policy_version 87410 (0.0011) [2023-10-08 03:29:09,460][52060] Updated weights for policy 0, policy_version 87420 (0.0010) [2023-10-08 03:29:10,485][52059] Updated weights for policy 1, policy_version 88522 (0.0008) [2023-10-08 03:29:10,846][52059] Updated weights for policy 1, policy_version 88532 (0.0010) [2023-10-08 03:29:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 180158464. Throughput: 0: 1690.1, 1: 1739.1. Samples: 45050492. Policy #0 lag: (min: 29.0, avg: 30.8, max: 54.0) [2023-10-08 03:29:11,211][50642] Avg episode reward: [(0, '20.200'), (1, '24.240')] [2023-10-08 03:29:11,217][52059] Updated weights for policy 1, policy_version 88542 (0.0010) [2023-10-08 03:29:13,374][52060] Updated weights for policy 0, policy_version 87430 (0.0007) [2023-10-08 03:29:13,744][52060] Updated weights for policy 0, policy_version 87440 (0.0008) [2023-10-08 03:29:14,121][52060] Updated weights for policy 0, policy_version 87450 (0.0009) [2023-10-08 03:29:15,164][52059] Updated weights for policy 1, policy_version 88552 (0.0009) [2023-10-08 03:29:15,526][52059] Updated weights for policy 1, policy_version 88562 (0.0009) [2023-10-08 03:29:15,886][52059] Updated weights for policy 1, policy_version 88572 (0.0009) [2023-10-08 03:29:16,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 180256768. Throughput: 0: 1711.8, 1: 1712.0. Samples: 45071114. Policy #0 lag: (min: 29.0, avg: 30.8, max: 54.0) [2023-10-08 03:29:16,211][50642] Avg episode reward: [(0, '17.710'), (1, '25.290')] [2023-10-08 03:29:18,120][52060] Updated weights for policy 0, policy_version 87460 (0.0007) [2023-10-08 03:29:18,483][52060] Updated weights for policy 0, policy_version 87470 (0.0010) [2023-10-08 03:29:18,854][52060] Updated weights for policy 0, policy_version 87480 (0.0009) [2023-10-08 03:29:19,653][52059] Updated weights for policy 1, policy_version 88582 (0.0007) [2023-10-08 03:29:20,011][52059] Updated weights for policy 1, policy_version 88592 (0.0008) [2023-10-08 03:29:20,379][52059] Updated weights for policy 1, policy_version 88602 (0.0007) [2023-10-08 03:29:21,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 180322304. Throughput: 0: 1700.4, 1: 1738.3. Samples: 45082044. Policy #0 lag: (min: 29.0, avg: 30.8, max: 54.0) [2023-10-08 03:29:21,211][50642] Avg episode reward: [(0, '24.520'), (1, '22.510')] [2023-10-08 03:29:23,031][52060] Updated weights for policy 0, policy_version 87490 (0.0009) [2023-10-08 03:29:23,398][52060] Updated weights for policy 0, policy_version 87500 (0.0007) [2023-10-08 03:29:23,765][52060] Updated weights for policy 0, policy_version 87510 (0.0007) [2023-10-08 03:29:24,119][52060] Updated weights for policy 0, policy_version 87520 (0.0009) [2023-10-08 03:29:24,318][52059] Updated weights for policy 1, policy_version 88612 (0.0007) [2023-10-08 03:29:24,689][52059] Updated weights for policy 1, policy_version 88622 (0.0007) [2023-10-08 03:29:25,066][52059] Updated weights for policy 1, policy_version 88632 (0.0009) [2023-10-08 03:29:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 180387840. Throughput: 0: 1698.1, 1: 1723.6. Samples: 45102408. Policy #0 lag: (min: 29.0, avg: 30.8, max: 54.0) [2023-10-08 03:29:26,211][50642] Avg episode reward: [(0, '18.610'), (1, '24.900')] [2023-10-08 03:29:27,967][52060] Updated weights for policy 0, policy_version 87530 (0.0009) [2023-10-08 03:29:28,333][52060] Updated weights for policy 0, policy_version 87540 (0.0010) [2023-10-08 03:29:28,701][52060] Updated weights for policy 0, policy_version 87550 (0.0009) [2023-10-08 03:29:28,903][52059] Updated weights for policy 1, policy_version 88642 (0.0008) [2023-10-08 03:29:29,262][52059] Updated weights for policy 1, policy_version 88652 (0.0007) [2023-10-08 03:29:29,634][52059] Updated weights for policy 1, policy_version 88662 (0.0009) [2023-10-08 03:29:29,999][52059] Updated weights for policy 1, policy_version 88672 (0.0011) [2023-10-08 03:29:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.6, 300 sec: 13773.7). Total num frames: 180453376. Throughput: 0: 1730.9, 1: 1710.4. Samples: 45123536. Policy #0 lag: (min: 29.0, avg: 30.8, max: 54.0) [2023-10-08 03:29:31,211][50642] Avg episode reward: [(0, '19.480'), (1, '24.530')] [2023-10-08 03:29:32,652][52060] Updated weights for policy 0, policy_version 87560 (0.0007) [2023-10-08 03:29:33,018][52060] Updated weights for policy 0, policy_version 87570 (0.0008) [2023-10-08 03:29:33,392][52060] Updated weights for policy 0, policy_version 87580 (0.0008) [2023-10-08 03:29:33,991][52059] Updated weights for policy 1, policy_version 88682 (0.0008) [2023-10-08 03:29:34,366][52059] Updated weights for policy 1, policy_version 88692 (0.0007) [2023-10-08 03:29:34,725][52059] Updated weights for policy 1, policy_version 88702 (0.0007) [2023-10-08 03:29:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 180518912. Throughput: 0: 1701.3, 1: 1735.2. Samples: 45133912. Policy #0 lag: (min: 29.0, avg: 30.8, max: 54.0) [2023-10-08 03:29:36,211][50642] Avg episode reward: [(0, '19.500'), (1, '26.790')] [2023-10-08 03:29:37,198][52060] Updated weights for policy 0, policy_version 87590 (0.0008) [2023-10-08 03:29:37,562][52060] Updated weights for policy 0, policy_version 87600 (0.0009) [2023-10-08 03:29:37,929][52060] Updated weights for policy 0, policy_version 87610 (0.0007) [2023-10-08 03:29:38,770][52059] Updated weights for policy 1, policy_version 88712 (0.0008) [2023-10-08 03:29:39,144][52059] Updated weights for policy 1, policy_version 88722 (0.0008) [2023-10-08 03:29:39,515][52059] Updated weights for policy 1, policy_version 88732 (0.0009) [2023-10-08 03:29:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 180584448. Throughput: 0: 1729.5, 1: 1712.9. Samples: 45154320. Policy #0 lag: (min: 29.0, avg: 30.8, max: 54.0) [2023-10-08 03:29:41,211][50642] Avg episode reward: [(0, '23.770'), (1, '23.670')] [2023-10-08 03:29:41,871][52060] Updated weights for policy 0, policy_version 87620 (0.0007) [2023-10-08 03:29:42,271][52060] Updated weights for policy 0, policy_version 87630 (0.0008) [2023-10-08 03:29:42,642][52060] Updated weights for policy 0, policy_version 87640 (0.0007) [2023-10-08 03:29:43,521][52059] Updated weights for policy 1, policy_version 88742 (0.0009) [2023-10-08 03:29:43,884][52059] Updated weights for policy 1, policy_version 88752 (0.0008) [2023-10-08 03:29:44,252][52059] Updated weights for policy 1, policy_version 88762 (0.0009) [2023-10-08 03:29:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 180649984. Throughput: 0: 1742.3, 1: 1719.3. Samples: 45175658. Policy #0 lag: (min: 29.0, avg: 30.8, max: 54.0) [2023-10-08 03:29:46,211][50642] Avg episode reward: [(0, '17.870'), (1, '22.830')] [2023-10-08 03:29:46,312][52060] Updated weights for policy 0, policy_version 87650 (0.0007) [2023-10-08 03:29:46,682][52060] Updated weights for policy 0, policy_version 87660 (0.0007) [2023-10-08 03:29:47,065][52060] Updated weights for policy 0, policy_version 87670 (0.0007) [2023-10-08 03:29:47,434][52060] Updated weights for policy 0, policy_version 87680 (0.0009) [2023-10-08 03:29:48,174][52059] Updated weights for policy 1, policy_version 88772 (0.0007) [2023-10-08 03:29:48,539][52059] Updated weights for policy 1, policy_version 88782 (0.0009) [2023-10-08 03:29:48,899][52059] Updated weights for policy 1, policy_version 88792 (0.0009) [2023-10-08 03:29:51,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 180715520. Throughput: 0: 1725.2, 1: 1725.1. Samples: 45185604. Policy #0 lag: (min: 29.0, avg: 30.8, max: 54.0) [2023-10-08 03:29:51,211][50642] Avg episode reward: [(0, '17.290'), (1, '25.660')] [2023-10-08 03:29:51,382][52060] Updated weights for policy 0, policy_version 87690 (0.0008) [2023-10-08 03:29:51,756][52060] Updated weights for policy 0, policy_version 87700 (0.0008) [2023-10-08 03:29:52,127][52060] Updated weights for policy 0, policy_version 87710 (0.0008) [2023-10-08 03:29:52,737][52059] Updated weights for policy 1, policy_version 88802 (0.0009) [2023-10-08 03:29:53,099][52059] Updated weights for policy 1, policy_version 88812 (0.0008) [2023-10-08 03:29:53,461][52059] Updated weights for policy 1, policy_version 88822 (0.0008) [2023-10-08 03:29:53,822][52059] Updated weights for policy 1, policy_version 88832 (0.0009) [2023-10-08 03:29:55,926][52060] Updated weights for policy 0, policy_version 87720 (0.0008) [2023-10-08 03:29:56,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 180781056. Throughput: 0: 1753.7, 1: 1722.6. Samples: 45206928. Policy #0 lag: (min: 29.0, avg: 30.8, max: 54.0) [2023-10-08 03:29:56,211][50642] Avg episode reward: [(0, '20.680'), (1, '27.510')] [2023-10-08 03:29:56,293][52060] Updated weights for policy 0, policy_version 87730 (0.0008) [2023-10-08 03:29:56,661][52060] Updated weights for policy 0, policy_version 87740 (0.0009) [2023-10-08 03:29:57,709][52059] Updated weights for policy 1, policy_version 88842 (0.0007) [2023-10-08 03:29:58,064][52059] Updated weights for policy 1, policy_version 88852 (0.0009) [2023-10-08 03:29:58,429][52059] Updated weights for policy 1, policy_version 88862 (0.0007) [2023-10-08 03:30:00,668][52060] Updated weights for policy 0, policy_version 87750 (0.0008) [2023-10-08 03:30:01,026][52060] Updated weights for policy 0, policy_version 87760 (0.0008) [2023-10-08 03:30:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 180846592. Throughput: 0: 1736.8, 1: 1748.3. Samples: 45227946. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:30:01,211][50642] Avg episode reward: [(0, '22.380'), (1, '26.270')] [2023-10-08 03:30:01,402][52060] Updated weights for policy 0, policy_version 87770 (0.0008) [2023-10-08 03:30:02,414][52059] Updated weights for policy 1, policy_version 88872 (0.0008) [2023-10-08 03:30:02,771][52059] Updated weights for policy 1, policy_version 88882 (0.0010) [2023-10-08 03:30:03,140][52059] Updated weights for policy 1, policy_version 88892 (0.0008) [2023-10-08 03:30:05,201][52060] Updated weights for policy 0, policy_version 87780 (0.0008) [2023-10-08 03:30:05,562][52060] Updated weights for policy 0, policy_version 87790 (0.0009) [2023-10-08 03:30:05,942][52060] Updated weights for policy 0, policy_version 87800 (0.0009) [2023-10-08 03:30:06,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 180912128. Throughput: 0: 1740.9, 1: 1721.0. Samples: 45237828. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:30:06,211][50642] Avg episode reward: [(0, '18.160'), (1, '21.400')] [2023-10-08 03:30:07,138][52059] Updated weights for policy 1, policy_version 88902 (0.0008) [2023-10-08 03:30:07,495][52059] Updated weights for policy 1, policy_version 88912 (0.0007) [2023-10-08 03:30:07,868][52059] Updated weights for policy 1, policy_version 88922 (0.0009) [2023-10-08 03:30:09,918][52060] Updated weights for policy 0, policy_version 87810 (0.0008) [2023-10-08 03:30:10,289][52060] Updated weights for policy 0, policy_version 87820 (0.0009) [2023-10-08 03:30:10,651][52060] Updated weights for policy 0, policy_version 87830 (0.0009) [2023-10-08 03:30:11,022][52060] Updated weights for policy 0, policy_version 87840 (0.0009) [2023-10-08 03:30:11,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 181010432. Throughput: 0: 1749.2, 1: 1735.9. Samples: 45259234. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:30:11,211][50642] Avg episode reward: [(0, '17.810'), (1, '22.980')] [2023-10-08 03:30:11,690][52059] Updated weights for policy 1, policy_version 88932 (0.0009) [2023-10-08 03:30:12,064][52059] Updated weights for policy 1, policy_version 88942 (0.0010) [2023-10-08 03:30:12,438][52059] Updated weights for policy 1, policy_version 88952 (0.0010) [2023-10-08 03:30:15,140][52060] Updated weights for policy 0, policy_version 87850 (0.0008) [2023-10-08 03:30:15,506][52060] Updated weights for policy 0, policy_version 87860 (0.0011) [2023-10-08 03:30:15,882][52060] Updated weights for policy 0, policy_version 87870 (0.0010) [2023-10-08 03:30:16,210][50642] Fps is (10 sec: 16383.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 181075968. Throughput: 0: 1712.7, 1: 1749.9. Samples: 45279354. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:30:16,211][50642] Avg episode reward: [(0, '22.440'), (1, '26.350')] [2023-10-08 03:30:16,304][52059] Updated weights for policy 1, policy_version 88962 (0.0011) [2023-10-08 03:30:16,668][52059] Updated weights for policy 1, policy_version 88972 (0.0007) [2023-10-08 03:30:17,027][52059] Updated weights for policy 1, policy_version 88982 (0.0009) [2023-10-08 03:30:17,388][52059] Updated weights for policy 1, policy_version 88992 (0.0009) [2023-10-08 03:30:19,932][52060] Updated weights for policy 0, policy_version 87880 (0.0010) [2023-10-08 03:30:20,309][52060] Updated weights for policy 0, policy_version 87890 (0.0009) [2023-10-08 03:30:20,680][52060] Updated weights for policy 0, policy_version 87900 (0.0008) [2023-10-08 03:30:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 181141504. Throughput: 0: 1738.2, 1: 1727.4. Samples: 45289864. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:30:21,211][50642] Avg episode reward: [(0, '19.870'), (1, '26.310')] [2023-10-08 03:30:21,304][52059] Updated weights for policy 1, policy_version 89002 (0.0011) [2023-10-08 03:30:21,667][52059] Updated weights for policy 1, policy_version 89012 (0.0010) [2023-10-08 03:30:22,032][52059] Updated weights for policy 1, policy_version 89022 (0.0010) [2023-10-08 03:30:24,666][52060] Updated weights for policy 0, policy_version 87910 (0.0008) [2023-10-08 03:30:25,037][52060] Updated weights for policy 0, policy_version 87920 (0.0009) [2023-10-08 03:30:25,408][52060] Updated weights for policy 0, policy_version 87930 (0.0008) [2023-10-08 03:30:25,908][52059] Updated weights for policy 1, policy_version 89032 (0.0011) [2023-10-08 03:30:26,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 181207040. Throughput: 0: 1722.1, 1: 1758.6. Samples: 45310952. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:30:26,211][50642] Avg episode reward: [(0, '19.940'), (1, '22.690')] [2023-10-08 03:30:26,282][52059] Updated weights for policy 1, policy_version 89042 (0.0009) [2023-10-08 03:30:26,644][52059] Updated weights for policy 1, policy_version 89052 (0.0008) [2023-10-08 03:30:29,579][52060] Updated weights for policy 0, policy_version 87940 (0.0008) [2023-10-08 03:30:29,951][52060] Updated weights for policy 0, policy_version 87950 (0.0007) [2023-10-08 03:30:30,318][52060] Updated weights for policy 0, policy_version 87960 (0.0007) [2023-10-08 03:30:30,560][52059] Updated weights for policy 1, policy_version 89062 (0.0007) [2023-10-08 03:30:30,921][52059] Updated weights for policy 1, policy_version 89072 (0.0009) [2023-10-08 03:30:31,211][50642] Fps is (10 sec: 13106.6, 60 sec: 13653.2, 300 sec: 13773.6). Total num frames: 181272576. Throughput: 0: 1695.1, 1: 1748.1. Samples: 45330602. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:30:31,212][50642] Avg episode reward: [(0, '19.070'), (1, '21.510')] [2023-10-08 03:30:31,289][52059] Updated weights for policy 1, policy_version 89082 (0.0007) [2023-10-08 03:30:34,189][52060] Updated weights for policy 0, policy_version 87970 (0.0007) [2023-10-08 03:30:34,551][52060] Updated weights for policy 0, policy_version 87980 (0.0007) [2023-10-08 03:30:34,921][52060] Updated weights for policy 0, policy_version 87990 (0.0008) [2023-10-08 03:30:35,238][52059] Updated weights for policy 1, policy_version 89092 (0.0008) [2023-10-08 03:30:35,287][52060] Updated weights for policy 0, policy_version 88000 (0.0009) [2023-10-08 03:30:35,601][52059] Updated weights for policy 1, policy_version 89102 (0.0011) [2023-10-08 03:30:35,967][52059] Updated weights for policy 1, policy_version 89112 (0.0011) [2023-10-08 03:30:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 181338112. Throughput: 0: 1722.3, 1: 1746.6. Samples: 45341704. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:30:36,211][50642] Avg episode reward: [(0, '20.730'), (1, '23.160')] [2023-10-08 03:30:39,297][52060] Updated weights for policy 0, policy_version 88010 (0.0007) [2023-10-08 03:30:39,674][52060] Updated weights for policy 0, policy_version 88020 (0.0007) [2023-10-08 03:30:39,727][52059] Updated weights for policy 1, policy_version 89122 (0.0009) [2023-10-08 03:30:40,034][52060] Updated weights for policy 0, policy_version 88030 (0.0007) [2023-10-08 03:30:40,096][52059] Updated weights for policy 1, policy_version 89132 (0.0008) [2023-10-08 03:30:40,459][52059] Updated weights for policy 1, policy_version 89142 (0.0009) [2023-10-08 03:30:40,822][52059] Updated weights for policy 1, policy_version 89152 (0.0007) [2023-10-08 03:30:41,210][50642] Fps is (10 sec: 16384.7, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 181436416. Throughput: 0: 1701.8, 1: 1750.0. Samples: 45362260. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:30:41,211][50642] Avg episode reward: [(0, '20.160'), (1, '26.700')] [2023-10-08 03:30:44,020][52060] Updated weights for policy 0, policy_version 88040 (0.0010) [2023-10-08 03:30:44,396][52060] Updated weights for policy 0, policy_version 88050 (0.0008) [2023-10-08 03:30:44,702][52059] Updated weights for policy 1, policy_version 89162 (0.0008) [2023-10-08 03:30:44,770][52060] Updated weights for policy 0, policy_version 88060 (0.0009) [2023-10-08 03:30:45,060][52059] Updated weights for policy 1, policy_version 89172 (0.0008) [2023-10-08 03:30:45,433][52059] Updated weights for policy 1, policy_version 89182 (0.0010) [2023-10-08 03:30:46,210][50642] Fps is (10 sec: 16383.4, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 181501952. Throughput: 0: 1700.6, 1: 1728.7. Samples: 45382266. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:30:46,212][50642] Avg episode reward: [(0, '20.390'), (1, '24.420')] [2023-10-08 03:30:46,225][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000088064_90177536.pth... [2023-10-08 03:30:46,225][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000089184_91324416.pth... [2023-10-08 03:30:46,258][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000086464_88539136.pth [2023-10-08 03:30:46,260][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000087552_89653248.pth [2023-10-08 03:30:46,262][51605] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p0/milestones/checkpoint_000088064_90177536.pth [2023-10-08 03:30:46,264][51710] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p1/milestones/checkpoint_000089184_91324416.pth [2023-10-08 03:30:48,781][52060] Updated weights for policy 0, policy_version 88070 (0.0009) [2023-10-08 03:30:49,144][52060] Updated weights for policy 0, policy_version 88080 (0.0010) [2023-10-08 03:30:49,271][52059] Updated weights for policy 1, policy_version 89192 (0.0008) [2023-10-08 03:30:49,509][52060] Updated weights for policy 0, policy_version 88090 (0.0007) [2023-10-08 03:30:49,638][52059] Updated weights for policy 1, policy_version 89202 (0.0009) [2023-10-08 03:30:50,004][52059] Updated weights for policy 1, policy_version 89212 (0.0007) [2023-10-08 03:30:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 181567488. Throughput: 0: 1702.2, 1: 1760.2. Samples: 45393636. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:30:51,211][50642] Avg episode reward: [(0, '20.760'), (1, '21.110')] [2023-10-08 03:30:53,463][52060] Updated weights for policy 0, policy_version 88100 (0.0008) [2023-10-08 03:30:53,783][52059] Updated weights for policy 1, policy_version 89222 (0.0008) [2023-10-08 03:30:53,827][52060] Updated weights for policy 0, policy_version 88110 (0.0007) [2023-10-08 03:30:54,142][52059] Updated weights for policy 1, policy_version 89232 (0.0007) [2023-10-08 03:30:54,182][52060] Updated weights for policy 0, policy_version 88120 (0.0007) [2023-10-08 03:30:54,507][52059] Updated weights for policy 1, policy_version 89242 (0.0008) [2023-10-08 03:30:56,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 181633024. Throughput: 0: 1680.3, 1: 1736.2. Samples: 45412978. Policy #0 lag: (min: 24.0, avg: 50.7, max: 56.0) [2023-10-08 03:30:56,211][50642] Avg episode reward: [(0, '19.820'), (1, '24.160')] [2023-10-08 03:30:58,118][52060] Updated weights for policy 0, policy_version 88130 (0.0009) [2023-10-08 03:30:58,352][52059] Updated weights for policy 1, policy_version 89252 (0.0008) [2023-10-08 03:30:58,480][52060] Updated weights for policy 0, policy_version 88140 (0.0008) [2023-10-08 03:30:58,713][52059] Updated weights for policy 1, policy_version 89262 (0.0009) [2023-10-08 03:30:58,854][52060] Updated weights for policy 0, policy_version 88150 (0.0008) [2023-10-08 03:30:59,081][52059] Updated weights for policy 1, policy_version 89272 (0.0008) [2023-10-08 03:30:59,220][52060] Updated weights for policy 0, policy_version 88160 (0.0008) [2023-10-08 03:31:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 181698560. Throughput: 0: 1711.2, 1: 1733.4. Samples: 45434362. Policy #0 lag: (min: 24.0, avg: 50.7, max: 56.0) [2023-10-08 03:31:01,211][50642] Avg episode reward: [(0, '19.910'), (1, '25.650')] [2023-10-08 03:31:03,094][52060] Updated weights for policy 0, policy_version 88170 (0.0010) [2023-10-08 03:31:03,098][52059] Updated weights for policy 1, policy_version 89282 (0.0009) [2023-10-08 03:31:03,452][52059] Updated weights for policy 1, policy_version 89292 (0.0007) [2023-10-08 03:31:03,468][52060] Updated weights for policy 0, policy_version 88180 (0.0009) [2023-10-08 03:31:03,816][52059] Updated weights for policy 1, policy_version 89302 (0.0007) [2023-10-08 03:31:03,829][52060] Updated weights for policy 0, policy_version 88190 (0.0008) [2023-10-08 03:31:04,179][52059] Updated weights for policy 1, policy_version 89312 (0.0010) [2023-10-08 03:31:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 181764096. Throughput: 0: 1689.8, 1: 1741.5. Samples: 45444274. Policy #0 lag: (min: 24.0, avg: 50.7, max: 56.0) [2023-10-08 03:31:06,211][50642] Avg episode reward: [(0, '20.320'), (1, '27.010')] [2023-10-08 03:31:07,921][52060] Updated weights for policy 0, policy_version 88200 (0.0008) [2023-10-08 03:31:08,157][52059] Updated weights for policy 1, policy_version 89322 (0.0007) [2023-10-08 03:31:08,285][52060] Updated weights for policy 0, policy_version 88210 (0.0008) [2023-10-08 03:31:08,512][52059] Updated weights for policy 1, policy_version 89332 (0.0009) [2023-10-08 03:31:08,652][52060] Updated weights for policy 0, policy_version 88220 (0.0009) [2023-10-08 03:31:08,878][52059] Updated weights for policy 1, policy_version 89342 (0.0009) [2023-10-08 03:31:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 181829632. Throughput: 0: 1699.8, 1: 1731.5. Samples: 45465358. Policy #0 lag: (min: 24.0, avg: 50.7, max: 56.0) [2023-10-08 03:31:11,211][50642] Avg episode reward: [(0, '19.720'), (1, '22.820')] [2023-10-08 03:31:12,622][52060] Updated weights for policy 0, policy_version 88230 (0.0010) [2023-10-08 03:31:12,866][52059] Updated weights for policy 1, policy_version 89352 (0.0010) [2023-10-08 03:31:12,991][52060] Updated weights for policy 0, policy_version 88240 (0.0008) [2023-10-08 03:31:13,241][52059] Updated weights for policy 1, policy_version 89362 (0.0007) [2023-10-08 03:31:13,353][52060] Updated weights for policy 0, policy_version 88250 (0.0008) [2023-10-08 03:31:13,607][52059] Updated weights for policy 1, policy_version 89372 (0.0008) [2023-10-08 03:31:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 181895168. Throughput: 0: 1723.8, 1: 1738.3. Samples: 45486392. Policy #0 lag: (min: 24.0, avg: 50.7, max: 56.0) [2023-10-08 03:31:16,211][50642] Avg episode reward: [(0, '18.930'), (1, '24.290')] [2023-10-08 03:31:17,517][52060] Updated weights for policy 0, policy_version 88260 (0.0008) [2023-10-08 03:31:17,568][52059] Updated weights for policy 1, policy_version 89382 (0.0008) [2023-10-08 03:31:17,900][52060] Updated weights for policy 0, policy_version 88270 (0.0008) [2023-10-08 03:31:17,941][52059] Updated weights for policy 1, policy_version 89392 (0.0007) [2023-10-08 03:31:18,262][52060] Updated weights for policy 0, policy_version 88280 (0.0009) [2023-10-08 03:31:18,300][52059] Updated weights for policy 1, policy_version 89402 (0.0010) [2023-10-08 03:31:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 181960704. Throughput: 0: 1689.0, 1: 1730.7. Samples: 45495590. Policy #0 lag: (min: 24.0, avg: 50.7, max: 56.0) [2023-10-08 03:31:21,211][50642] Avg episode reward: [(0, '21.870'), (1, '24.220')] [2023-10-08 03:31:22,150][52059] Updated weights for policy 1, policy_version 89412 (0.0007) [2023-10-08 03:31:22,217][52060] Updated weights for policy 0, policy_version 88290 (0.0007) [2023-10-08 03:31:22,509][52059] Updated weights for policy 1, policy_version 89422 (0.0008) [2023-10-08 03:31:22,586][52060] Updated weights for policy 0, policy_version 88300 (0.0008) [2023-10-08 03:31:22,876][52059] Updated weights for policy 1, policy_version 89432 (0.0007) [2023-10-08 03:31:22,950][52060] Updated weights for policy 0, policy_version 88310 (0.0007) [2023-10-08 03:31:23,328][52060] Updated weights for policy 0, policy_version 88320 (0.0007) [2023-10-08 03:31:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 182026240. Throughput: 0: 1707.8, 1: 1732.8. Samples: 45517086. Policy #0 lag: (min: 24.0, avg: 50.7, max: 56.0) [2023-10-08 03:31:26,211][50642] Avg episode reward: [(0, '20.860'), (1, '28.390')] [2023-10-08 03:31:26,742][52059] Updated weights for policy 1, policy_version 89442 (0.0008) [2023-10-08 03:31:27,091][52059] Updated weights for policy 1, policy_version 89452 (0.0008) [2023-10-08 03:31:27,282][52060] Updated weights for policy 0, policy_version 88330 (0.0007) [2023-10-08 03:31:27,459][52059] Updated weights for policy 1, policy_version 89462 (0.0008) [2023-10-08 03:31:27,651][52060] Updated weights for policy 0, policy_version 88340 (0.0007) [2023-10-08 03:31:27,811][52059] Updated weights for policy 1, policy_version 89472 (0.0007) [2023-10-08 03:31:28,014][52060] Updated weights for policy 0, policy_version 88350 (0.0010) [2023-10-08 03:31:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 182091776. Throughput: 0: 1720.2, 1: 1745.9. Samples: 45538240. Policy #0 lag: (min: 24.0, avg: 50.7, max: 56.0) [2023-10-08 03:31:31,211][50642] Avg episode reward: [(0, '19.990'), (1, '23.770')] [2023-10-08 03:31:31,886][52059] Updated weights for policy 1, policy_version 89482 (0.0008) [2023-10-08 03:31:32,010][52060] Updated weights for policy 0, policy_version 88360 (0.0009) [2023-10-08 03:31:32,249][52059] Updated weights for policy 1, policy_version 89492 (0.0008) [2023-10-08 03:31:32,377][52060] Updated weights for policy 0, policy_version 88370 (0.0009) [2023-10-08 03:31:32,613][52059] Updated weights for policy 1, policy_version 89502 (0.0008) [2023-10-08 03:31:32,741][52060] Updated weights for policy 0, policy_version 88380 (0.0008) [2023-10-08 03:31:36,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 182157312. Throughput: 0: 1704.9, 1: 1717.2. Samples: 45547632. Policy #0 lag: (min: 24.0, avg: 50.7, max: 56.0) [2023-10-08 03:31:36,212][50642] Avg episode reward: [(0, '19.850'), (1, '23.800')] [2023-10-08 03:31:36,494][52059] Updated weights for policy 1, policy_version 89512 (0.0008) [2023-10-08 03:31:36,790][52060] Updated weights for policy 0, policy_version 88390 (0.0011) [2023-10-08 03:31:36,853][52059] Updated weights for policy 1, policy_version 89522 (0.0007) [2023-10-08 03:31:37,163][52060] Updated weights for policy 0, policy_version 88400 (0.0009) [2023-10-08 03:31:37,218][52059] Updated weights for policy 1, policy_version 89532 (0.0007) [2023-10-08 03:31:37,520][52060] Updated weights for policy 0, policy_version 88410 (0.0010) [2023-10-08 03:31:41,006][52059] Updated weights for policy 1, policy_version 89542 (0.0007) [2023-10-08 03:31:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 182222848. Throughput: 0: 1721.4, 1: 1747.5. Samples: 45569082. Policy #0 lag: (min: 24.0, avg: 50.7, max: 56.0) [2023-10-08 03:31:41,211][50642] Avg episode reward: [(0, '21.170'), (1, '24.400')] [2023-10-08 03:31:41,374][52059] Updated weights for policy 1, policy_version 89552 (0.0009) [2023-10-08 03:31:41,594][52060] Updated weights for policy 0, policy_version 88420 (0.0010) [2023-10-08 03:31:41,745][52059] Updated weights for policy 1, policy_version 89562 (0.0008) [2023-10-08 03:31:41,959][52060] Updated weights for policy 0, policy_version 88430 (0.0008) [2023-10-08 03:31:42,328][52060] Updated weights for policy 0, policy_version 88440 (0.0009) [2023-10-08 03:31:45,789][52059] Updated weights for policy 1, policy_version 89572 (0.0008) [2023-10-08 03:31:46,152][52059] Updated weights for policy 1, policy_version 89582 (0.0007) [2023-10-08 03:31:46,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 182288384. Throughput: 0: 1720.2, 1: 1746.1. Samples: 45590348. Policy #0 lag: (min: 24.0, avg: 50.7, max: 56.0) [2023-10-08 03:31:46,211][50642] Avg episode reward: [(0, '22.310'), (1, '25.900')] [2023-10-08 03:31:46,231][52060] Updated weights for policy 0, policy_version 88450 (0.0008) [2023-10-08 03:31:46,511][52059] Updated weights for policy 1, policy_version 89592 (0.0007) [2023-10-08 03:31:46,601][52060] Updated weights for policy 0, policy_version 88460 (0.0008) [2023-10-08 03:31:46,969][52060] Updated weights for policy 0, policy_version 88470 (0.0008) [2023-10-08 03:31:47,338][52060] Updated weights for policy 0, policy_version 88480 (0.0008) [2023-10-08 03:31:50,249][52059] Updated weights for policy 1, policy_version 89602 (0.0007) [2023-10-08 03:31:50,611][52059] Updated weights for policy 1, policy_version 89612 (0.0007) [2023-10-08 03:31:50,979][52059] Updated weights for policy 1, policy_version 89622 (0.0007) [2023-10-08 03:31:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 182353920. Throughput: 0: 1715.5, 1: 1746.1. Samples: 45600048. Policy #0 lag: (min: 24.0, avg: 50.7, max: 56.0) [2023-10-08 03:31:51,211][50642] Avg episode reward: [(0, '20.420'), (1, '27.010')] [2023-10-08 03:31:51,348][52059] Updated weights for policy 1, policy_version 89632 (0.0008) [2023-10-08 03:31:51,369][52060] Updated weights for policy 0, policy_version 88490 (0.0007) [2023-10-08 03:31:51,744][52060] Updated weights for policy 0, policy_version 88500 (0.0007) [2023-10-08 03:31:52,122][52060] Updated weights for policy 0, policy_version 88510 (0.0010) [2023-10-08 03:31:55,238][52059] Updated weights for policy 1, policy_version 89642 (0.0008) [2023-10-08 03:31:55,593][52059] Updated weights for policy 1, policy_version 89652 (0.0012) [2023-10-08 03:31:55,962][52059] Updated weights for policy 1, policy_version 89662 (0.0008) [2023-10-08 03:31:56,022][52060] Updated weights for policy 0, policy_version 88520 (0.0008) [2023-10-08 03:31:56,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 182452224. Throughput: 0: 1714.8, 1: 1759.3. Samples: 45621694. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:31:56,211][50642] Avg episode reward: [(0, '21.490'), (1, '23.560')] [2023-10-08 03:31:56,387][52060] Updated weights for policy 0, policy_version 88530 (0.0011) [2023-10-08 03:31:56,766][52060] Updated weights for policy 0, policy_version 88540 (0.0009) [2023-10-08 03:32:00,061][52059] Updated weights for policy 1, policy_version 89672 (0.0008) [2023-10-08 03:32:00,439][52059] Updated weights for policy 1, policy_version 89682 (0.0010) [2023-10-08 03:32:00,790][52060] Updated weights for policy 0, policy_version 88550 (0.0008) [2023-10-08 03:32:00,804][52059] Updated weights for policy 1, policy_version 89692 (0.0008) [2023-10-08 03:32:01,162][52060] Updated weights for policy 0, policy_version 88560 (0.0008) [2023-10-08 03:32:01,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 182517760. Throughput: 0: 1705.3, 1: 1733.2. Samples: 45641126. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:32:01,211][50642] Avg episode reward: [(0, '22.030'), (1, '22.800')] [2023-10-08 03:32:01,537][52060] Updated weights for policy 0, policy_version 88570 (0.0009) [2023-10-08 03:32:04,692][52059] Updated weights for policy 1, policy_version 89702 (0.0007) [2023-10-08 03:32:05,056][52059] Updated weights for policy 1, policy_version 89712 (0.0008) [2023-10-08 03:32:05,424][52059] Updated weights for policy 1, policy_version 89722 (0.0008) [2023-10-08 03:32:05,577][52060] Updated weights for policy 0, policy_version 88580 (0.0007) [2023-10-08 03:32:05,964][52060] Updated weights for policy 0, policy_version 88590 (0.0008) [2023-10-08 03:32:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 182583296. Throughput: 0: 1715.4, 1: 1760.5. Samples: 45652006. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:32:06,211][50642] Avg episode reward: [(0, '18.720'), (1, '24.610')] [2023-10-08 03:32:06,338][52060] Updated weights for policy 0, policy_version 88600 (0.0007) [2023-10-08 03:32:09,123][52059] Updated weights for policy 1, policy_version 89732 (0.0009) [2023-10-08 03:32:09,489][52059] Updated weights for policy 1, policy_version 89742 (0.0007) [2023-10-08 03:32:09,857][52059] Updated weights for policy 1, policy_version 89752 (0.0007) [2023-10-08 03:32:10,283][52060] Updated weights for policy 0, policy_version 88610 (0.0010) [2023-10-08 03:32:10,648][52060] Updated weights for policy 0, policy_version 88620 (0.0009) [2023-10-08 03:32:11,008][52060] Updated weights for policy 0, policy_version 88630 (0.0008) [2023-10-08 03:32:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 182648832. Throughput: 0: 1713.6, 1: 1739.9. Samples: 45672492. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:32:11,211][50642] Avg episode reward: [(0, '21.110'), (1, '28.000')] [2023-10-08 03:32:11,386][52060] Updated weights for policy 0, policy_version 88640 (0.0007) [2023-10-08 03:32:13,873][52059] Updated weights for policy 1, policy_version 89762 (0.0008) [2023-10-08 03:32:14,238][52059] Updated weights for policy 1, policy_version 89772 (0.0007) [2023-10-08 03:32:14,597][52059] Updated weights for policy 1, policy_version 89782 (0.0007) [2023-10-08 03:32:14,967][52059] Updated weights for policy 1, policy_version 89792 (0.0007) [2023-10-08 03:32:15,279][52060] Updated weights for policy 0, policy_version 88650 (0.0008) [2023-10-08 03:32:15,642][52060] Updated weights for policy 0, policy_version 88660 (0.0011) [2023-10-08 03:32:16,009][52060] Updated weights for policy 0, policy_version 88670 (0.0010) [2023-10-08 03:32:16,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 182747136. Throughput: 0: 1690.6, 1: 1742.0. Samples: 45692710. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:32:16,211][50642] Avg episode reward: [(0, '20.930'), (1, '26.850')] [2023-10-08 03:32:18,674][52059] Updated weights for policy 1, policy_version 89802 (0.0011) [2023-10-08 03:32:19,027][52059] Updated weights for policy 1, policy_version 89812 (0.0009) [2023-10-08 03:32:19,392][52059] Updated weights for policy 1, policy_version 89822 (0.0010) [2023-10-08 03:32:19,966][52060] Updated weights for policy 0, policy_version 88680 (0.0008) [2023-10-08 03:32:20,336][52060] Updated weights for policy 0, policy_version 88690 (0.0009) [2023-10-08 03:32:20,710][52060] Updated weights for policy 0, policy_version 88700 (0.0008) [2023-10-08 03:32:21,210][50642] Fps is (10 sec: 16383.6, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 182812672. Throughput: 0: 1714.7, 1: 1762.8. Samples: 45704118. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:32:21,211][50642] Avg episode reward: [(0, '21.200'), (1, '22.270')] [2023-10-08 03:32:23,384][52059] Updated weights for policy 1, policy_version 89832 (0.0008) [2023-10-08 03:32:23,745][52059] Updated weights for policy 1, policy_version 89842 (0.0010) [2023-10-08 03:32:24,112][52059] Updated weights for policy 1, policy_version 89852 (0.0010) [2023-10-08 03:32:24,620][52060] Updated weights for policy 0, policy_version 88710 (0.0009) [2023-10-08 03:32:24,995][52060] Updated weights for policy 0, policy_version 88720 (0.0009) [2023-10-08 03:32:25,351][52060] Updated weights for policy 0, policy_version 88730 (0.0008) [2023-10-08 03:32:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 182878208. Throughput: 0: 1707.8, 1: 1739.2. Samples: 45724198. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:32:26,211][50642] Avg episode reward: [(0, '17.850'), (1, '24.030')] [2023-10-08 03:32:27,956][52059] Updated weights for policy 1, policy_version 89862 (0.0007) [2023-10-08 03:32:28,316][52059] Updated weights for policy 1, policy_version 89872 (0.0008) [2023-10-08 03:32:28,689][52059] Updated weights for policy 1, policy_version 89882 (0.0009) [2023-10-08 03:32:29,246][52060] Updated weights for policy 0, policy_version 88740 (0.0009) [2023-10-08 03:32:29,612][52060] Updated weights for policy 0, policy_version 88750 (0.0008) [2023-10-08 03:32:29,988][52060] Updated weights for policy 0, policy_version 88760 (0.0009) [2023-10-08 03:32:31,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 182943744. Throughput: 0: 1688.4, 1: 1741.2. Samples: 45744678. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:32:31,211][50642] Avg episode reward: [(0, '19.310'), (1, '25.050')] [2023-10-08 03:32:32,744][52059] Updated weights for policy 1, policy_version 89892 (0.0009) [2023-10-08 03:32:33,107][52059] Updated weights for policy 1, policy_version 89902 (0.0008) [2023-10-08 03:32:33,466][52059] Updated weights for policy 1, policy_version 89912 (0.0009) [2023-10-08 03:32:34,126][52060] Updated weights for policy 0, policy_version 88770 (0.0009) [2023-10-08 03:32:34,488][52060] Updated weights for policy 0, policy_version 88780 (0.0010) [2023-10-08 03:32:34,862][52060] Updated weights for policy 0, policy_version 88790 (0.0011) [2023-10-08 03:32:35,229][52060] Updated weights for policy 0, policy_version 88800 (0.0009) [2023-10-08 03:32:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 183009280. Throughput: 0: 1718.3, 1: 1730.3. Samples: 45755232. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:32:36,211][50642] Avg episode reward: [(0, '22.650'), (1, '25.210')] [2023-10-08 03:32:37,428][52059] Updated weights for policy 1, policy_version 89922 (0.0008) [2023-10-08 03:32:37,788][52059] Updated weights for policy 1, policy_version 89932 (0.0009) [2023-10-08 03:32:38,150][52059] Updated weights for policy 1, policy_version 89942 (0.0010) [2023-10-08 03:32:38,511][52059] Updated weights for policy 1, policy_version 89952 (0.0010) [2023-10-08 03:32:39,294][52060] Updated weights for policy 0, policy_version 88810 (0.0008) [2023-10-08 03:32:39,660][52060] Updated weights for policy 0, policy_version 88820 (0.0008) [2023-10-08 03:32:40,038][52060] Updated weights for policy 0, policy_version 88830 (0.0007) [2023-10-08 03:32:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 183074816. Throughput: 0: 1699.3, 1: 1720.4. Samples: 45775582. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:32:41,211][50642] Avg episode reward: [(0, '22.130'), (1, '22.890')] [2023-10-08 03:32:42,486][52059] Updated weights for policy 1, policy_version 89962 (0.0007) [2023-10-08 03:32:42,851][52059] Updated weights for policy 1, policy_version 89972 (0.0007) [2023-10-08 03:32:43,214][52059] Updated weights for policy 1, policy_version 89982 (0.0008) [2023-10-08 03:32:43,826][52060] Updated weights for policy 0, policy_version 88840 (0.0008) [2023-10-08 03:32:44,194][52060] Updated weights for policy 0, policy_version 88850 (0.0008) [2023-10-08 03:32:44,563][52060] Updated weights for policy 0, policy_version 88860 (0.0009) [2023-10-08 03:32:46,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 183140352. Throughput: 0: 1700.0, 1: 1752.2. Samples: 45796476. Policy #0 lag: (min: 31.0, avg: 39.0, max: 63.0) [2023-10-08 03:32:46,211][50642] Avg episode reward: [(0, '20.010'), (1, '21.470')] [2023-10-08 03:32:46,223][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000088864_90996736.pth... [2023-10-08 03:32:46,223][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000089984_92143616.pth... [2023-10-08 03:32:46,257][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000088352_90472448.pth [2023-10-08 03:32:46,264][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000087264_89358336.pth [2023-10-08 03:32:47,162][52059] Updated weights for policy 1, policy_version 89992 (0.0010) [2023-10-08 03:32:47,546][52059] Updated weights for policy 1, policy_version 90002 (0.0007) [2023-10-08 03:32:47,901][52059] Updated weights for policy 1, policy_version 90012 (0.0008) [2023-10-08 03:32:48,609][52060] Updated weights for policy 0, policy_version 88870 (0.0009) [2023-10-08 03:32:48,977][52060] Updated weights for policy 0, policy_version 88880 (0.0010) [2023-10-08 03:32:49,349][52060] Updated weights for policy 0, policy_version 88890 (0.0008) [2023-10-08 03:32:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 183205888. Throughput: 0: 1713.9, 1: 1725.1. Samples: 45806762. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:32:51,211][50642] Avg episode reward: [(0, '18.530'), (1, '22.530')] [2023-10-08 03:32:51,651][52059] Updated weights for policy 1, policy_version 90022 (0.0008) [2023-10-08 03:32:52,020][52059] Updated weights for policy 1, policy_version 90032 (0.0008) [2023-10-08 03:32:52,383][52059] Updated weights for policy 1, policy_version 90042 (0.0009) [2023-10-08 03:32:53,277][52060] Updated weights for policy 0, policy_version 88900 (0.0007) [2023-10-08 03:32:53,641][52060] Updated weights for policy 0, policy_version 88910 (0.0007) [2023-10-08 03:32:54,012][52060] Updated weights for policy 0, policy_version 88920 (0.0009) [2023-10-08 03:32:56,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 183271424. Throughput: 0: 1698.6, 1: 1745.2. Samples: 45827466. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:32:56,211][50642] Avg episode reward: [(0, '21.310'), (1, '17.960')] [2023-10-08 03:32:56,492][52059] Updated weights for policy 1, policy_version 90052 (0.0008) [2023-10-08 03:32:56,864][52059] Updated weights for policy 1, policy_version 90062 (0.0007) [2023-10-08 03:32:57,235][52059] Updated weights for policy 1, policy_version 90072 (0.0009) [2023-10-08 03:32:57,991][52060] Updated weights for policy 0, policy_version 88930 (0.0010) [2023-10-08 03:32:58,360][52060] Updated weights for policy 0, policy_version 88940 (0.0011) [2023-10-08 03:32:58,729][52060] Updated weights for policy 0, policy_version 88950 (0.0007) [2023-10-08 03:32:59,085][52060] Updated weights for policy 0, policy_version 88960 (0.0010) [2023-10-08 03:33:00,991][52059] Updated weights for policy 1, policy_version 90082 (0.0009) [2023-10-08 03:33:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 183336960. Throughput: 0: 1721.1, 1: 1750.4. Samples: 45848926. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:33:01,211][50642] Avg episode reward: [(0, '20.930'), (1, '17.290')] [2023-10-08 03:33:01,367][52059] Updated weights for policy 1, policy_version 90092 (0.0010) [2023-10-08 03:33:01,717][52059] Updated weights for policy 1, policy_version 90102 (0.0011) [2023-10-08 03:33:02,081][52059] Updated weights for policy 1, policy_version 90112 (0.0010) [2023-10-08 03:33:03,168][52060] Updated weights for policy 0, policy_version 88970 (0.0010) [2023-10-08 03:33:03,531][52060] Updated weights for policy 0, policy_version 88980 (0.0009) [2023-10-08 03:33:03,898][52060] Updated weights for policy 0, policy_version 88990 (0.0009) [2023-10-08 03:33:06,140][52059] Updated weights for policy 1, policy_version 90122 (0.0007) [2023-10-08 03:33:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 183402496. Throughput: 0: 1698.5, 1: 1729.2. Samples: 45858368. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:33:06,211][50642] Avg episode reward: [(0, '19.590'), (1, '16.340')] [2023-10-08 03:33:06,498][52059] Updated weights for policy 1, policy_version 90132 (0.0009) [2023-10-08 03:33:06,870][52059] Updated weights for policy 1, policy_version 90142 (0.0008) [2023-10-08 03:33:07,819][52060] Updated weights for policy 0, policy_version 89000 (0.0009) [2023-10-08 03:33:08,191][52060] Updated weights for policy 0, policy_version 89010 (0.0008) [2023-10-08 03:33:08,561][52060] Updated weights for policy 0, policy_version 89020 (0.0008) [2023-10-08 03:33:10,663][52059] Updated weights for policy 1, policy_version 90152 (0.0008) [2023-10-08 03:33:11,036][52059] Updated weights for policy 1, policy_version 90162 (0.0007) [2023-10-08 03:33:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 183468032. Throughput: 0: 1703.6, 1: 1747.2. Samples: 45879482. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:33:11,211][50642] Avg episode reward: [(0, '19.520'), (1, '18.390')] [2023-10-08 03:33:11,401][52059] Updated weights for policy 1, policy_version 90172 (0.0008) [2023-10-08 03:33:12,588][52060] Updated weights for policy 0, policy_version 89030 (0.0007) [2023-10-08 03:33:12,958][52060] Updated weights for policy 0, policy_version 89040 (0.0007) [2023-10-08 03:33:13,338][52060] Updated weights for policy 0, policy_version 89050 (0.0009) [2023-10-08 03:33:15,244][52059] Updated weights for policy 1, policy_version 90182 (0.0008) [2023-10-08 03:33:15,608][52059] Updated weights for policy 1, policy_version 90192 (0.0008) [2023-10-08 03:33:15,965][52059] Updated weights for policy 1, policy_version 90202 (0.0007) [2023-10-08 03:33:16,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 183566336. Throughput: 0: 1724.8, 1: 1729.9. Samples: 45900142. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:33:16,211][50642] Avg episode reward: [(0, '20.340'), (1, '21.560')] [2023-10-08 03:33:17,198][52060] Updated weights for policy 0, policy_version 89060 (0.0007) [2023-10-08 03:33:17,569][52060] Updated weights for policy 0, policy_version 89070 (0.0007) [2023-10-08 03:33:17,929][52060] Updated weights for policy 0, policy_version 89080 (0.0007) [2023-10-08 03:33:19,886][52059] Updated weights for policy 1, policy_version 90212 (0.0007) [2023-10-08 03:33:20,256][52059] Updated weights for policy 1, policy_version 90222 (0.0009) [2023-10-08 03:33:20,611][52059] Updated weights for policy 1, policy_version 90232 (0.0009) [2023-10-08 03:33:21,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.4, 300 sec: 13884.7). Total num frames: 183631872. Throughput: 0: 1697.3, 1: 1753.6. Samples: 45910522. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:33:21,211][50642] Avg episode reward: [(0, '20.080'), (1, '20.930')] [2023-10-08 03:33:21,818][52060] Updated weights for policy 0, policy_version 89090 (0.0009) [2023-10-08 03:33:22,193][52060] Updated weights for policy 0, policy_version 89100 (0.0008) [2023-10-08 03:33:22,563][52060] Updated weights for policy 0, policy_version 89110 (0.0007) [2023-10-08 03:33:22,934][52060] Updated weights for policy 0, policy_version 89120 (0.0007) [2023-10-08 03:33:24,490][52059] Updated weights for policy 1, policy_version 90242 (0.0008) [2023-10-08 03:33:24,865][52059] Updated weights for policy 1, policy_version 90252 (0.0011) [2023-10-08 03:33:25,226][52059] Updated weights for policy 1, policy_version 90262 (0.0010) [2023-10-08 03:33:25,586][52059] Updated weights for policy 1, policy_version 90272 (0.0008) [2023-10-08 03:33:26,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13884.8). Total num frames: 183697408. Throughput: 0: 1720.1, 1: 1743.9. Samples: 45931464. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:33:26,211][50642] Avg episode reward: [(0, '21.170'), (1, '20.600')] [2023-10-08 03:33:26,906][52060] Updated weights for policy 0, policy_version 89130 (0.0009) [2023-10-08 03:33:27,281][52060] Updated weights for policy 0, policy_version 89140 (0.0008) [2023-10-08 03:33:27,645][52060] Updated weights for policy 0, policy_version 89150 (0.0007) [2023-10-08 03:33:29,563][52059] Updated weights for policy 1, policy_version 90282 (0.0010) [2023-10-08 03:33:29,927][52059] Updated weights for policy 1, policy_version 90292 (0.0007) [2023-10-08 03:33:30,299][52059] Updated weights for policy 1, policy_version 90302 (0.0010) [2023-10-08 03:33:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 183762944. Throughput: 0: 1728.5, 1: 1726.7. Samples: 45951956. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:33:31,211][50642] Avg episode reward: [(0, '19.450'), (1, '22.210')] [2023-10-08 03:33:31,542][52060] Updated weights for policy 0, policy_version 89160 (0.0008) [2023-10-08 03:33:31,916][52060] Updated weights for policy 0, policy_version 89170 (0.0008) [2023-10-08 03:33:32,278][52060] Updated weights for policy 0, policy_version 89180 (0.0009) [2023-10-08 03:33:34,329][52059] Updated weights for policy 1, policy_version 90312 (0.0007) [2023-10-08 03:33:34,688][52059] Updated weights for policy 1, policy_version 90322 (0.0007) [2023-10-08 03:33:35,061][52059] Updated weights for policy 1, policy_version 90332 (0.0009) [2023-10-08 03:33:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 183828480. Throughput: 0: 1705.9, 1: 1752.0. Samples: 45962368. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:33:36,211][50642] Avg episode reward: [(0, '22.340'), (1, '26.010')] [2023-10-08 03:33:36,399][52060] Updated weights for policy 0, policy_version 89190 (0.0009) [2023-10-08 03:33:36,764][52060] Updated weights for policy 0, policy_version 89200 (0.0008) [2023-10-08 03:33:37,144][52060] Updated weights for policy 0, policy_version 89210 (0.0009) [2023-10-08 03:33:38,791][52059] Updated weights for policy 1, policy_version 90342 (0.0009) [2023-10-08 03:33:39,150][52059] Updated weights for policy 1, policy_version 90352 (0.0010) [2023-10-08 03:33:39,513][52059] Updated weights for policy 1, policy_version 90362 (0.0009) [2023-10-08 03:33:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 183894016. Throughput: 0: 1721.7, 1: 1725.0. Samples: 45982566. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:33:41,211][50642] Avg episode reward: [(0, '20.650'), (1, '23.590')] [2023-10-08 03:33:41,242][52060] Updated weights for policy 0, policy_version 89220 (0.0011) [2023-10-08 03:33:41,612][52060] Updated weights for policy 0, policy_version 89230 (0.0008) [2023-10-08 03:33:41,975][52060] Updated weights for policy 0, policy_version 89240 (0.0008) [2023-10-08 03:33:43,573][52059] Updated weights for policy 1, policy_version 90372 (0.0008) [2023-10-08 03:33:43,932][52059] Updated weights for policy 1, policy_version 90382 (0.0008) [2023-10-08 03:33:44,298][52059] Updated weights for policy 1, policy_version 90392 (0.0008) [2023-10-08 03:33:45,887][52060] Updated weights for policy 0, policy_version 89250 (0.0007) [2023-10-08 03:33:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 183959552. Throughput: 0: 1725.5, 1: 1719.8. Samples: 46003962. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:33:46,211][50642] Avg episode reward: [(0, '20.270'), (1, '24.730')] [2023-10-08 03:33:46,298][52060] Updated weights for policy 0, policy_version 89260 (0.0008) [2023-10-08 03:33:46,672][52060] Updated weights for policy 0, policy_version 89270 (0.0007) [2023-10-08 03:33:47,045][52060] Updated weights for policy 0, policy_version 89280 (0.0008) [2023-10-08 03:33:48,313][52059] Updated weights for policy 1, policy_version 90402 (0.0008) [2023-10-08 03:33:48,678][52059] Updated weights for policy 1, policy_version 90412 (0.0009) [2023-10-08 03:33:49,046][52059] Updated weights for policy 1, policy_version 90422 (0.0010) [2023-10-08 03:33:49,407][52059] Updated weights for policy 1, policy_version 90432 (0.0009) [2023-10-08 03:33:51,042][52060] Updated weights for policy 0, policy_version 89290 (0.0009) [2023-10-08 03:33:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 184025088. Throughput: 0: 1719.3, 1: 1732.1. Samples: 46013680. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:33:51,211][50642] Avg episode reward: [(0, '19.140'), (1, '24.400')] [2023-10-08 03:33:51,420][52060] Updated weights for policy 0, policy_version 89300 (0.0008) [2023-10-08 03:33:51,792][52060] Updated weights for policy 0, policy_version 89310 (0.0009) [2023-10-08 03:33:53,367][52059] Updated weights for policy 1, policy_version 90442 (0.0009) [2023-10-08 03:33:53,734][52059] Updated weights for policy 1, policy_version 90452 (0.0009) [2023-10-08 03:33:54,100][52059] Updated weights for policy 1, policy_version 90462 (0.0007) [2023-10-08 03:33:55,749][52060] Updated weights for policy 0, policy_version 89320 (0.0008) [2023-10-08 03:33:56,107][52060] Updated weights for policy 0, policy_version 89330 (0.0007) [2023-10-08 03:33:56,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 184090624. Throughput: 0: 1729.0, 1: 1717.9. Samples: 46034592. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:33:56,211][50642] Avg episode reward: [(0, '22.360'), (1, '26.900')] [2023-10-08 03:33:56,472][52060] Updated weights for policy 0, policy_version 89340 (0.0007) [2023-10-08 03:33:57,868][52059] Updated weights for policy 1, policy_version 90472 (0.0010) [2023-10-08 03:33:58,229][52059] Updated weights for policy 1, policy_version 90482 (0.0007) [2023-10-08 03:33:58,594][52059] Updated weights for policy 1, policy_version 90492 (0.0008) [2023-10-08 03:34:00,262][52060] Updated weights for policy 0, policy_version 89350 (0.0008) [2023-10-08 03:34:00,622][52060] Updated weights for policy 0, policy_version 89360 (0.0011) [2023-10-08 03:34:00,990][52060] Updated weights for policy 0, policy_version 89370 (0.0008) [2023-10-08 03:34:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 184156160. Throughput: 0: 1709.0, 1: 1739.8. Samples: 46055340. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:34:01,211][50642] Avg episode reward: [(0, '20.040'), (1, '25.920')] [2023-10-08 03:34:02,535][52059] Updated weights for policy 1, policy_version 90502 (0.0008) [2023-10-08 03:34:02,900][52059] Updated weights for policy 1, policy_version 90512 (0.0007) [2023-10-08 03:34:03,253][52059] Updated weights for policy 1, policy_version 90522 (0.0007) [2023-10-08 03:34:05,010][52060] Updated weights for policy 0, policy_version 89380 (0.0008) [2023-10-08 03:34:05,388][52060] Updated weights for policy 0, policy_version 89390 (0.0010) [2023-10-08 03:34:05,747][52060] Updated weights for policy 0, policy_version 89400 (0.0009) [2023-10-08 03:34:06,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 184254464. Throughput: 0: 1727.1, 1: 1715.2. Samples: 46065428. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:34:06,211][50642] Avg episode reward: [(0, '19.490'), (1, '23.100')] [2023-10-08 03:34:07,224][52059] Updated weights for policy 1, policy_version 90532 (0.0008) [2023-10-08 03:34:07,584][52059] Updated weights for policy 1, policy_version 90542 (0.0009) [2023-10-08 03:34:07,946][52059] Updated weights for policy 1, policy_version 90552 (0.0009) [2023-10-08 03:34:09,713][52060] Updated weights for policy 0, policy_version 89410 (0.0008) [2023-10-08 03:34:10,082][52060] Updated weights for policy 0, policy_version 89420 (0.0009) [2023-10-08 03:34:10,455][52060] Updated weights for policy 0, policy_version 89430 (0.0009) [2023-10-08 03:34:10,821][52060] Updated weights for policy 0, policy_version 89440 (0.0009) [2023-10-08 03:34:11,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 184320000. Throughput: 0: 1719.3, 1: 1729.9. Samples: 46086676. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:34:11,211][50642] Avg episode reward: [(0, '20.340'), (1, '24.920')] [2023-10-08 03:34:11,810][52059] Updated weights for policy 1, policy_version 90562 (0.0008) [2023-10-08 03:34:12,176][52059] Updated weights for policy 1, policy_version 90572 (0.0009) [2023-10-08 03:34:12,533][52059] Updated weights for policy 1, policy_version 90582 (0.0007) [2023-10-08 03:34:12,897][52059] Updated weights for policy 1, policy_version 90592 (0.0009) [2023-10-08 03:34:14,683][52060] Updated weights for policy 0, policy_version 89450 (0.0007) [2023-10-08 03:34:15,037][52060] Updated weights for policy 0, policy_version 89460 (0.0008) [2023-10-08 03:34:15,406][52060] Updated weights for policy 0, policy_version 89470 (0.0009) [2023-10-08 03:34:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 184385536. Throughput: 0: 1697.5, 1: 1746.7. Samples: 46106946. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:34:16,211][50642] Avg episode reward: [(0, '22.220'), (1, '25.730')] [2023-10-08 03:34:16,873][52059] Updated weights for policy 1, policy_version 90602 (0.0007) [2023-10-08 03:34:17,230][52059] Updated weights for policy 1, policy_version 90612 (0.0007) [2023-10-08 03:34:17,600][52059] Updated weights for policy 1, policy_version 90622 (0.0007) [2023-10-08 03:34:19,506][52060] Updated weights for policy 0, policy_version 89480 (0.0009) [2023-10-08 03:34:19,883][52060] Updated weights for policy 0, policy_version 89490 (0.0009) [2023-10-08 03:34:20,254][52060] Updated weights for policy 0, policy_version 89500 (0.0009) [2023-10-08 03:34:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 184451072. Throughput: 0: 1729.2, 1: 1722.1. Samples: 46117676. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:34:21,211][50642] Avg episode reward: [(0, '19.530'), (1, '24.130')] [2023-10-08 03:34:21,410][52059] Updated weights for policy 1, policy_version 90632 (0.0010) [2023-10-08 03:34:21,793][52059] Updated weights for policy 1, policy_version 90642 (0.0009) [2023-10-08 03:34:22,161][52059] Updated weights for policy 1, policy_version 90652 (0.0010) [2023-10-08 03:34:24,234][52060] Updated weights for policy 0, policy_version 89510 (0.0010) [2023-10-08 03:34:24,600][52060] Updated weights for policy 0, policy_version 89520 (0.0011) [2023-10-08 03:34:24,977][52060] Updated weights for policy 0, policy_version 89530 (0.0010) [2023-10-08 03:34:26,064][52059] Updated weights for policy 1, policy_version 90662 (0.0007) [2023-10-08 03:34:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 184516608. Throughput: 0: 1708.9, 1: 1746.5. Samples: 46138060. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:34:26,211][50642] Avg episode reward: [(0, '19.510'), (1, '24.660')] [2023-10-08 03:34:26,423][52059] Updated weights for policy 1, policy_version 90672 (0.0007) [2023-10-08 03:34:26,784][52059] Updated weights for policy 1, policy_version 90682 (0.0007) [2023-10-08 03:34:28,977][52060] Updated weights for policy 0, policy_version 89540 (0.0008) [2023-10-08 03:34:29,339][52060] Updated weights for policy 0, policy_version 89550 (0.0008) [2023-10-08 03:34:29,703][52060] Updated weights for policy 0, policy_version 89560 (0.0009) [2023-10-08 03:34:30,761][52059] Updated weights for policy 1, policy_version 90692 (0.0008) [2023-10-08 03:34:31,116][52059] Updated weights for policy 1, policy_version 90702 (0.0009) [2023-10-08 03:34:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 184582144. Throughput: 0: 1695.4, 1: 1745.7. Samples: 46158814. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:34:31,211][50642] Avg episode reward: [(0, '22.010'), (1, '23.930')] [2023-10-08 03:34:31,480][52059] Updated weights for policy 1, policy_version 90712 (0.0011) [2023-10-08 03:34:33,755][52060] Updated weights for policy 0, policy_version 89570 (0.0010) [2023-10-08 03:34:34,164][52060] Updated weights for policy 0, policy_version 89580 (0.0008) [2023-10-08 03:34:34,535][52060] Updated weights for policy 0, policy_version 89590 (0.0007) [2023-10-08 03:34:34,898][52060] Updated weights for policy 0, policy_version 89600 (0.0008) [2023-10-08 03:34:35,383][52059] Updated weights for policy 1, policy_version 90722 (0.0009) [2023-10-08 03:34:35,751][52059] Updated weights for policy 1, policy_version 90732 (0.0009) [2023-10-08 03:34:36,113][52059] Updated weights for policy 1, policy_version 90742 (0.0007) [2023-10-08 03:34:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 184647680. Throughput: 0: 1724.4, 1: 1739.7. Samples: 46169562. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:34:36,211][50642] Avg episode reward: [(0, '21.190'), (1, '24.450')] [2023-10-08 03:34:36,475][52059] Updated weights for policy 1, policy_version 90752 (0.0007) [2023-10-08 03:34:38,639][52060] Updated weights for policy 0, policy_version 89610 (0.0009) [2023-10-08 03:34:39,005][52060] Updated weights for policy 0, policy_version 89620 (0.0009) [2023-10-08 03:34:39,380][52060] Updated weights for policy 0, policy_version 89630 (0.0008) [2023-10-08 03:34:40,424][52059] Updated weights for policy 1, policy_version 90762 (0.0007) [2023-10-08 03:34:40,782][52059] Updated weights for policy 1, policy_version 90772 (0.0008) [2023-10-08 03:34:41,156][52059] Updated weights for policy 1, policy_version 90782 (0.0007) [2023-10-08 03:34:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 184713216. Throughput: 0: 1695.8, 1: 1756.4. Samples: 46189942. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:34:41,211][50642] Avg episode reward: [(0, '19.530'), (1, '23.890')] [2023-10-08 03:34:43,407][52060] Updated weights for policy 0, policy_version 89640 (0.0009) [2023-10-08 03:34:43,777][52060] Updated weights for policy 0, policy_version 89650 (0.0008) [2023-10-08 03:34:44,150][52060] Updated weights for policy 0, policy_version 89660 (0.0009) [2023-10-08 03:34:45,038][52059] Updated weights for policy 1, policy_version 90792 (0.0008) [2023-10-08 03:34:45,408][52059] Updated weights for policy 1, policy_version 90802 (0.0008) [2023-10-08 03:34:45,781][52059] Updated weights for policy 1, policy_version 90812 (0.0010) [2023-10-08 03:34:46,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 184811520. Throughput: 0: 1716.1, 1: 1726.4. Samples: 46210248. Policy #0 lag: (min: 18.0, avg: 24.9, max: 50.0) [2023-10-08 03:34:46,211][50642] Avg episode reward: [(0, '21.230'), (1, '24.470')] [2023-10-08 03:34:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000090816_92995584.pth... [2023-10-08 03:34:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000089664_91815936.pth... [2023-10-08 03:34:46,255][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000088064_90177536.pth [2023-10-08 03:34:46,260][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000089184_91324416.pth [2023-10-08 03:34:48,045][52060] Updated weights for policy 0, policy_version 89670 (0.0009) [2023-10-08 03:34:48,406][52060] Updated weights for policy 0, policy_version 89680 (0.0010) [2023-10-08 03:34:48,769][52060] Updated weights for policy 0, policy_version 89690 (0.0009) [2023-10-08 03:34:49,797][52059] Updated weights for policy 1, policy_version 90822 (0.0008) [2023-10-08 03:34:50,166][52059] Updated weights for policy 1, policy_version 90832 (0.0007) [2023-10-08 03:34:50,530][52059] Updated weights for policy 1, policy_version 90842 (0.0009) [2023-10-08 03:34:51,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 184877056. Throughput: 0: 1699.7, 1: 1754.9. Samples: 46220886. Policy #0 lag: (min: 18.0, avg: 24.9, max: 50.0) [2023-10-08 03:34:51,211][50642] Avg episode reward: [(0, '22.140'), (1, '23.600')] [2023-10-08 03:34:52,997][52060] Updated weights for policy 0, policy_version 89700 (0.0009) [2023-10-08 03:34:53,368][52060] Updated weights for policy 0, policy_version 89710 (0.0008) [2023-10-08 03:34:53,742][52060] Updated weights for policy 0, policy_version 89720 (0.0007) [2023-10-08 03:34:54,439][52059] Updated weights for policy 1, policy_version 90852 (0.0010) [2023-10-08 03:34:54,803][52059] Updated weights for policy 1, policy_version 90862 (0.0009) [2023-10-08 03:34:55,165][52059] Updated weights for policy 1, policy_version 90872 (0.0007) [2023-10-08 03:34:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 184942592. Throughput: 0: 1697.4, 1: 1735.5. Samples: 46241158. Policy #0 lag: (min: 18.0, avg: 24.9, max: 50.0) [2023-10-08 03:34:56,211][50642] Avg episode reward: [(0, '19.590'), (1, '24.450')] [2023-10-08 03:34:57,825][52060] Updated weights for policy 0, policy_version 89730 (0.0007) [2023-10-08 03:34:58,198][52060] Updated weights for policy 0, policy_version 89740 (0.0009) [2023-10-08 03:34:58,562][52060] Updated weights for policy 0, policy_version 89750 (0.0011) [2023-10-08 03:34:58,937][52060] Updated weights for policy 0, policy_version 89760 (0.0010) [2023-10-08 03:34:59,145][52059] Updated weights for policy 1, policy_version 90882 (0.0008) [2023-10-08 03:34:59,506][52059] Updated weights for policy 1, policy_version 90892 (0.0009) [2023-10-08 03:34:59,882][52059] Updated weights for policy 1, policy_version 90902 (0.0009) [2023-10-08 03:35:00,245][52059] Updated weights for policy 1, policy_version 90912 (0.0009) [2023-10-08 03:35:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 185008128. Throughput: 0: 1719.8, 1: 1716.3. Samples: 46261572. Policy #0 lag: (min: 18.0, avg: 24.9, max: 50.0) [2023-10-08 03:35:01,211][50642] Avg episode reward: [(0, '21.270'), (1, '25.150')] [2023-10-08 03:35:02,894][52060] Updated weights for policy 0, policy_version 89770 (0.0008) [2023-10-08 03:35:03,269][52060] Updated weights for policy 0, policy_version 89780 (0.0009) [2023-10-08 03:35:03,633][52060] Updated weights for policy 0, policy_version 89790 (0.0009) [2023-10-08 03:35:04,234][52059] Updated weights for policy 1, policy_version 90922 (0.0011) [2023-10-08 03:35:04,605][52059] Updated weights for policy 1, policy_version 90932 (0.0009) [2023-10-08 03:35:04,976][52059] Updated weights for policy 1, policy_version 90942 (0.0008) [2023-10-08 03:35:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 185073664. Throughput: 0: 1688.4, 1: 1745.3. Samples: 46272194. Policy #0 lag: (min: 18.0, avg: 24.9, max: 50.0) [2023-10-08 03:35:06,211][50642] Avg episode reward: [(0, '19.430'), (1, '25.120')] [2023-10-08 03:35:07,498][52060] Updated weights for policy 0, policy_version 89800 (0.0007) [2023-10-08 03:35:07,871][52060] Updated weights for policy 0, policy_version 89810 (0.0007) [2023-10-08 03:35:08,247][52060] Updated weights for policy 0, policy_version 89820 (0.0009) [2023-10-08 03:35:09,018][52059] Updated weights for policy 1, policy_version 90952 (0.0008) [2023-10-08 03:35:09,396][52059] Updated weights for policy 1, policy_version 90962 (0.0007) [2023-10-08 03:35:09,755][52059] Updated weights for policy 1, policy_version 90972 (0.0009) [2023-10-08 03:35:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 185139200. Throughput: 0: 1710.8, 1: 1713.5. Samples: 46292154. Policy #0 lag: (min: 18.0, avg: 24.9, max: 50.0) [2023-10-08 03:35:11,211][50642] Avg episode reward: [(0, '22.860'), (1, '26.920')] [2023-10-08 03:35:12,237][52060] Updated weights for policy 0, policy_version 89830 (0.0009) [2023-10-08 03:35:12,609][52060] Updated weights for policy 0, policy_version 89840 (0.0010) [2023-10-08 03:35:12,962][52060] Updated weights for policy 0, policy_version 89850 (0.0010) [2023-10-08 03:35:13,672][52059] Updated weights for policy 1, policy_version 90982 (0.0009) [2023-10-08 03:35:14,037][52059] Updated weights for policy 1, policy_version 90992 (0.0009) [2023-10-08 03:35:14,402][52059] Updated weights for policy 1, policy_version 91002 (0.0008) [2023-10-08 03:35:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 185204736. Throughput: 0: 1721.3, 1: 1713.4. Samples: 46313378. Policy #0 lag: (min: 18.0, avg: 24.9, max: 50.0) [2023-10-08 03:35:16,211][50642] Avg episode reward: [(0, '19.460'), (1, '23.460')] [2023-10-08 03:35:16,917][52060] Updated weights for policy 0, policy_version 89860 (0.0008) [2023-10-08 03:35:17,279][52060] Updated weights for policy 0, policy_version 89870 (0.0009) [2023-10-08 03:35:17,650][52060] Updated weights for policy 0, policy_version 89880 (0.0010) [2023-10-08 03:35:18,146][52059] Updated weights for policy 1, policy_version 91012 (0.0007) [2023-10-08 03:35:18,513][52059] Updated weights for policy 1, policy_version 91022 (0.0007) [2023-10-08 03:35:18,891][52059] Updated weights for policy 1, policy_version 91032 (0.0008) [2023-10-08 03:35:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 185270272. Throughput: 0: 1699.6, 1: 1720.7. Samples: 46323476. Policy #0 lag: (min: 18.0, avg: 24.9, max: 50.0) [2023-10-08 03:35:21,211][50642] Avg episode reward: [(0, '20.290'), (1, '25.520')] [2023-10-08 03:35:21,683][52060] Updated weights for policy 0, policy_version 89890 (0.0008) [2023-10-08 03:35:22,052][52060] Updated weights for policy 0, policy_version 89900 (0.0009) [2023-10-08 03:35:22,410][52060] Updated weights for policy 0, policy_version 89910 (0.0007) [2023-10-08 03:35:22,783][52060] Updated weights for policy 0, policy_version 89920 (0.0007) [2023-10-08 03:35:22,823][52059] Updated weights for policy 1, policy_version 91042 (0.0008) [2023-10-08 03:35:23,183][52059] Updated weights for policy 1, policy_version 91052 (0.0008) [2023-10-08 03:35:23,539][52059] Updated weights for policy 1, policy_version 91062 (0.0008) [2023-10-08 03:35:23,895][52059] Updated weights for policy 1, policy_version 91072 (0.0009) [2023-10-08 03:35:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 185335808. Throughput: 0: 1722.1, 1: 1706.0. Samples: 46344210. Policy #0 lag: (min: 18.0, avg: 24.9, max: 50.0) [2023-10-08 03:35:26,211][50642] Avg episode reward: [(0, '21.700'), (1, '24.460')] [2023-10-08 03:35:26,695][52060] Updated weights for policy 0, policy_version 89930 (0.0009) [2023-10-08 03:35:27,066][52060] Updated weights for policy 0, policy_version 89940 (0.0007) [2023-10-08 03:35:27,433][52060] Updated weights for policy 0, policy_version 89950 (0.0007) [2023-10-08 03:35:28,010][52059] Updated weights for policy 1, policy_version 91082 (0.0008) [2023-10-08 03:35:28,377][52059] Updated weights for policy 1, policy_version 91092 (0.0009) [2023-10-08 03:35:28,743][52059] Updated weights for policy 1, policy_version 91102 (0.0007) [2023-10-08 03:35:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 185401344. Throughput: 0: 1722.6, 1: 1731.9. Samples: 46365700. Policy #0 lag: (min: 18.0, avg: 24.9, max: 50.0) [2023-10-08 03:35:31,211][50642] Avg episode reward: [(0, '21.800'), (1, '24.540')] [2023-10-08 03:35:31,326][52060] Updated weights for policy 0, policy_version 89960 (0.0009) [2023-10-08 03:35:31,693][52060] Updated weights for policy 0, policy_version 89970 (0.0008) [2023-10-08 03:35:32,074][52060] Updated weights for policy 0, policy_version 89980 (0.0008) [2023-10-08 03:35:32,646][52059] Updated weights for policy 1, policy_version 91112 (0.0008) [2023-10-08 03:35:33,009][52059] Updated weights for policy 1, policy_version 91122 (0.0007) [2023-10-08 03:35:33,381][52059] Updated weights for policy 1, policy_version 91132 (0.0009) [2023-10-08 03:35:36,159][52060] Updated weights for policy 0, policy_version 89990 (0.0010) [2023-10-08 03:35:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 185466880. Throughput: 0: 1723.3, 1: 1709.2. Samples: 46375346. Policy #0 lag: (min: 18.0, avg: 24.9, max: 50.0) [2023-10-08 03:35:36,211][50642] Avg episode reward: [(0, '19.230'), (1, '21.910')] [2023-10-08 03:35:36,526][52060] Updated weights for policy 0, policy_version 90000 (0.0009) [2023-10-08 03:35:36,899][52060] Updated weights for policy 0, policy_version 90010 (0.0007) [2023-10-08 03:35:37,174][52059] Updated weights for policy 1, policy_version 91142 (0.0008) [2023-10-08 03:35:37,537][52059] Updated weights for policy 1, policy_version 91152 (0.0011) [2023-10-08 03:35:37,898][52059] Updated weights for policy 1, policy_version 91162 (0.0009) [2023-10-08 03:35:40,668][52060] Updated weights for policy 0, policy_version 90020 (0.0008) [2023-10-08 03:35:41,033][52060] Updated weights for policy 0, policy_version 90030 (0.0010) [2023-10-08 03:35:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 185532416. Throughput: 0: 1728.0, 1: 1724.0. Samples: 46396500. Policy #0 lag: (min: 18.0, avg: 24.9, max: 50.0) [2023-10-08 03:35:41,211][50642] Avg episode reward: [(0, '20.820'), (1, '22.550')] [2023-10-08 03:35:41,397][52060] Updated weights for policy 0, policy_version 90040 (0.0009) [2023-10-08 03:35:41,910][52059] Updated weights for policy 1, policy_version 91172 (0.0008) [2023-10-08 03:35:42,276][52059] Updated weights for policy 1, policy_version 91182 (0.0007) [2023-10-08 03:35:42,640][52059] Updated weights for policy 1, policy_version 91192 (0.0008) [2023-10-08 03:35:45,308][52060] Updated weights for policy 0, policy_version 90050 (0.0008) [2023-10-08 03:35:45,680][52060] Updated weights for policy 0, policy_version 90060 (0.0009) [2023-10-08 03:35:46,047][52060] Updated weights for policy 0, policy_version 90070 (0.0008) [2023-10-08 03:35:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 185597952. Throughput: 0: 1714.8, 1: 1748.9. Samples: 46417436. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 03:35:46,211][50642] Avg episode reward: [(0, '20.890'), (1, '24.210')] [2023-10-08 03:35:46,418][52060] Updated weights for policy 0, policy_version 90080 (0.0008) [2023-10-08 03:35:46,550][52059] Updated weights for policy 1, policy_version 91202 (0.0008) [2023-10-08 03:35:46,923][52059] Updated weights for policy 1, policy_version 91212 (0.0008) [2023-10-08 03:35:47,278][52059] Updated weights for policy 1, policy_version 91222 (0.0009) [2023-10-08 03:35:47,642][52059] Updated weights for policy 1, policy_version 91232 (0.0010) [2023-10-08 03:35:50,317][52060] Updated weights for policy 0, policy_version 90090 (0.0007) [2023-10-08 03:35:50,679][52060] Updated weights for policy 0, policy_version 90100 (0.0007) [2023-10-08 03:35:51,051][52060] Updated weights for policy 0, policy_version 90110 (0.0008) [2023-10-08 03:35:51,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 185696256. Throughput: 0: 1733.2, 1: 1717.3. Samples: 46427470. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 03:35:51,211][50642] Avg episode reward: [(0, '22.140'), (1, '23.770')] [2023-10-08 03:35:51,576][52059] Updated weights for policy 1, policy_version 91242 (0.0008) [2023-10-08 03:35:51,944][52059] Updated weights for policy 1, policy_version 91252 (0.0009) [2023-10-08 03:35:52,305][52059] Updated weights for policy 1, policy_version 91262 (0.0009) [2023-10-08 03:35:55,042][52060] Updated weights for policy 0, policy_version 90120 (0.0008) [2023-10-08 03:35:55,405][52060] Updated weights for policy 0, policy_version 90130 (0.0008) [2023-10-08 03:35:55,783][52060] Updated weights for policy 0, policy_version 90140 (0.0010) [2023-10-08 03:35:56,190][52059] Updated weights for policy 1, policy_version 91272 (0.0008) [2023-10-08 03:35:56,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 185761792. Throughput: 0: 1727.6, 1: 1752.8. Samples: 46448770. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 03:35:56,211][50642] Avg episode reward: [(0, '18.650'), (1, '23.530')] [2023-10-08 03:35:56,556][52059] Updated weights for policy 1, policy_version 91282 (0.0007) [2023-10-08 03:35:56,920][52059] Updated weights for policy 1, policy_version 91292 (0.0007) [2023-10-08 03:35:59,783][52060] Updated weights for policy 0, policy_version 90150 (0.0009) [2023-10-08 03:36:00,160][52060] Updated weights for policy 0, policy_version 90160 (0.0009) [2023-10-08 03:36:00,527][52060] Updated weights for policy 0, policy_version 90170 (0.0007) [2023-10-08 03:36:00,757][52059] Updated weights for policy 1, policy_version 91302 (0.0007) [2023-10-08 03:36:01,126][52059] Updated weights for policy 1, policy_version 91312 (0.0009) [2023-10-08 03:36:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 185827328. Throughput: 0: 1697.5, 1: 1751.2. Samples: 46468574. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 03:36:01,211][50642] Avg episode reward: [(0, '20.780'), (1, '23.030')] [2023-10-08 03:36:01,487][52059] Updated weights for policy 1, policy_version 91322 (0.0009) [2023-10-08 03:36:04,454][52060] Updated weights for policy 0, policy_version 90180 (0.0007) [2023-10-08 03:36:04,821][52060] Updated weights for policy 0, policy_version 90190 (0.0008) [2023-10-08 03:36:05,189][52060] Updated weights for policy 0, policy_version 90200 (0.0009) [2023-10-08 03:36:05,254][52059] Updated weights for policy 1, policy_version 91332 (0.0009) [2023-10-08 03:36:05,628][52059] Updated weights for policy 1, policy_version 91342 (0.0010) [2023-10-08 03:36:05,990][52059] Updated weights for policy 1, policy_version 91352 (0.0011) [2023-10-08 03:36:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 185892864. Throughput: 0: 1726.9, 1: 1749.2. Samples: 46479898. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 03:36:06,211][50642] Avg episode reward: [(0, '22.740'), (1, '24.850')] [2023-10-08 03:36:09,334][52060] Updated weights for policy 0, policy_version 90210 (0.0008) [2023-10-08 03:36:09,730][52060] Updated weights for policy 0, policy_version 90220 (0.0009) [2023-10-08 03:36:09,969][52059] Updated weights for policy 1, policy_version 91362 (0.0010) [2023-10-08 03:36:10,099][52060] Updated weights for policy 0, policy_version 90230 (0.0009) [2023-10-08 03:36:10,335][52059] Updated weights for policy 1, policy_version 91372 (0.0010) [2023-10-08 03:36:10,471][52060] Updated weights for policy 0, policy_version 90240 (0.0007) [2023-10-08 03:36:10,703][52059] Updated weights for policy 1, policy_version 91382 (0.0009) [2023-10-08 03:36:11,059][52059] Updated weights for policy 1, policy_version 91392 (0.0009) [2023-10-08 03:36:11,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 185991168. Throughput: 0: 1714.8, 1: 1754.5. Samples: 46500330. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 03:36:11,211][50642] Avg episode reward: [(0, '21.090'), (1, '22.420')] [2023-10-08 03:36:14,336][52060] Updated weights for policy 0, policy_version 90250 (0.0007) [2023-10-08 03:36:14,702][52060] Updated weights for policy 0, policy_version 90260 (0.0008) [2023-10-08 03:36:14,898][52059] Updated weights for policy 1, policy_version 91402 (0.0008) [2023-10-08 03:36:15,058][52060] Updated weights for policy 0, policy_version 90270 (0.0007) [2023-10-08 03:36:15,270][52059] Updated weights for policy 1, policy_version 91412 (0.0009) [2023-10-08 03:36:15,632][52059] Updated weights for policy 1, policy_version 91422 (0.0009) [2023-10-08 03:36:16,210][50642] Fps is (10 sec: 16383.6, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 186056704. Throughput: 0: 1695.8, 1: 1727.6. Samples: 46519752. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 03:36:16,211][50642] Avg episode reward: [(0, '20.210'), (1, '21.510')] [2023-10-08 03:36:19,063][52060] Updated weights for policy 0, policy_version 90280 (0.0008) [2023-10-08 03:36:19,444][52060] Updated weights for policy 0, policy_version 90290 (0.0008) [2023-10-08 03:36:19,608][52059] Updated weights for policy 1, policy_version 91432 (0.0008) [2023-10-08 03:36:19,817][52060] Updated weights for policy 0, policy_version 90300 (0.0009) [2023-10-08 03:36:19,977][52059] Updated weights for policy 1, policy_version 91442 (0.0009) [2023-10-08 03:36:20,337][52059] Updated weights for policy 1, policy_version 91452 (0.0010) [2023-10-08 03:36:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 186122240. Throughput: 0: 1720.9, 1: 1754.8. Samples: 46531754. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 03:36:21,211][50642] Avg episode reward: [(0, '19.340'), (1, '25.020')] [2023-10-08 03:36:23,877][52060] Updated weights for policy 0, policy_version 90310 (0.0009) [2023-10-08 03:36:24,252][52060] Updated weights for policy 0, policy_version 90320 (0.0010) [2023-10-08 03:36:24,364][52059] Updated weights for policy 1, policy_version 91462 (0.0009) [2023-10-08 03:36:24,614][52060] Updated weights for policy 0, policy_version 90330 (0.0008) [2023-10-08 03:36:24,726][52059] Updated weights for policy 1, policy_version 91472 (0.0008) [2023-10-08 03:36:25,087][52059] Updated weights for policy 1, policy_version 91482 (0.0009) [2023-10-08 03:36:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 186187776. Throughput: 0: 1695.8, 1: 1734.0. Samples: 46550840. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 03:36:26,211][50642] Avg episode reward: [(0, '21.900'), (1, '25.150')] [2023-10-08 03:36:28,741][52060] Updated weights for policy 0, policy_version 90340 (0.0007) [2023-10-08 03:36:28,966][52059] Updated weights for policy 1, policy_version 91492 (0.0008) [2023-10-08 03:36:29,110][52060] Updated weights for policy 0, policy_version 90350 (0.0008) [2023-10-08 03:36:29,330][52059] Updated weights for policy 1, policy_version 91502 (0.0008) [2023-10-08 03:36:29,482][52060] Updated weights for policy 0, policy_version 90360 (0.0007) [2023-10-08 03:36:29,693][52059] Updated weights for policy 1, policy_version 91512 (0.0009) [2023-10-08 03:36:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 186253312. Throughput: 0: 1705.0, 1: 1713.5. Samples: 46571268. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 03:36:31,211][50642] Avg episode reward: [(0, '20.560'), (1, '26.800')] [2023-10-08 03:36:33,438][52060] Updated weights for policy 0, policy_version 90370 (0.0007) [2023-10-08 03:36:33,805][52059] Updated weights for policy 1, policy_version 91522 (0.0010) [2023-10-08 03:36:33,812][52060] Updated weights for policy 0, policy_version 90380 (0.0008) [2023-10-08 03:36:34,161][52059] Updated weights for policy 1, policy_version 91532 (0.0007) [2023-10-08 03:36:34,169][52060] Updated weights for policy 0, policy_version 90390 (0.0009) [2023-10-08 03:36:34,538][52059] Updated weights for policy 1, policy_version 91542 (0.0008) [2023-10-08 03:36:34,543][52060] Updated weights for policy 0, policy_version 90400 (0.0009) [2023-10-08 03:36:34,906][52059] Updated weights for policy 1, policy_version 91552 (0.0008) [2023-10-08 03:36:36,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 186318848. Throughput: 0: 1706.0, 1: 1738.5. Samples: 46582470. Policy #0 lag: (min: 31.0, avg: 33.2, max: 63.0) [2023-10-08 03:36:36,211][50642] Avg episode reward: [(0, '19.140'), (1, '24.790')] [2023-10-08 03:36:38,679][52060] Updated weights for policy 0, policy_version 90410 (0.0009) [2023-10-08 03:36:38,879][52059] Updated weights for policy 1, policy_version 91562 (0.0008) [2023-10-08 03:36:39,038][52060] Updated weights for policy 0, policy_version 90420 (0.0010) [2023-10-08 03:36:39,246][52059] Updated weights for policy 1, policy_version 91572 (0.0008) [2023-10-08 03:36:39,418][52060] Updated weights for policy 0, policy_version 90430 (0.0007) [2023-10-08 03:36:39,615][52059] Updated weights for policy 1, policy_version 91582 (0.0008) [2023-10-08 03:36:41,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 186384384. Throughput: 0: 1686.4, 1: 1705.8. Samples: 46601416. Policy #0 lag: (min: 5.0, avg: 7.1, max: 37.0) [2023-10-08 03:36:41,211][50642] Avg episode reward: [(0, '20.540'), (1, '25.160')] [2023-10-08 03:36:43,340][52060] Updated weights for policy 0, policy_version 90440 (0.0009) [2023-10-08 03:36:43,705][52060] Updated weights for policy 0, policy_version 90450 (0.0009) [2023-10-08 03:36:43,845][52059] Updated weights for policy 1, policy_version 91592 (0.0010) [2023-10-08 03:36:44,074][52060] Updated weights for policy 0, policy_version 90460 (0.0007) [2023-10-08 03:36:44,215][52059] Updated weights for policy 1, policy_version 91602 (0.0008) [2023-10-08 03:36:44,575][52059] Updated weights for policy 1, policy_version 91612 (0.0010) [2023-10-08 03:36:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 186449920. Throughput: 0: 1717.5, 1: 1699.8. Samples: 46622354. Policy #0 lag: (min: 5.0, avg: 7.1, max: 37.0) [2023-10-08 03:36:46,212][50642] Avg episode reward: [(0, '20.280'), (1, '28.210')] [2023-10-08 03:36:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000090464_92635136.pth... [2023-10-08 03:36:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000091616_93814784.pth... [2023-10-08 03:36:46,257][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000089984_92143616.pth [2023-10-08 03:36:46,260][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000088864_90996736.pth [2023-10-08 03:36:48,068][52060] Updated weights for policy 0, policy_version 90470 (0.0009) [2023-10-08 03:36:48,408][52059] Updated weights for policy 1, policy_version 91622 (0.0009) [2023-10-08 03:36:48,428][52060] Updated weights for policy 0, policy_version 90480 (0.0008) [2023-10-08 03:36:48,768][52059] Updated weights for policy 1, policy_version 91632 (0.0008) [2023-10-08 03:36:48,803][52060] Updated weights for policy 0, policy_version 90490 (0.0008) [2023-10-08 03:36:49,128][52059] Updated weights for policy 1, policy_version 91642 (0.0008) [2023-10-08 03:36:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 186515456. Throughput: 0: 1687.9, 1: 1704.1. Samples: 46632536. Policy #0 lag: (min: 5.0, avg: 7.1, max: 37.0) [2023-10-08 03:36:51,211][50642] Avg episode reward: [(0, '19.090'), (1, '26.600')] [2023-10-08 03:36:52,736][52060] Updated weights for policy 0, policy_version 90500 (0.0008) [2023-10-08 03:36:53,113][52060] Updated weights for policy 0, policy_version 90510 (0.0009) [2023-10-08 03:36:53,335][52059] Updated weights for policy 1, policy_version 91652 (0.0009) [2023-10-08 03:36:53,484][52060] Updated weights for policy 0, policy_version 90520 (0.0008) [2023-10-08 03:36:53,704][52059] Updated weights for policy 1, policy_version 91662 (0.0007) [2023-10-08 03:36:54,063][52059] Updated weights for policy 1, policy_version 91672 (0.0008) [2023-10-08 03:36:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 186580992. Throughput: 0: 1692.7, 1: 1692.0. Samples: 46652640. Policy #0 lag: (min: 5.0, avg: 7.1, max: 37.0) [2023-10-08 03:36:56,212][50642] Avg episode reward: [(0, '20.510'), (1, '23.650')] [2023-10-08 03:36:57,468][52060] Updated weights for policy 0, policy_version 90530 (0.0008) [2023-10-08 03:36:57,839][52059] Updated weights for policy 1, policy_version 91682 (0.0009) [2023-10-08 03:36:57,874][52060] Updated weights for policy 0, policy_version 90540 (0.0009) [2023-10-08 03:36:58,204][52059] Updated weights for policy 1, policy_version 91692 (0.0008) [2023-10-08 03:36:58,229][52060] Updated weights for policy 0, policy_version 90550 (0.0007) [2023-10-08 03:36:58,572][52059] Updated weights for policy 1, policy_version 91702 (0.0008) [2023-10-08 03:36:58,603][52060] Updated weights for policy 0, policy_version 90560 (0.0007) [2023-10-08 03:36:58,930][52059] Updated weights for policy 1, policy_version 91712 (0.0009) [2023-10-08 03:37:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 186646528. Throughput: 0: 1706.7, 1: 1719.7. Samples: 46673940. Policy #0 lag: (min: 5.0, avg: 7.1, max: 37.0) [2023-10-08 03:37:01,211][50642] Avg episode reward: [(0, '22.530'), (1, '23.540')] [2023-10-08 03:37:02,559][52060] Updated weights for policy 0, policy_version 90570 (0.0007) [2023-10-08 03:37:02,857][52059] Updated weights for policy 1, policy_version 91722 (0.0009) [2023-10-08 03:37:02,919][52060] Updated weights for policy 0, policy_version 90580 (0.0008) [2023-10-08 03:37:03,226][52059] Updated weights for policy 1, policy_version 91732 (0.0008) [2023-10-08 03:37:03,286][52060] Updated weights for policy 0, policy_version 90590 (0.0007) [2023-10-08 03:37:03,598][52059] Updated weights for policy 1, policy_version 91742 (0.0009) [2023-10-08 03:37:06,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 186712064. Throughput: 0: 1675.0, 1: 1690.9. Samples: 46683220. Policy #0 lag: (min: 5.0, avg: 7.1, max: 37.0) [2023-10-08 03:37:06,211][50642] Avg episode reward: [(0, '18.550'), (1, '25.320')] [2023-10-08 03:37:07,388][52060] Updated weights for policy 0, policy_version 90600 (0.0008) [2023-10-08 03:37:07,584][52059] Updated weights for policy 1, policy_version 91752 (0.0010) [2023-10-08 03:37:07,760][52060] Updated weights for policy 0, policy_version 90610 (0.0009) [2023-10-08 03:37:07,942][52059] Updated weights for policy 1, policy_version 91762 (0.0008) [2023-10-08 03:37:08,127][52060] Updated weights for policy 0, policy_version 90620 (0.0008) [2023-10-08 03:37:08,307][52059] Updated weights for policy 1, policy_version 91772 (0.0008) [2023-10-08 03:37:11,211][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 186777600. Throughput: 0: 1703.9, 1: 1713.5. Samples: 46704622. Policy #0 lag: (min: 5.0, avg: 7.1, max: 37.0) [2023-10-08 03:37:11,212][50642] Avg episode reward: [(0, '17.440'), (1, '26.230')] [2023-10-08 03:37:12,280][52060] Updated weights for policy 0, policy_version 90630 (0.0009) [2023-10-08 03:37:12,337][52059] Updated weights for policy 1, policy_version 91782 (0.0008) [2023-10-08 03:37:12,641][52060] Updated weights for policy 0, policy_version 90640 (0.0008) [2023-10-08 03:37:12,692][52059] Updated weights for policy 1, policy_version 91792 (0.0008) [2023-10-08 03:37:13,018][52060] Updated weights for policy 0, policy_version 90650 (0.0008) [2023-10-08 03:37:13,059][52059] Updated weights for policy 1, policy_version 91802 (0.0010) [2023-10-08 03:37:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 186843136. Throughput: 0: 1704.7, 1: 1726.7. Samples: 46725682. Policy #0 lag: (min: 5.0, avg: 7.1, max: 37.0) [2023-10-08 03:37:16,211][50642] Avg episode reward: [(0, '20.230'), (1, '25.790')] [2023-10-08 03:37:17,010][52059] Updated weights for policy 1, policy_version 91812 (0.0007) [2023-10-08 03:37:17,017][52060] Updated weights for policy 0, policy_version 90660 (0.0008) [2023-10-08 03:37:17,381][52059] Updated weights for policy 1, policy_version 91822 (0.0008) [2023-10-08 03:37:17,388][52060] Updated weights for policy 0, policy_version 90670 (0.0007) [2023-10-08 03:37:17,746][52060] Updated weights for policy 0, policy_version 90680 (0.0007) [2023-10-08 03:37:17,748][52059] Updated weights for policy 1, policy_version 91832 (0.0009) [2023-10-08 03:37:21,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 186908672. Throughput: 0: 1690.8, 1: 1704.5. Samples: 46735258. Policy #0 lag: (min: 5.0, avg: 7.1, max: 37.0) [2023-10-08 03:37:21,211][50642] Avg episode reward: [(0, '20.140'), (1, '23.540')] [2023-10-08 03:37:21,529][52060] Updated weights for policy 0, policy_version 90690 (0.0008) [2023-10-08 03:37:21,614][52059] Updated weights for policy 1, policy_version 91842 (0.0009) [2023-10-08 03:37:21,892][52060] Updated weights for policy 0, policy_version 90700 (0.0008) [2023-10-08 03:37:21,977][52059] Updated weights for policy 1, policy_version 91852 (0.0009) [2023-10-08 03:37:22,260][52060] Updated weights for policy 0, policy_version 90710 (0.0009) [2023-10-08 03:37:22,345][52059] Updated weights for policy 1, policy_version 91862 (0.0007) [2023-10-08 03:37:22,628][52060] Updated weights for policy 0, policy_version 90720 (0.0008) [2023-10-08 03:37:22,701][52059] Updated weights for policy 1, policy_version 91872 (0.0008) [2023-10-08 03:37:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 186974208. Throughput: 0: 1718.6, 1: 1735.9. Samples: 46756868. Policy #0 lag: (min: 5.0, avg: 7.1, max: 37.0) [2023-10-08 03:37:26,211][50642] Avg episode reward: [(0, '18.350'), (1, '27.390')] [2023-10-08 03:37:26,546][52059] Updated weights for policy 1, policy_version 91882 (0.0008) [2023-10-08 03:37:26,577][52060] Updated weights for policy 0, policy_version 90730 (0.0009) [2023-10-08 03:37:26,913][52059] Updated weights for policy 1, policy_version 91892 (0.0010) [2023-10-08 03:37:26,950][52060] Updated weights for policy 0, policy_version 90740 (0.0007) [2023-10-08 03:37:27,264][52059] Updated weights for policy 1, policy_version 91902 (0.0008) [2023-10-08 03:37:27,322][52060] Updated weights for policy 0, policy_version 90750 (0.0008) [2023-10-08 03:37:31,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 187039744. Throughput: 0: 1717.5, 1: 1744.5. Samples: 46778146. Policy #0 lag: (min: 5.0, avg: 7.1, max: 37.0) [2023-10-08 03:37:31,211][50642] Avg episode reward: [(0, '19.010'), (1, '27.500')] [2023-10-08 03:37:31,273][52060] Updated weights for policy 0, policy_version 90760 (0.0009) [2023-10-08 03:37:31,306][52059] Updated weights for policy 1, policy_version 91912 (0.0009) [2023-10-08 03:37:31,649][52060] Updated weights for policy 0, policy_version 90770 (0.0007) [2023-10-08 03:37:31,676][52059] Updated weights for policy 1, policy_version 91922 (0.0008) [2023-10-08 03:37:32,016][52060] Updated weights for policy 0, policy_version 90780 (0.0007) [2023-10-08 03:37:32,031][52059] Updated weights for policy 1, policy_version 91932 (0.0007) [2023-10-08 03:37:36,043][52060] Updated weights for policy 0, policy_version 90790 (0.0008) [2023-10-08 03:37:36,057][52059] Updated weights for policy 1, policy_version 91942 (0.0010) [2023-10-08 03:37:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 187105280. Throughput: 0: 1712.0, 1: 1724.5. Samples: 46787178. Policy #0 lag: (min: 5.0, avg: 7.1, max: 37.0) [2023-10-08 03:37:36,211][50642] Avg episode reward: [(0, '20.980'), (1, '25.620')] [2023-10-08 03:37:36,410][52060] Updated weights for policy 0, policy_version 90800 (0.0007) [2023-10-08 03:37:36,421][52059] Updated weights for policy 1, policy_version 91952 (0.0007) [2023-10-08 03:37:36,782][52060] Updated weights for policy 0, policy_version 90810 (0.0007) [2023-10-08 03:37:36,795][52059] Updated weights for policy 1, policy_version 91962 (0.0007) [2023-10-08 03:37:40,676][52059] Updated weights for policy 1, policy_version 91972 (0.0009) [2023-10-08 03:37:40,796][52060] Updated weights for policy 0, policy_version 90820 (0.0008) [2023-10-08 03:37:41,041][52059] Updated weights for policy 1, policy_version 91982 (0.0009) [2023-10-08 03:37:41,164][52060] Updated weights for policy 0, policy_version 90830 (0.0008) [2023-10-08 03:37:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 187170816. Throughput: 0: 1722.5, 1: 1742.6. Samples: 46808572. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 03:37:41,211][50642] Avg episode reward: [(0, '21.310'), (1, '22.990')] [2023-10-08 03:37:41,407][52059] Updated weights for policy 1, policy_version 91992 (0.0007) [2023-10-08 03:37:41,521][52060] Updated weights for policy 0, policy_version 90840 (0.0007) [2023-10-08 03:37:45,212][52059] Updated weights for policy 1, policy_version 92002 (0.0007) [2023-10-08 03:37:45,527][52060] Updated weights for policy 0, policy_version 90850 (0.0008) [2023-10-08 03:37:45,580][52059] Updated weights for policy 1, policy_version 92012 (0.0009) [2023-10-08 03:37:45,924][52060] Updated weights for policy 0, policy_version 90860 (0.0008) [2023-10-08 03:37:45,942][52059] Updated weights for policy 1, policy_version 92022 (0.0008) [2023-10-08 03:37:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 187236352. Throughput: 0: 1713.6, 1: 1727.4. Samples: 46828782. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 03:37:46,211][50642] Avg episode reward: [(0, '18.990'), (1, '26.270')] [2023-10-08 03:37:46,288][52060] Updated weights for policy 0, policy_version 90870 (0.0008) [2023-10-08 03:37:46,304][52059] Updated weights for policy 1, policy_version 92032 (0.0007) [2023-10-08 03:37:46,655][52060] Updated weights for policy 0, policy_version 90880 (0.0009) [2023-10-08 03:37:50,175][52059] Updated weights for policy 1, policy_version 92042 (0.0008) [2023-10-08 03:37:50,495][52060] Updated weights for policy 0, policy_version 90890 (0.0007) [2023-10-08 03:37:50,544][52059] Updated weights for policy 1, policy_version 92052 (0.0007) [2023-10-08 03:37:50,854][52060] Updated weights for policy 0, policy_version 90900 (0.0007) [2023-10-08 03:37:50,903][52059] Updated weights for policy 1, policy_version 92062 (0.0008) [2023-10-08 03:37:51,204][52060] Updated weights for policy 0, policy_version 90910 (0.0009) [2023-10-08 03:37:51,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 187334656. Throughput: 0: 1726.7, 1: 1745.5. Samples: 46839470. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 03:37:51,211][50642] Avg episode reward: [(0, '20.510'), (1, '26.590')] [2023-10-08 03:37:54,864][52059] Updated weights for policy 1, policy_version 92072 (0.0008) [2023-10-08 03:37:55,232][52059] Updated weights for policy 1, policy_version 92082 (0.0008) [2023-10-08 03:37:55,252][52060] Updated weights for policy 0, policy_version 90920 (0.0008) [2023-10-08 03:37:55,596][52059] Updated weights for policy 1, policy_version 92092 (0.0007) [2023-10-08 03:37:55,611][52060] Updated weights for policy 0, policy_version 90930 (0.0008) [2023-10-08 03:37:55,976][52060] Updated weights for policy 0, policy_version 90940 (0.0007) [2023-10-08 03:37:56,210][50642] Fps is (10 sec: 19661.1, 60 sec: 14199.6, 300 sec: 13884.8). Total num frames: 187432960. Throughput: 0: 1726.2, 1: 1736.5. Samples: 46860442. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 03:37:56,211][50642] Avg episode reward: [(0, '21.850'), (1, '24.920')] [2023-10-08 03:37:59,499][52059] Updated weights for policy 1, policy_version 92102 (0.0009) [2023-10-08 03:37:59,848][52060] Updated weights for policy 0, policy_version 90950 (0.0007) [2023-10-08 03:37:59,856][52059] Updated weights for policy 1, policy_version 92112 (0.0009) [2023-10-08 03:38:00,208][52060] Updated weights for policy 0, policy_version 90960 (0.0010) [2023-10-08 03:38:00,223][52059] Updated weights for policy 1, policy_version 92122 (0.0008) [2023-10-08 03:38:00,577][52060] Updated weights for policy 0, policy_version 90970 (0.0009) [2023-10-08 03:38:01,210][50642] Fps is (10 sec: 16384.5, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 187498496. Throughput: 0: 1704.3, 1: 1718.6. Samples: 46879714. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 03:38:01,211][50642] Avg episode reward: [(0, '22.250'), (1, '22.540')] [2023-10-08 03:38:04,279][52059] Updated weights for policy 1, policy_version 92132 (0.0008) [2023-10-08 03:38:04,618][52060] Updated weights for policy 0, policy_version 90980 (0.0008) [2023-10-08 03:38:04,637][52059] Updated weights for policy 1, policy_version 92142 (0.0007) [2023-10-08 03:38:04,989][52060] Updated weights for policy 0, policy_version 90990 (0.0009) [2023-10-08 03:38:04,989][52059] Updated weights for policy 1, policy_version 92152 (0.0008) [2023-10-08 03:38:05,356][52060] Updated weights for policy 0, policy_version 91000 (0.0008) [2023-10-08 03:38:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 187564032. Throughput: 0: 1728.0, 1: 1746.7. Samples: 46891620. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 03:38:06,211][50642] Avg episode reward: [(0, '18.970'), (1, '22.820')] [2023-10-08 03:38:08,963][52059] Updated weights for policy 1, policy_version 92162 (0.0009) [2023-10-08 03:38:09,322][52059] Updated weights for policy 1, policy_version 92172 (0.0008) [2023-10-08 03:38:09,404][52060] Updated weights for policy 0, policy_version 91010 (0.0010) [2023-10-08 03:38:09,685][52059] Updated weights for policy 1, policy_version 92182 (0.0008) [2023-10-08 03:38:09,774][52060] Updated weights for policy 0, policy_version 91020 (0.0008) [2023-10-08 03:38:10,053][52059] Updated weights for policy 1, policy_version 92192 (0.0008) [2023-10-08 03:38:10,141][52060] Updated weights for policy 0, policy_version 91030 (0.0009) [2023-10-08 03:38:10,504][52060] Updated weights for policy 0, policy_version 91040 (0.0009) [2023-10-08 03:38:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 187629568. Throughput: 0: 1708.8, 1: 1720.0. Samples: 46911164. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 03:38:11,211][50642] Avg episode reward: [(0, '20.400'), (1, '25.410')] [2023-10-08 03:38:13,921][52059] Updated weights for policy 1, policy_version 92202 (0.0009) [2023-10-08 03:38:14,293][52059] Updated weights for policy 1, policy_version 92212 (0.0009) [2023-10-08 03:38:14,483][52060] Updated weights for policy 0, policy_version 91050 (0.0011) [2023-10-08 03:38:14,654][52059] Updated weights for policy 1, policy_version 92222 (0.0008) [2023-10-08 03:38:14,847][52060] Updated weights for policy 0, policy_version 91060 (0.0008) [2023-10-08 03:38:15,222][52060] Updated weights for policy 0, policy_version 91070 (0.0008) [2023-10-08 03:38:16,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 187695104. Throughput: 0: 1688.7, 1: 1715.2. Samples: 46931322. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 03:38:16,211][50642] Avg episode reward: [(0, '20.090'), (1, '24.570')] [2023-10-08 03:38:18,482][52059] Updated weights for policy 1, policy_version 92232 (0.0008) [2023-10-08 03:38:18,854][52059] Updated weights for policy 1, policy_version 92242 (0.0008) [2023-10-08 03:38:19,216][52059] Updated weights for policy 1, policy_version 92252 (0.0008) [2023-10-08 03:38:19,239][52060] Updated weights for policy 0, policy_version 91080 (0.0007) [2023-10-08 03:38:19,600][52060] Updated weights for policy 0, policy_version 91090 (0.0007) [2023-10-08 03:38:19,962][52060] Updated weights for policy 0, policy_version 91100 (0.0007) [2023-10-08 03:38:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 187760640. Throughput: 0: 1721.5, 1: 1732.9. Samples: 46942626. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 03:38:21,211][50642] Avg episode reward: [(0, '20.300'), (1, '22.250')] [2023-10-08 03:38:23,349][52059] Updated weights for policy 1, policy_version 92262 (0.0009) [2023-10-08 03:38:23,709][52059] Updated weights for policy 1, policy_version 92272 (0.0009) [2023-10-08 03:38:24,043][52060] Updated weights for policy 0, policy_version 91110 (0.0008) [2023-10-08 03:38:24,070][52059] Updated weights for policy 1, policy_version 92282 (0.0008) [2023-10-08 03:38:24,402][52060] Updated weights for policy 0, policy_version 91120 (0.0009) [2023-10-08 03:38:24,773][52060] Updated weights for policy 0, policy_version 91130 (0.0010) [2023-10-08 03:38:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 187826176. Throughput: 0: 1692.1, 1: 1717.5. Samples: 46962006. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 03:38:26,211][50642] Avg episode reward: [(0, '20.060'), (1, '21.370')] [2023-10-08 03:38:28,051][52059] Updated weights for policy 1, policy_version 92292 (0.0008) [2023-10-08 03:38:28,415][52059] Updated weights for policy 1, policy_version 92302 (0.0008) [2023-10-08 03:38:28,779][52060] Updated weights for policy 0, policy_version 91140 (0.0009) [2023-10-08 03:38:28,787][52059] Updated weights for policy 1, policy_version 92312 (0.0007) [2023-10-08 03:38:29,149][52060] Updated weights for policy 0, policy_version 91150 (0.0008) [2023-10-08 03:38:29,512][52060] Updated weights for policy 0, policy_version 91160 (0.0007) [2023-10-08 03:38:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 187891712. Throughput: 0: 1695.4, 1: 1728.3. Samples: 46982850. Policy #0 lag: (min: 31.0, avg: 38.7, max: 63.0) [2023-10-08 03:38:31,211][50642] Avg episode reward: [(0, '20.960'), (1, '25.900')] [2023-10-08 03:38:32,751][52059] Updated weights for policy 1, policy_version 92322 (0.0007) [2023-10-08 03:38:33,114][52059] Updated weights for policy 1, policy_version 92332 (0.0007) [2023-10-08 03:38:33,452][52060] Updated weights for policy 0, policy_version 91170 (0.0007) [2023-10-08 03:38:33,484][52059] Updated weights for policy 1, policy_version 92342 (0.0007) [2023-10-08 03:38:33,839][52060] Updated weights for policy 0, policy_version 91180 (0.0007) [2023-10-08 03:38:33,848][52059] Updated weights for policy 1, policy_version 92352 (0.0007) [2023-10-08 03:38:34,208][52060] Updated weights for policy 0, policy_version 91190 (0.0008) [2023-10-08 03:38:34,571][52060] Updated weights for policy 0, policy_version 91200 (0.0008) [2023-10-08 03:38:36,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 187957248. Throughput: 0: 1704.0, 1: 1710.5. Samples: 46993120. Policy #0 lag: (min: 6.0, avg: 11.7, max: 38.0) [2023-10-08 03:38:36,211][50642] Avg episode reward: [(0, '21.040'), (1, '27.880')] [2023-10-08 03:38:37,841][52059] Updated weights for policy 1, policy_version 92362 (0.0007) [2023-10-08 03:38:38,204][52059] Updated weights for policy 1, policy_version 92372 (0.0008) [2023-10-08 03:38:38,392][52060] Updated weights for policy 0, policy_version 91210 (0.0010) [2023-10-08 03:38:38,570][52059] Updated weights for policy 1, policy_version 92382 (0.0008) [2023-10-08 03:38:38,763][52060] Updated weights for policy 0, policy_version 91220 (0.0009) [2023-10-08 03:38:39,123][52060] Updated weights for policy 0, policy_version 91230 (0.0007) [2023-10-08 03:38:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 188022784. Throughput: 0: 1687.1, 1: 1710.3. Samples: 47013330. Policy #0 lag: (min: 6.0, avg: 11.7, max: 38.0) [2023-10-08 03:38:41,211][50642] Avg episode reward: [(0, '19.720'), (1, '23.770')] [2023-10-08 03:38:42,555][52059] Updated weights for policy 1, policy_version 92392 (0.0009) [2023-10-08 03:38:42,927][52059] Updated weights for policy 1, policy_version 92402 (0.0008) [2023-10-08 03:38:43,264][52060] Updated weights for policy 0, policy_version 91240 (0.0007) [2023-10-08 03:38:43,284][52059] Updated weights for policy 1, policy_version 92412 (0.0009) [2023-10-08 03:38:43,644][52060] Updated weights for policy 0, policy_version 91250 (0.0008) [2023-10-08 03:38:44,009][52060] Updated weights for policy 0, policy_version 91260 (0.0008) [2023-10-08 03:38:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 188088320. Throughput: 0: 1711.9, 1: 1728.5. Samples: 47034534. Policy #0 lag: (min: 6.0, avg: 11.7, max: 38.0) [2023-10-08 03:38:46,211][50642] Avg episode reward: [(0, '20.260'), (1, '23.220')] [2023-10-08 03:38:46,219][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000092416_94633984.pth... [2023-10-08 03:38:46,219][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000091264_93454336.pth... [2023-10-08 03:38:46,251][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000089664_91815936.pth [2023-10-08 03:38:46,259][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000090816_92995584.pth [2023-10-08 03:38:47,174][52059] Updated weights for policy 1, policy_version 92422 (0.0007) [2023-10-08 03:38:47,536][52059] Updated weights for policy 1, policy_version 92432 (0.0010) [2023-10-08 03:38:47,908][52059] Updated weights for policy 1, policy_version 92442 (0.0009) [2023-10-08 03:38:47,972][52060] Updated weights for policy 0, policy_version 91270 (0.0007) [2023-10-08 03:38:48,343][52060] Updated weights for policy 0, policy_version 91280 (0.0008) [2023-10-08 03:38:48,707][52060] Updated weights for policy 0, policy_version 91290 (0.0008) [2023-10-08 03:38:51,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 188153856. Throughput: 0: 1688.8, 1: 1700.0. Samples: 47044118. Policy #0 lag: (min: 6.0, avg: 11.7, max: 38.0) [2023-10-08 03:38:51,211][50642] Avg episode reward: [(0, '22.840'), (1, '18.210')] [2023-10-08 03:38:51,933][52059] Updated weights for policy 1, policy_version 92452 (0.0008) [2023-10-08 03:38:52,294][52059] Updated weights for policy 1, policy_version 92462 (0.0009) [2023-10-08 03:38:52,667][52059] Updated weights for policy 1, policy_version 92472 (0.0008) [2023-10-08 03:38:52,820][52060] Updated weights for policy 0, policy_version 91300 (0.0009) [2023-10-08 03:38:53,181][52060] Updated weights for policy 0, policy_version 91310 (0.0009) [2023-10-08 03:38:53,549][52060] Updated weights for policy 0, policy_version 91320 (0.0011) [2023-10-08 03:38:56,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13773.7). Total num frames: 188219392. Throughput: 0: 1695.9, 1: 1723.9. Samples: 47065054. Policy #0 lag: (min: 6.0, avg: 11.7, max: 38.0) [2023-10-08 03:38:56,211][50642] Avg episode reward: [(0, '20.800'), (1, '16.310')] [2023-10-08 03:38:56,458][52059] Updated weights for policy 1, policy_version 92482 (0.0008) [2023-10-08 03:38:56,829][52059] Updated weights for policy 1, policy_version 92492 (0.0008) [2023-10-08 03:38:57,194][52059] Updated weights for policy 1, policy_version 92502 (0.0011) [2023-10-08 03:38:57,415][52060] Updated weights for policy 0, policy_version 91330 (0.0010) [2023-10-08 03:38:57,562][52059] Updated weights for policy 1, policy_version 92512 (0.0007) [2023-10-08 03:38:57,787][52060] Updated weights for policy 0, policy_version 91340 (0.0007) [2023-10-08 03:38:58,158][52060] Updated weights for policy 0, policy_version 91350 (0.0009) [2023-10-08 03:38:58,528][52060] Updated weights for policy 0, policy_version 91360 (0.0009) [2023-10-08 03:39:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 188284928. Throughput: 0: 1713.9, 1: 1737.0. Samples: 47086614. Policy #0 lag: (min: 6.0, avg: 11.7, max: 38.0) [2023-10-08 03:39:01,211][50642] Avg episode reward: [(0, '21.140'), (1, '15.130')] [2023-10-08 03:39:01,456][52059] Updated weights for policy 1, policy_version 92522 (0.0012) [2023-10-08 03:39:01,826][52059] Updated weights for policy 1, policy_version 92532 (0.0009) [2023-10-08 03:39:02,188][52059] Updated weights for policy 1, policy_version 92542 (0.0008) [2023-10-08 03:39:02,674][52060] Updated weights for policy 0, policy_version 91370 (0.0008) [2023-10-08 03:39:03,047][52060] Updated weights for policy 0, policy_version 91380 (0.0008) [2023-10-08 03:39:03,403][52060] Updated weights for policy 0, policy_version 91390 (0.0008) [2023-10-08 03:39:06,061][52059] Updated weights for policy 1, policy_version 92552 (0.0009) [2023-10-08 03:39:06,210][50642] Fps is (10 sec: 13107.6, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 188350464. Throughput: 0: 1681.7, 1: 1725.2. Samples: 47095938. Policy #0 lag: (min: 6.0, avg: 11.7, max: 38.0) [2023-10-08 03:39:06,211][50642] Avg episode reward: [(0, '19.740'), (1, '15.600')] [2023-10-08 03:39:06,424][52059] Updated weights for policy 1, policy_version 92562 (0.0007) [2023-10-08 03:39:06,793][52059] Updated weights for policy 1, policy_version 92572 (0.0008) [2023-10-08 03:39:07,286][52060] Updated weights for policy 0, policy_version 91400 (0.0009) [2023-10-08 03:39:07,655][52060] Updated weights for policy 0, policy_version 91410 (0.0009) [2023-10-08 03:39:08,018][52060] Updated weights for policy 0, policy_version 91420 (0.0007) [2023-10-08 03:39:10,639][52059] Updated weights for policy 1, policy_version 92582 (0.0010) [2023-10-08 03:39:11,008][52059] Updated weights for policy 1, policy_version 92592 (0.0010) [2023-10-08 03:39:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 188416000. Throughput: 0: 1714.6, 1: 1744.8. Samples: 47117678. Policy #0 lag: (min: 6.0, avg: 11.7, max: 38.0) [2023-10-08 03:39:11,211][50642] Avg episode reward: [(0, '21.250'), (1, '14.280')] [2023-10-08 03:39:11,379][52059] Updated weights for policy 1, policy_version 92602 (0.0010) [2023-10-08 03:39:11,978][52060] Updated weights for policy 0, policy_version 91430 (0.0007) [2023-10-08 03:39:12,350][52060] Updated weights for policy 0, policy_version 91440 (0.0007) [2023-10-08 03:39:12,716][52060] Updated weights for policy 0, policy_version 91450 (0.0007) [2023-10-08 03:39:15,285][52059] Updated weights for policy 1, policy_version 92612 (0.0008) [2023-10-08 03:39:15,648][52059] Updated weights for policy 1, policy_version 92622 (0.0009) [2023-10-08 03:39:16,020][52059] Updated weights for policy 1, policy_version 92632 (0.0009) [2023-10-08 03:39:16,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 188481536. Throughput: 0: 1727.1, 1: 1729.9. Samples: 47138412. Policy #0 lag: (min: 6.0, avg: 11.7, max: 38.0) [2023-10-08 03:39:16,211][50642] Avg episode reward: [(0, '21.480'), (1, '14.140')] [2023-10-08 03:39:16,545][52060] Updated weights for policy 0, policy_version 91460 (0.0008) [2023-10-08 03:39:16,911][52060] Updated weights for policy 0, policy_version 91470 (0.0007) [2023-10-08 03:39:17,287][52060] Updated weights for policy 0, policy_version 91480 (0.0008) [2023-10-08 03:39:19,853][52059] Updated weights for policy 1, policy_version 92642 (0.0007) [2023-10-08 03:39:20,216][52059] Updated weights for policy 1, policy_version 92652 (0.0010) [2023-10-08 03:39:20,577][52059] Updated weights for policy 1, policy_version 92662 (0.0011) [2023-10-08 03:39:20,941][52059] Updated weights for policy 1, policy_version 92672 (0.0010) [2023-10-08 03:39:21,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 188579840. Throughput: 0: 1707.1, 1: 1745.3. Samples: 47148478. Policy #0 lag: (min: 6.0, avg: 11.7, max: 38.0) [2023-10-08 03:39:21,211][50642] Avg episode reward: [(0, '20.890'), (1, '16.280')] [2023-10-08 03:39:21,392][52060] Updated weights for policy 0, policy_version 91490 (0.0010) [2023-10-08 03:39:21,769][52060] Updated weights for policy 0, policy_version 91500 (0.0008) [2023-10-08 03:39:22,139][52060] Updated weights for policy 0, policy_version 91510 (0.0008) [2023-10-08 03:39:22,507][52060] Updated weights for policy 0, policy_version 91520 (0.0008) [2023-10-08 03:39:24,721][52059] Updated weights for policy 1, policy_version 92682 (0.0012) [2023-10-08 03:39:25,097][52059] Updated weights for policy 1, policy_version 92692 (0.0009) [2023-10-08 03:39:25,462][52059] Updated weights for policy 1, policy_version 92702 (0.0008) [2023-10-08 03:39:26,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 188645376. Throughput: 0: 1720.8, 1: 1748.7. Samples: 47169454. Policy #0 lag: (min: 6.0, avg: 11.7, max: 38.0) [2023-10-08 03:39:26,211][50642] Avg episode reward: [(0, '19.490'), (1, '14.050')] [2023-10-08 03:39:26,427][52060] Updated weights for policy 0, policy_version 91530 (0.0009) [2023-10-08 03:39:26,785][52060] Updated weights for policy 0, policy_version 91540 (0.0008) [2023-10-08 03:39:27,157][52060] Updated weights for policy 0, policy_version 91550 (0.0008) [2023-10-08 03:39:29,356][52059] Updated weights for policy 1, policy_version 92712 (0.0009) [2023-10-08 03:39:29,714][52059] Updated weights for policy 1, policy_version 92722 (0.0007) [2023-10-08 03:39:30,081][52059] Updated weights for policy 1, policy_version 92732 (0.0008) [2023-10-08 03:39:31,132][52060] Updated weights for policy 0, policy_version 91560 (0.0007) [2023-10-08 03:39:31,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 188710912. Throughput: 0: 1722.8, 1: 1737.6. Samples: 47190252. Policy #0 lag: (min: 6.0, avg: 11.7, max: 38.0) [2023-10-08 03:39:31,211][50642] Avg episode reward: [(0, '22.120'), (1, '16.700')] [2023-10-08 03:39:31,503][52060] Updated weights for policy 0, policy_version 91570 (0.0010) [2023-10-08 03:39:31,869][52060] Updated weights for policy 0, policy_version 91580 (0.0010) [2023-10-08 03:39:34,116][52059] Updated weights for policy 1, policy_version 92742 (0.0008) [2023-10-08 03:39:34,488][52059] Updated weights for policy 1, policy_version 92752 (0.0008) [2023-10-08 03:39:34,858][52059] Updated weights for policy 1, policy_version 92762 (0.0008) [2023-10-08 03:39:35,780][52060] Updated weights for policy 0, policy_version 91590 (0.0010) [2023-10-08 03:39:36,137][52060] Updated weights for policy 0, policy_version 91600 (0.0008) [2023-10-08 03:39:36,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 188776448. Throughput: 0: 1719.6, 1: 1769.3. Samples: 47201120. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) [2023-10-08 03:39:36,211][50642] Avg episode reward: [(0, '22.000'), (1, '14.990')] [2023-10-08 03:39:36,515][52060] Updated weights for policy 0, policy_version 91610 (0.0007) [2023-10-08 03:39:38,809][52059] Updated weights for policy 1, policy_version 92772 (0.0008) [2023-10-08 03:39:39,175][52059] Updated weights for policy 1, policy_version 92782 (0.0007) [2023-10-08 03:39:39,542][52059] Updated weights for policy 1, policy_version 92792 (0.0007) [2023-10-08 03:39:40,460][52060] Updated weights for policy 0, policy_version 91620 (0.0008) [2023-10-08 03:39:40,831][52060] Updated weights for policy 0, policy_version 91630 (0.0010) [2023-10-08 03:39:41,189][52060] Updated weights for policy 0, policy_version 91640 (0.0010) [2023-10-08 03:39:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 188841984. Throughput: 0: 1733.5, 1: 1744.1. Samples: 47221542. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) [2023-10-08 03:39:41,211][50642] Avg episode reward: [(0, '22.050'), (1, '15.840')] [2023-10-08 03:39:43,559][52059] Updated weights for policy 1, policy_version 92802 (0.0008) [2023-10-08 03:39:43,929][52059] Updated weights for policy 1, policy_version 92812 (0.0009) [2023-10-08 03:39:44,292][52059] Updated weights for policy 1, policy_version 92822 (0.0010) [2023-10-08 03:39:44,651][52059] Updated weights for policy 1, policy_version 92832 (0.0010) [2023-10-08 03:39:45,116][52060] Updated weights for policy 0, policy_version 91650 (0.0010) [2023-10-08 03:39:45,487][52060] Updated weights for policy 0, policy_version 91660 (0.0010) [2023-10-08 03:39:45,845][52060] Updated weights for policy 0, policy_version 91670 (0.0007) [2023-10-08 03:39:46,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 188940288. Throughput: 0: 1716.6, 1: 1737.8. Samples: 47242062. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) [2023-10-08 03:39:46,211][52060] Updated weights for policy 0, policy_version 91680 (0.0010) [2023-10-08 03:39:46,211][50642] Avg episode reward: [(0, '19.760'), (1, '17.070')] [2023-10-08 03:39:48,252][52059] Updated weights for policy 1, policy_version 92842 (0.0010) [2023-10-08 03:39:48,618][52059] Updated weights for policy 1, policy_version 92852 (0.0009) [2023-10-08 03:39:48,973][52059] Updated weights for policy 1, policy_version 92862 (0.0009) [2023-10-08 03:39:49,917][52060] Updated weights for policy 0, policy_version 91690 (0.0009) [2023-10-08 03:39:50,283][52060] Updated weights for policy 0, policy_version 91700 (0.0011) [2023-10-08 03:39:50,659][52060] Updated weights for policy 0, policy_version 91710 (0.0009) [2023-10-08 03:39:51,210][50642] Fps is (10 sec: 16384.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 189005824. Throughput: 0: 1740.0, 1: 1745.8. Samples: 47252798. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) [2023-10-08 03:39:51,211][50642] Avg episode reward: [(0, '20.650'), (1, '14.430')] [2023-10-08 03:39:52,807][52059] Updated weights for policy 1, policy_version 92872 (0.0007) [2023-10-08 03:39:53,184][52059] Updated weights for policy 1, policy_version 92882 (0.0009) [2023-10-08 03:39:53,546][52059] Updated weights for policy 1, policy_version 92892 (0.0009) [2023-10-08 03:39:54,596][52060] Updated weights for policy 0, policy_version 91720 (0.0010) [2023-10-08 03:39:54,965][52060] Updated weights for policy 0, policy_version 91730 (0.0008) [2023-10-08 03:39:55,336][52060] Updated weights for policy 0, policy_version 91740 (0.0009) [2023-10-08 03:39:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 189071360. Throughput: 0: 1724.2, 1: 1737.8. Samples: 47273468. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) [2023-10-08 03:39:56,211][50642] Avg episode reward: [(0, '22.840'), (1, '16.490')] [2023-10-08 03:39:57,696][52059] Updated weights for policy 1, policy_version 92902 (0.0008) [2023-10-08 03:39:58,080][52059] Updated weights for policy 1, policy_version 92912 (0.0008) [2023-10-08 03:39:58,449][52059] Updated weights for policy 1, policy_version 92922 (0.0010) [2023-10-08 03:39:59,256][52060] Updated weights for policy 0, policy_version 91750 (0.0007) [2023-10-08 03:39:59,629][52060] Updated weights for policy 0, policy_version 91760 (0.0007) [2023-10-08 03:39:59,992][52060] Updated weights for policy 0, policy_version 91770 (0.0007) [2023-10-08 03:40:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 189136896. Throughput: 0: 1702.9, 1: 1752.1. Samples: 47293884. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) [2023-10-08 03:40:01,211][50642] Avg episode reward: [(0, '20.220'), (1, '15.770')] [2023-10-08 03:40:02,414][52059] Updated weights for policy 1, policy_version 92932 (0.0008) [2023-10-08 03:40:02,783][52059] Updated weights for policy 1, policy_version 92942 (0.0007) [2023-10-08 03:40:03,149][52059] Updated weights for policy 1, policy_version 92952 (0.0008) [2023-10-08 03:40:03,994][52060] Updated weights for policy 0, policy_version 91780 (0.0009) [2023-10-08 03:40:04,359][52060] Updated weights for policy 0, policy_version 91790 (0.0009) [2023-10-08 03:40:04,731][52060] Updated weights for policy 0, policy_version 91800 (0.0008) [2023-10-08 03:40:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 189202432. Throughput: 0: 1732.0, 1: 1734.5. Samples: 47304472. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) [2023-10-08 03:40:06,211][50642] Avg episode reward: [(0, '19.440'), (1, '15.690')] [2023-10-08 03:40:07,027][52059] Updated weights for policy 1, policy_version 92962 (0.0007) [2023-10-08 03:40:07,394][52059] Updated weights for policy 1, policy_version 92972 (0.0007) [2023-10-08 03:40:07,760][52059] Updated weights for policy 1, policy_version 92982 (0.0008) [2023-10-08 03:40:08,128][52059] Updated weights for policy 1, policy_version 92992 (0.0009) [2023-10-08 03:40:08,967][52060] Updated weights for policy 0, policy_version 91810 (0.0009) [2023-10-08 03:40:09,362][52060] Updated weights for policy 0, policy_version 91820 (0.0008) [2023-10-08 03:40:09,728][52060] Updated weights for policy 0, policy_version 91830 (0.0007) [2023-10-08 03:40:10,100][52060] Updated weights for policy 0, policy_version 91840 (0.0011) [2023-10-08 03:40:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 189267968. Throughput: 0: 1709.2, 1: 1740.8. Samples: 47324708. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) [2023-10-08 03:40:11,211][50642] Avg episode reward: [(0, '18.550'), (1, '17.320')] [2023-10-08 03:40:11,999][52059] Updated weights for policy 1, policy_version 93002 (0.0008) [2023-10-08 03:40:12,366][52059] Updated weights for policy 1, policy_version 93012 (0.0009) [2023-10-08 03:40:12,727][52059] Updated weights for policy 1, policy_version 93022 (0.0008) [2023-10-08 03:40:14,210][52060] Updated weights for policy 0, policy_version 91850 (0.0007) [2023-10-08 03:40:14,573][52060] Updated weights for policy 0, policy_version 91860 (0.0009) [2023-10-08 03:40:14,934][52060] Updated weights for policy 0, policy_version 91870 (0.0007) [2023-10-08 03:40:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 189333504. Throughput: 0: 1699.7, 1: 1757.3. Samples: 47345816. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) [2023-10-08 03:40:16,211][50642] Avg episode reward: [(0, '22.530'), (1, '15.520')] [2023-10-08 03:40:16,545][52059] Updated weights for policy 1, policy_version 93032 (0.0010) [2023-10-08 03:40:16,902][52059] Updated weights for policy 1, policy_version 93042 (0.0010) [2023-10-08 03:40:17,267][52059] Updated weights for policy 1, policy_version 93052 (0.0011) [2023-10-08 03:40:18,955][52060] Updated weights for policy 0, policy_version 91880 (0.0008) [2023-10-08 03:40:19,329][52060] Updated weights for policy 0, policy_version 91890 (0.0008) [2023-10-08 03:40:19,702][52060] Updated weights for policy 0, policy_version 91900 (0.0009) [2023-10-08 03:40:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 189399040. Throughput: 0: 1723.8, 1: 1722.7. Samples: 47356212. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) [2023-10-08 03:40:21,211][50642] Avg episode reward: [(0, '20.910'), (1, '17.600')] [2023-10-08 03:40:21,240][52059] Updated weights for policy 1, policy_version 93062 (0.0008) [2023-10-08 03:40:21,614][52059] Updated weights for policy 1, policy_version 93072 (0.0008) [2023-10-08 03:40:21,976][52059] Updated weights for policy 1, policy_version 93082 (0.0010) [2023-10-08 03:40:23,585][52060] Updated weights for policy 0, policy_version 91910 (0.0008) [2023-10-08 03:40:23,956][52060] Updated weights for policy 0, policy_version 91920 (0.0008) [2023-10-08 03:40:24,327][52060] Updated weights for policy 0, policy_version 91930 (0.0008) [2023-10-08 03:40:25,769][52059] Updated weights for policy 1, policy_version 93092 (0.0008) [2023-10-08 03:40:26,131][52059] Updated weights for policy 1, policy_version 93102 (0.0008) [2023-10-08 03:40:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 189464576. Throughput: 0: 1695.9, 1: 1754.0. Samples: 47376788. Policy #0 lag: (min: 31.0, avg: 38.2, max: 63.0) [2023-10-08 03:40:26,211][50642] Avg episode reward: [(0, '19.540'), (1, '17.070')] [2023-10-08 03:40:26,495][52059] Updated weights for policy 1, policy_version 93112 (0.0009) [2023-10-08 03:40:28,135][52060] Updated weights for policy 0, policy_version 91940 (0.0010) [2023-10-08 03:40:28,500][52060] Updated weights for policy 0, policy_version 91950 (0.0008) [2023-10-08 03:40:28,873][52060] Updated weights for policy 0, policy_version 91960 (0.0008) [2023-10-08 03:40:30,460][52059] Updated weights for policy 1, policy_version 93122 (0.0010) [2023-10-08 03:40:30,819][52059] Updated weights for policy 1, policy_version 93132 (0.0008) [2023-10-08 03:40:31,184][52059] Updated weights for policy 1, policy_version 93142 (0.0007) [2023-10-08 03:40:31,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 189530112. Throughput: 0: 1717.6, 1: 1740.4. Samples: 47397670. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:40:31,211][50642] Avg episode reward: [(0, '20.380'), (1, '16.970')] [2023-10-08 03:40:31,543][52059] Updated weights for policy 1, policy_version 93152 (0.0007) [2023-10-08 03:40:32,814][52060] Updated weights for policy 0, policy_version 91970 (0.0008) [2023-10-08 03:40:33,182][52060] Updated weights for policy 0, policy_version 91980 (0.0008) [2023-10-08 03:40:33,555][52060] Updated weights for policy 0, policy_version 91990 (0.0008) [2023-10-08 03:40:33,920][52060] Updated weights for policy 0, policy_version 92000 (0.0008) [2023-10-08 03:40:35,396][52059] Updated weights for policy 1, policy_version 93162 (0.0009) [2023-10-08 03:40:35,759][52059] Updated weights for policy 1, policy_version 93172 (0.0008) [2023-10-08 03:40:36,125][52059] Updated weights for policy 1, policy_version 93182 (0.0007) [2023-10-08 03:40:36,210][50642] Fps is (10 sec: 16383.5, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 189628416. Throughput: 0: 1699.8, 1: 1742.6. Samples: 47407704. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:40:36,211][50642] Avg episode reward: [(0, '23.650'), (1, '18.170')] [2023-10-08 03:40:37,849][52060] Updated weights for policy 0, policy_version 92010 (0.0009) [2023-10-08 03:40:38,221][52060] Updated weights for policy 0, policy_version 92020 (0.0011) [2023-10-08 03:40:38,598][52060] Updated weights for policy 0, policy_version 92030 (0.0009) [2023-10-08 03:40:40,110][52059] Updated weights for policy 1, policy_version 93192 (0.0009) [2023-10-08 03:40:40,468][52059] Updated weights for policy 1, policy_version 93202 (0.0009) [2023-10-08 03:40:40,835][52059] Updated weights for policy 1, policy_version 93212 (0.0009) [2023-10-08 03:40:41,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 189693952. Throughput: 0: 1703.6, 1: 1745.3. Samples: 47428668. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:40:41,211][50642] Avg episode reward: [(0, '18.980'), (1, '15.930')] [2023-10-08 03:40:42,626][52060] Updated weights for policy 0, policy_version 92040 (0.0008) [2023-10-08 03:40:42,994][52060] Updated weights for policy 0, policy_version 92050 (0.0009) [2023-10-08 03:40:43,358][52060] Updated weights for policy 0, policy_version 92060 (0.0008) [2023-10-08 03:40:44,673][52059] Updated weights for policy 1, policy_version 93222 (0.0009) [2023-10-08 03:40:45,044][52059] Updated weights for policy 1, policy_version 93232 (0.0008) [2023-10-08 03:40:45,405][52059] Updated weights for policy 1, policy_version 93242 (0.0007) [2023-10-08 03:40:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 189759488. Throughput: 0: 1716.1, 1: 1722.4. Samples: 47448618. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:40:46,211][50642] Avg episode reward: [(0, '20.620'), (1, '17.530')] [2023-10-08 03:40:46,221][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000092064_94273536.pth... [2023-10-08 03:40:46,221][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000093248_95485952.pth... [2023-10-08 03:40:46,257][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000090464_92635136.pth [2023-10-08 03:40:46,261][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000091616_93814784.pth [2023-10-08 03:40:47,433][52060] Updated weights for policy 0, policy_version 92070 (0.0009) [2023-10-08 03:40:47,807][52060] Updated weights for policy 0, policy_version 92080 (0.0010) [2023-10-08 03:40:48,167][52060] Updated weights for policy 0, policy_version 92090 (0.0009) [2023-10-08 03:40:49,414][52059] Updated weights for policy 1, policy_version 93252 (0.0009) [2023-10-08 03:40:49,782][52059] Updated weights for policy 1, policy_version 93262 (0.0008) [2023-10-08 03:40:50,143][52059] Updated weights for policy 1, policy_version 93272 (0.0007) [2023-10-08 03:40:51,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 189825024. Throughput: 0: 1686.4, 1: 1751.3. Samples: 47459170. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:40:51,211][50642] Avg episode reward: [(0, '15.000'), (1, '15.790')] [2023-10-08 03:40:52,290][52060] Updated weights for policy 0, policy_version 92100 (0.0009) [2023-10-08 03:40:52,656][52060] Updated weights for policy 0, policy_version 92110 (0.0007) [2023-10-08 03:40:53,017][52060] Updated weights for policy 0, policy_version 92120 (0.0008) [2023-10-08 03:40:54,041][52059] Updated weights for policy 1, policy_version 93282 (0.0007) [2023-10-08 03:40:54,412][52059] Updated weights for policy 1, policy_version 93292 (0.0007) [2023-10-08 03:40:54,771][52059] Updated weights for policy 1, policy_version 93302 (0.0008) [2023-10-08 03:40:55,130][52059] Updated weights for policy 1, policy_version 93312 (0.0007) [2023-10-08 03:40:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 189890560. Throughput: 0: 1710.8, 1: 1730.8. Samples: 47479580. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:40:56,211][50642] Avg episode reward: [(0, '14.370'), (1, '16.900')] [2023-10-08 03:40:57,120][52060] Updated weights for policy 0, policy_version 92130 (0.0007) [2023-10-08 03:40:57,536][52060] Updated weights for policy 0, policy_version 92140 (0.0010) [2023-10-08 03:40:57,908][52060] Updated weights for policy 0, policy_version 92150 (0.0010) [2023-10-08 03:40:58,277][52060] Updated weights for policy 0, policy_version 92160 (0.0011) [2023-10-08 03:40:58,994][52059] Updated weights for policy 1, policy_version 93322 (0.0010) [2023-10-08 03:40:59,366][52059] Updated weights for policy 1, policy_version 93332 (0.0009) [2023-10-08 03:40:59,737][52059] Updated weights for policy 1, policy_version 93342 (0.0007) [2023-10-08 03:41:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 189956096. Throughput: 0: 1711.4, 1: 1720.1. Samples: 47500234. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:41:01,211][50642] Avg episode reward: [(0, '16.880'), (1, '17.500')] [2023-10-08 03:41:02,284][52060] Updated weights for policy 0, policy_version 92170 (0.0008) [2023-10-08 03:41:02,644][52060] Updated weights for policy 0, policy_version 92180 (0.0007) [2023-10-08 03:41:03,014][52060] Updated weights for policy 0, policy_version 92190 (0.0007) [2023-10-08 03:41:03,729][52059] Updated weights for policy 1, policy_version 93352 (0.0009) [2023-10-08 03:41:04,090][52059] Updated weights for policy 1, policy_version 93362 (0.0009) [2023-10-08 03:41:04,459][52059] Updated weights for policy 1, policy_version 93372 (0.0008) [2023-10-08 03:41:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 190021632. Throughput: 0: 1686.0, 1: 1742.2. Samples: 47510480. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:41:06,211][50642] Avg episode reward: [(0, '14.450'), (1, '16.180')] [2023-10-08 03:41:06,903][52060] Updated weights for policy 0, policy_version 92200 (0.0007) [2023-10-08 03:41:07,268][52060] Updated weights for policy 0, policy_version 92210 (0.0010) [2023-10-08 03:41:07,632][52060] Updated weights for policy 0, policy_version 92220 (0.0010) [2023-10-08 03:41:08,475][52059] Updated weights for policy 1, policy_version 93382 (0.0008) [2023-10-08 03:41:08,830][52059] Updated weights for policy 1, policy_version 93392 (0.0007) [2023-10-08 03:41:09,207][52059] Updated weights for policy 1, policy_version 93402 (0.0009) [2023-10-08 03:41:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 190087168. Throughput: 0: 1711.2, 1: 1713.9. Samples: 47530918. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:41:11,211][50642] Avg episode reward: [(0, '16.660'), (1, '19.040')] [2023-10-08 03:41:11,588][52060] Updated weights for policy 0, policy_version 92230 (0.0009) [2023-10-08 03:41:11,967][52060] Updated weights for policy 0, policy_version 92240 (0.0007) [2023-10-08 03:41:12,334][52060] Updated weights for policy 0, policy_version 92250 (0.0007) [2023-10-08 03:41:12,981][52059] Updated weights for policy 1, policy_version 93412 (0.0007) [2023-10-08 03:41:13,351][52059] Updated weights for policy 1, policy_version 93422 (0.0010) [2023-10-08 03:41:13,719][52059] Updated weights for policy 1, policy_version 93432 (0.0009) [2023-10-08 03:41:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 190152704. Throughput: 0: 1706.8, 1: 1733.2. Samples: 47552470. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:41:16,211][50642] Avg episode reward: [(0, '16.260'), (1, '16.220')] [2023-10-08 03:41:16,445][52060] Updated weights for policy 0, policy_version 92260 (0.0007) [2023-10-08 03:41:16,815][52060] Updated weights for policy 0, policy_version 92270 (0.0008) [2023-10-08 03:41:17,172][52060] Updated weights for policy 0, policy_version 92280 (0.0008) [2023-10-08 03:41:17,692][52059] Updated weights for policy 1, policy_version 93442 (0.0007) [2023-10-08 03:41:18,052][52059] Updated weights for policy 1, policy_version 93452 (0.0009) [2023-10-08 03:41:18,423][52059] Updated weights for policy 1, policy_version 93462 (0.0009) [2023-10-08 03:41:18,788][52059] Updated weights for policy 1, policy_version 93472 (0.0007) [2023-10-08 03:41:21,054][52060] Updated weights for policy 0, policy_version 92290 (0.0008) [2023-10-08 03:41:21,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 190218240. Throughput: 0: 1707.5, 1: 1724.2. Samples: 47562132. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:41:21,211][50642] Avg episode reward: [(0, '15.150'), (1, '17.490')] [2023-10-08 03:41:21,428][52060] Updated weights for policy 0, policy_version 92300 (0.0007) [2023-10-08 03:41:21,803][52060] Updated weights for policy 0, policy_version 92310 (0.0008) [2023-10-08 03:41:22,180][52060] Updated weights for policy 0, policy_version 92320 (0.0007) [2023-10-08 03:41:22,690][52059] Updated weights for policy 1, policy_version 93482 (0.0010) [2023-10-08 03:41:23,055][52059] Updated weights for policy 1, policy_version 93492 (0.0009) [2023-10-08 03:41:23,424][52059] Updated weights for policy 1, policy_version 93502 (0.0007) [2023-10-08 03:41:26,137][52060] Updated weights for policy 0, policy_version 92330 (0.0007) [2023-10-08 03:41:26,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 190283776. Throughput: 0: 1717.3, 1: 1726.8. Samples: 47583652. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:41:26,211][50642] Avg episode reward: [(0, '18.110'), (1, '17.550')] [2023-10-08 03:41:26,511][52060] Updated weights for policy 0, policy_version 92340 (0.0008) [2023-10-08 03:41:26,889][52060] Updated weights for policy 0, policy_version 92350 (0.0011) [2023-10-08 03:41:27,275][52059] Updated weights for policy 1, policy_version 93512 (0.0008) [2023-10-08 03:41:27,642][52059] Updated weights for policy 1, policy_version 93522 (0.0007) [2023-10-08 03:41:27,998][52059] Updated weights for policy 1, policy_version 93532 (0.0010) [2023-10-08 03:41:30,676][52060] Updated weights for policy 0, policy_version 92360 (0.0010) [2023-10-08 03:41:31,048][52060] Updated weights for policy 0, policy_version 92370 (0.0008) [2023-10-08 03:41:31,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 190349312. Throughput: 0: 1714.0, 1: 1756.9. Samples: 47604810. Policy #0 lag: (min: 10.0, avg: 16.6, max: 42.0) [2023-10-08 03:41:31,211][50642] Avg episode reward: [(0, '14.930'), (1, '15.550')] [2023-10-08 03:41:31,420][52060] Updated weights for policy 0, policy_version 92380 (0.0009) [2023-10-08 03:41:31,998][52059] Updated weights for policy 1, policy_version 93542 (0.0007) [2023-10-08 03:41:32,373][52059] Updated weights for policy 1, policy_version 93552 (0.0007) [2023-10-08 03:41:32,742][52059] Updated weights for policy 1, policy_version 93562 (0.0008) [2023-10-08 03:41:35,089][52060] Updated weights for policy 0, policy_version 92390 (0.0007) [2023-10-08 03:41:35,452][52060] Updated weights for policy 0, policy_version 92400 (0.0008) [2023-10-08 03:41:35,829][52060] Updated weights for policy 0, policy_version 92410 (0.0008) [2023-10-08 03:41:36,210][50642] Fps is (10 sec: 16383.6, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 190447616. Throughput: 0: 1732.7, 1: 1726.2. Samples: 47614818. Policy #0 lag: (min: 10.0, avg: 16.6, max: 42.0) [2023-10-08 03:41:36,211][50642] Avg episode reward: [(0, '16.270'), (1, '17.210')] [2023-10-08 03:41:36,553][52059] Updated weights for policy 1, policy_version 93572 (0.0009) [2023-10-08 03:41:36,924][52059] Updated weights for policy 1, policy_version 93582 (0.0008) [2023-10-08 03:41:37,291][52059] Updated weights for policy 1, policy_version 93592 (0.0009) [2023-10-08 03:41:39,832][52060] Updated weights for policy 0, policy_version 92420 (0.0010) [2023-10-08 03:41:40,201][52060] Updated weights for policy 0, policy_version 92430 (0.0007) [2023-10-08 03:41:40,576][52060] Updated weights for policy 0, policy_version 92440 (0.0008) [2023-10-08 03:41:41,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 190513152. Throughput: 0: 1732.8, 1: 1740.7. Samples: 47635888. Policy #0 lag: (min: 10.0, avg: 16.6, max: 42.0) [2023-10-08 03:41:41,211][50642] Avg episode reward: [(0, '15.820'), (1, '17.820')] [2023-10-08 03:41:41,360][52059] Updated weights for policy 1, policy_version 93602 (0.0007) [2023-10-08 03:41:41,728][52059] Updated weights for policy 1, policy_version 93612 (0.0009) [2023-10-08 03:41:42,098][52059] Updated weights for policy 1, policy_version 93622 (0.0008) [2023-10-08 03:41:42,465][52059] Updated weights for policy 1, policy_version 93632 (0.0008) [2023-10-08 03:41:44,445][52060] Updated weights for policy 0, policy_version 92450 (0.0009) [2023-10-08 03:41:44,834][52060] Updated weights for policy 0, policy_version 92460 (0.0009) [2023-10-08 03:41:45,200][52060] Updated weights for policy 0, policy_version 92470 (0.0007) [2023-10-08 03:41:45,568][52060] Updated weights for policy 0, policy_version 92480 (0.0009) [2023-10-08 03:41:46,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 190578688. Throughput: 0: 1711.7, 1: 1750.5. Samples: 47656032. Policy #0 lag: (min: 10.0, avg: 16.6, max: 42.0) [2023-10-08 03:41:46,211][50642] Avg episode reward: [(0, '14.150'), (1, '19.260')] [2023-10-08 03:41:46,385][52059] Updated weights for policy 1, policy_version 93642 (0.0008) [2023-10-08 03:41:46,759][52059] Updated weights for policy 1, policy_version 93652 (0.0010) [2023-10-08 03:41:47,120][52059] Updated weights for policy 1, policy_version 93662 (0.0007) [2023-10-08 03:41:49,516][52060] Updated weights for policy 0, policy_version 92490 (0.0007) [2023-10-08 03:41:49,894][52060] Updated weights for policy 0, policy_version 92500 (0.0007) [2023-10-08 03:41:50,259][52060] Updated weights for policy 0, policy_version 92510 (0.0009) [2023-10-08 03:41:50,905][52059] Updated weights for policy 1, policy_version 93672 (0.0010) [2023-10-08 03:41:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 190644224. Throughput: 0: 1745.1, 1: 1729.3. Samples: 47666828. Policy #0 lag: (min: 10.0, avg: 16.6, max: 42.0) [2023-10-08 03:41:51,211][50642] Avg episode reward: [(0, '16.800'), (1, '18.790')] [2023-10-08 03:41:51,283][52059] Updated weights for policy 1, policy_version 93682 (0.0008) [2023-10-08 03:41:51,650][52059] Updated weights for policy 1, policy_version 93692 (0.0008) [2023-10-08 03:41:54,438][52060] Updated weights for policy 0, policy_version 92520 (0.0009) [2023-10-08 03:41:54,804][52060] Updated weights for policy 0, policy_version 92530 (0.0009) [2023-10-08 03:41:55,182][52060] Updated weights for policy 0, policy_version 92540 (0.0011) [2023-10-08 03:41:55,484][52059] Updated weights for policy 1, policy_version 93702 (0.0008) [2023-10-08 03:41:55,855][52059] Updated weights for policy 1, policy_version 93712 (0.0010) [2023-10-08 03:41:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 190709760. Throughput: 0: 1721.7, 1: 1758.7. Samples: 47687536. Policy #0 lag: (min: 10.0, avg: 16.6, max: 42.0) [2023-10-08 03:41:56,211][50642] Avg episode reward: [(0, '14.310'), (1, '18.480')] [2023-10-08 03:41:56,217][52059] Updated weights for policy 1, policy_version 93722 (0.0010) [2023-10-08 03:41:59,151][52060] Updated weights for policy 0, policy_version 92550 (0.0009) [2023-10-08 03:41:59,514][52060] Updated weights for policy 0, policy_version 92560 (0.0007) [2023-10-08 03:41:59,885][52060] Updated weights for policy 0, policy_version 92570 (0.0010) [2023-10-08 03:42:00,252][52059] Updated weights for policy 1, policy_version 93732 (0.0008) [2023-10-08 03:42:00,626][52059] Updated weights for policy 1, policy_version 93742 (0.0007) [2023-10-08 03:42:00,978][52059] Updated weights for policy 1, policy_version 93752 (0.0009) [2023-10-08 03:42:01,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 190775296. Throughput: 0: 1712.8, 1: 1730.5. Samples: 47707418. Policy #0 lag: (min: 10.0, avg: 16.6, max: 42.0) [2023-10-08 03:42:01,211][50642] Avg episode reward: [(0, '15.900'), (1, '17.900')] [2023-10-08 03:42:03,801][52060] Updated weights for policy 0, policy_version 92580 (0.0008) [2023-10-08 03:42:04,171][52060] Updated weights for policy 0, policy_version 92590 (0.0010) [2023-10-08 03:42:04,553][52060] Updated weights for policy 0, policy_version 92600 (0.0009) [2023-10-08 03:42:04,759][52059] Updated weights for policy 1, policy_version 93762 (0.0011) [2023-10-08 03:42:05,120][52059] Updated weights for policy 1, policy_version 93772 (0.0009) [2023-10-08 03:42:05,477][52059] Updated weights for policy 1, policy_version 93782 (0.0010) [2023-10-08 03:42:05,837][52059] Updated weights for policy 1, policy_version 93792 (0.0009) [2023-10-08 03:42:06,210][50642] Fps is (10 sec: 16384.3, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 190873600. Throughput: 0: 1736.3, 1: 1750.2. Samples: 47719024. Policy #0 lag: (min: 10.0, avg: 16.6, max: 42.0) [2023-10-08 03:42:06,211][50642] Avg episode reward: [(0, '17.890'), (1, '16.070')] [2023-10-08 03:42:08,626][52060] Updated weights for policy 0, policy_version 92610 (0.0008) [2023-10-08 03:42:08,991][52060] Updated weights for policy 0, policy_version 92620 (0.0010) [2023-10-08 03:42:09,367][52060] Updated weights for policy 0, policy_version 92630 (0.0010) [2023-10-08 03:42:09,653][52059] Updated weights for policy 1, policy_version 93802 (0.0009) [2023-10-08 03:42:09,739][52060] Updated weights for policy 0, policy_version 92640 (0.0009) [2023-10-08 03:42:10,027][52059] Updated weights for policy 1, policy_version 93812 (0.0009) [2023-10-08 03:42:10,385][52059] Updated weights for policy 1, policy_version 93822 (0.0008) [2023-10-08 03:42:11,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 190939136. Throughput: 0: 1705.2, 1: 1739.3. Samples: 47738654. Policy #0 lag: (min: 10.0, avg: 16.6, max: 42.0) [2023-10-08 03:42:11,211][50642] Avg episode reward: [(0, '14.510'), (1, '17.620')] [2023-10-08 03:42:13,768][52060] Updated weights for policy 0, policy_version 92650 (0.0010) [2023-10-08 03:42:14,125][52060] Updated weights for policy 0, policy_version 92660 (0.0009) [2023-10-08 03:42:14,245][52059] Updated weights for policy 1, policy_version 93832 (0.0008) [2023-10-08 03:42:14,494][52060] Updated weights for policy 0, policy_version 92670 (0.0008) [2023-10-08 03:42:14,619][52059] Updated weights for policy 1, policy_version 93842 (0.0010) [2023-10-08 03:42:14,991][52059] Updated weights for policy 1, policy_version 93852 (0.0009) [2023-10-08 03:42:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 191004672. Throughput: 0: 1706.1, 1: 1724.5. Samples: 47759186. Policy #0 lag: (min: 10.0, avg: 16.6, max: 42.0) [2023-10-08 03:42:16,211][50642] Avg episode reward: [(0, '16.510'), (1, '17.740')] [2023-10-08 03:42:18,513][52060] Updated weights for policy 0, policy_version 92680 (0.0007) [2023-10-08 03:42:18,878][52060] Updated weights for policy 0, policy_version 92690 (0.0008) [2023-10-08 03:42:19,059][52059] Updated weights for policy 1, policy_version 93862 (0.0009) [2023-10-08 03:42:19,240][52060] Updated weights for policy 0, policy_version 92700 (0.0007) [2023-10-08 03:42:19,436][52059] Updated weights for policy 1, policy_version 93872 (0.0008) [2023-10-08 03:42:19,800][52059] Updated weights for policy 1, policy_version 93882 (0.0009) [2023-10-08 03:42:21,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 191070208. Throughput: 0: 1707.2, 1: 1752.5. Samples: 47770502. Policy #0 lag: (min: 10.0, avg: 16.6, max: 42.0) [2023-10-08 03:42:21,211][50642] Avg episode reward: [(0, '15.430'), (1, '17.060')] [2023-10-08 03:42:22,974][52060] Updated weights for policy 0, policy_version 92710 (0.0007) [2023-10-08 03:42:23,335][52060] Updated weights for policy 0, policy_version 92720 (0.0010) [2023-10-08 03:42:23,703][52060] Updated weights for policy 0, policy_version 92730 (0.0009) [2023-10-08 03:42:23,762][52059] Updated weights for policy 1, policy_version 93892 (0.0009) [2023-10-08 03:42:24,115][52059] Updated weights for policy 1, policy_version 93902 (0.0007) [2023-10-08 03:42:24,472][52059] Updated weights for policy 1, policy_version 93912 (0.0008) [2023-10-08 03:42:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13884.8). Total num frames: 191135744. Throughput: 0: 1693.4, 1: 1725.2. Samples: 47789724. Policy #0 lag: (min: 31.0, avg: 31.6, max: 47.0) [2023-10-08 03:42:26,211][50642] Avg episode reward: [(0, '14.140'), (1, '17.670')] [2023-10-08 03:42:27,649][52060] Updated weights for policy 0, policy_version 92740 (0.0010) [2023-10-08 03:42:28,025][52060] Updated weights for policy 0, policy_version 92750 (0.0009) [2023-10-08 03:42:28,383][52060] Updated weights for policy 0, policy_version 92760 (0.0010) [2023-10-08 03:42:28,493][52059] Updated weights for policy 1, policy_version 93922 (0.0008) [2023-10-08 03:42:28,866][52059] Updated weights for policy 1, policy_version 93932 (0.0009) [2023-10-08 03:42:29,228][52059] Updated weights for policy 1, policy_version 93942 (0.0010) [2023-10-08 03:42:29,594][52059] Updated weights for policy 1, policy_version 93952 (0.0010) [2023-10-08 03:42:31,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 191201280. Throughput: 0: 1723.8, 1: 1726.1. Samples: 47811278. Policy #0 lag: (min: 31.0, avg: 31.6, max: 47.0) [2023-10-08 03:42:31,211][50642] Avg episode reward: [(0, '16.530'), (1, '18.770')] [2023-10-08 03:42:32,425][52060] Updated weights for policy 0, policy_version 92770 (0.0009) [2023-10-08 03:42:32,827][52060] Updated weights for policy 0, policy_version 92780 (0.0010) [2023-10-08 03:42:33,195][52060] Updated weights for policy 0, policy_version 92790 (0.0007) [2023-10-08 03:42:33,474][52059] Updated weights for policy 1, policy_version 93962 (0.0007) [2023-10-08 03:42:33,560][52060] Updated weights for policy 0, policy_version 92800 (0.0009) [2023-10-08 03:42:33,843][52059] Updated weights for policy 1, policy_version 93972 (0.0007) [2023-10-08 03:42:34,204][52059] Updated weights for policy 1, policy_version 93982 (0.0007) [2023-10-08 03:42:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13884.7). Total num frames: 191266816. Throughput: 0: 1689.5, 1: 1739.9. Samples: 47821150. Policy #0 lag: (min: 31.0, avg: 31.6, max: 47.0) [2023-10-08 03:42:36,211][50642] Avg episode reward: [(0, '13.630'), (1, '18.650')] [2023-10-08 03:42:37,544][52060] Updated weights for policy 0, policy_version 92810 (0.0009) [2023-10-08 03:42:37,915][52060] Updated weights for policy 0, policy_version 92820 (0.0008) [2023-10-08 03:42:37,990][52059] Updated weights for policy 1, policy_version 93992 (0.0008) [2023-10-08 03:42:38,281][52060] Updated weights for policy 0, policy_version 92830 (0.0008) [2023-10-08 03:42:38,348][52059] Updated weights for policy 1, policy_version 94002 (0.0007) [2023-10-08 03:42:38,718][52059] Updated weights for policy 1, policy_version 94012 (0.0007) [2023-10-08 03:42:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13884.7). Total num frames: 191332352. Throughput: 0: 1708.5, 1: 1727.4. Samples: 47842152. Policy #0 lag: (min: 31.0, avg: 31.6, max: 47.0) [2023-10-08 03:42:41,211][50642] Avg episode reward: [(0, '16.430'), (1, '18.350')] [2023-10-08 03:42:42,350][52060] Updated weights for policy 0, policy_version 92840 (0.0007) [2023-10-08 03:42:42,558][52059] Updated weights for policy 1, policy_version 94022 (0.0007) [2023-10-08 03:42:42,720][52060] Updated weights for policy 0, policy_version 92850 (0.0008) [2023-10-08 03:42:42,917][52059] Updated weights for policy 1, policy_version 94032 (0.0007) [2023-10-08 03:42:43,085][52060] Updated weights for policy 0, policy_version 92860 (0.0009) [2023-10-08 03:42:43,288][52059] Updated weights for policy 1, policy_version 94042 (0.0008) [2023-10-08 03:42:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 191397888. Throughput: 0: 1716.3, 1: 1748.9. Samples: 47863354. Policy #0 lag: (min: 31.0, avg: 31.6, max: 47.0) [2023-10-08 03:42:46,211][50642] Avg episode reward: [(0, '14.420'), (1, '17.730')] [2023-10-08 03:42:46,218][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000094048_96305152.pth... [2023-10-08 03:42:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000092864_95092736.pth... [2023-10-08 03:42:46,258][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000092416_94633984.pth [2023-10-08 03:42:46,258][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000091264_93454336.pth [2023-10-08 03:42:47,153][52060] Updated weights for policy 0, policy_version 92870 (0.0008) [2023-10-08 03:42:47,217][52059] Updated weights for policy 1, policy_version 94052 (0.0007) [2023-10-08 03:42:47,527][52060] Updated weights for policy 0, policy_version 92880 (0.0009) [2023-10-08 03:42:47,575][52059] Updated weights for policy 1, policy_version 94062 (0.0009) [2023-10-08 03:42:47,892][52060] Updated weights for policy 0, policy_version 92890 (0.0009) [2023-10-08 03:42:47,943][52059] Updated weights for policy 1, policy_version 94072 (0.0009) [2023-10-08 03:42:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 191463424. Throughput: 0: 1687.4, 1: 1725.5. Samples: 47872606. Policy #0 lag: (min: 31.0, avg: 31.6, max: 47.0) [2023-10-08 03:42:51,211][50642] Avg episode reward: [(0, '15.920'), (1, '21.480')] [2023-10-08 03:42:51,703][52060] Updated weights for policy 0, policy_version 92900 (0.0008) [2023-10-08 03:42:51,822][52059] Updated weights for policy 1, policy_version 94082 (0.0009) [2023-10-08 03:42:52,067][52060] Updated weights for policy 0, policy_version 92910 (0.0007) [2023-10-08 03:42:52,195][52059] Updated weights for policy 1, policy_version 94092 (0.0008) [2023-10-08 03:42:52,443][52060] Updated weights for policy 0, policy_version 92920 (0.0008) [2023-10-08 03:42:52,547][52059] Updated weights for policy 1, policy_version 94102 (0.0009) [2023-10-08 03:42:52,913][52059] Updated weights for policy 1, policy_version 94112 (0.0009) [2023-10-08 03:42:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 191528960. Throughput: 0: 1720.4, 1: 1733.7. Samples: 47894086. Policy #0 lag: (min: 31.0, avg: 31.6, max: 47.0) [2023-10-08 03:42:56,211][50642] Avg episode reward: [(0, '17.000'), (1, '20.930')] [2023-10-08 03:42:56,274][52060] Updated weights for policy 0, policy_version 92930 (0.0007) [2023-10-08 03:42:56,635][52060] Updated weights for policy 0, policy_version 92940 (0.0007) [2023-10-08 03:42:56,820][52059] Updated weights for policy 1, policy_version 94122 (0.0010) [2023-10-08 03:42:57,005][52060] Updated weights for policy 0, policy_version 92950 (0.0007) [2023-10-08 03:42:57,173][52059] Updated weights for policy 1, policy_version 94132 (0.0008) [2023-10-08 03:42:57,368][52060] Updated weights for policy 0, policy_version 92960 (0.0008) [2023-10-08 03:42:57,543][52059] Updated weights for policy 1, policy_version 94142 (0.0008) [2023-10-08 03:43:01,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 191594496. Throughput: 0: 1729.2, 1: 1746.7. Samples: 47915602. Policy #0 lag: (min: 31.0, avg: 31.6, max: 47.0) [2023-10-08 03:43:01,211][50642] Avg episode reward: [(0, '15.300'), (1, '18.800')] [2023-10-08 03:43:01,318][52060] Updated weights for policy 0, policy_version 92970 (0.0008) [2023-10-08 03:43:01,597][52059] Updated weights for policy 1, policy_version 94152 (0.0008) [2023-10-08 03:43:01,691][52060] Updated weights for policy 0, policy_version 92980 (0.0008) [2023-10-08 03:43:01,953][52059] Updated weights for policy 1, policy_version 94162 (0.0009) [2023-10-08 03:43:02,054][52060] Updated weights for policy 0, policy_version 92990 (0.0007) [2023-10-08 03:43:02,315][52059] Updated weights for policy 1, policy_version 94172 (0.0007) [2023-10-08 03:43:05,954][52060] Updated weights for policy 0, policy_version 93000 (0.0008) [2023-10-08 03:43:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 191660032. Throughput: 0: 1711.9, 1: 1722.1. Samples: 47925032. Policy #0 lag: (min: 31.0, avg: 31.6, max: 47.0) [2023-10-08 03:43:06,211][50642] Avg episode reward: [(0, '17.060'), (1, '19.430')] [2023-10-08 03:43:06,315][52060] Updated weights for policy 0, policy_version 93010 (0.0008) [2023-10-08 03:43:06,343][52059] Updated weights for policy 1, policy_version 94182 (0.0008) [2023-10-08 03:43:06,682][52060] Updated weights for policy 0, policy_version 93020 (0.0007) [2023-10-08 03:43:06,728][52059] Updated weights for policy 1, policy_version 94192 (0.0007) [2023-10-08 03:43:07,098][52059] Updated weights for policy 1, policy_version 94202 (0.0007) [2023-10-08 03:43:10,694][52060] Updated weights for policy 0, policy_version 93030 (0.0007) [2023-10-08 03:43:11,055][52060] Updated weights for policy 0, policy_version 93040 (0.0007) [2023-10-08 03:43:11,092][52059] Updated weights for policy 1, policy_version 94212 (0.0009) [2023-10-08 03:43:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 191725568. Throughput: 0: 1728.7, 1: 1744.4. Samples: 47946016. Policy #0 lag: (min: 31.0, avg: 31.6, max: 47.0) [2023-10-08 03:43:11,211][50642] Avg episode reward: [(0, '15.030'), (1, '21.300')] [2023-10-08 03:43:11,420][52060] Updated weights for policy 0, policy_version 93050 (0.0008) [2023-10-08 03:43:11,458][52059] Updated weights for policy 1, policy_version 94222 (0.0009) [2023-10-08 03:43:11,816][52059] Updated weights for policy 1, policy_version 94232 (0.0011) [2023-10-08 03:43:15,479][52060] Updated weights for policy 0, policy_version 93060 (0.0011) [2023-10-08 03:43:15,668][52059] Updated weights for policy 1, policy_version 94242 (0.0008) [2023-10-08 03:43:15,850][52060] Updated weights for policy 0, policy_version 93070 (0.0009) [2023-10-08 03:43:16,032][52059] Updated weights for policy 1, policy_version 94252 (0.0009) [2023-10-08 03:43:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 191791104. Throughput: 0: 1712.1, 1: 1741.5. Samples: 47966690. Policy #0 lag: (min: 31.0, avg: 31.6, max: 47.0) [2023-10-08 03:43:16,211][50642] Avg episode reward: [(0, '14.860'), (1, '24.270')] [2023-10-08 03:43:16,217][52060] Updated weights for policy 0, policy_version 93080 (0.0007) [2023-10-08 03:43:16,386][52059] Updated weights for policy 1, policy_version 94262 (0.0009) [2023-10-08 03:43:16,750][52059] Updated weights for policy 1, policy_version 94272 (0.0009) [2023-10-08 03:43:20,385][52060] Updated weights for policy 0, policy_version 93090 (0.0010) [2023-10-08 03:43:20,736][52059] Updated weights for policy 1, policy_version 94282 (0.0009) [2023-10-08 03:43:20,769][52060] Updated weights for policy 0, policy_version 93100 (0.0008) [2023-10-08 03:43:21,108][52059] Updated weights for policy 1, policy_version 94292 (0.0007) [2023-10-08 03:43:21,139][52060] Updated weights for policy 0, policy_version 93110 (0.0008) [2023-10-08 03:43:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 191856640. Throughput: 0: 1722.4, 1: 1732.9. Samples: 47976640. Policy #0 lag: (min: 31.0, avg: 31.6, max: 47.0) [2023-10-08 03:43:21,211][50642] Avg episode reward: [(0, '16.390'), (1, '23.380')] [2023-10-08 03:43:21,467][52059] Updated weights for policy 1, policy_version 94302 (0.0008) [2023-10-08 03:43:21,508][52060] Updated weights for policy 0, policy_version 93120 (0.0007) [2023-10-08 03:43:25,480][52059] Updated weights for policy 1, policy_version 94312 (0.0009) [2023-10-08 03:43:25,520][52060] Updated weights for policy 0, policy_version 93130 (0.0009) [2023-10-08 03:43:25,851][52059] Updated weights for policy 1, policy_version 94322 (0.0009) [2023-10-08 03:43:25,888][52060] Updated weights for policy 0, policy_version 93140 (0.0008) [2023-10-08 03:43:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 191922176. Throughput: 0: 1717.7, 1: 1737.8. Samples: 47997650. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:43:26,211][50642] Avg episode reward: [(0, '14.170'), (1, '22.950')] [2023-10-08 03:43:26,214][52059] Updated weights for policy 1, policy_version 94332 (0.0007) [2023-10-08 03:43:26,265][52060] Updated weights for policy 0, policy_version 93150 (0.0008) [2023-10-08 03:43:29,972][52059] Updated weights for policy 1, policy_version 94342 (0.0008) [2023-10-08 03:43:30,286][52060] Updated weights for policy 0, policy_version 93160 (0.0008) [2023-10-08 03:43:30,341][52059] Updated weights for policy 1, policy_version 94352 (0.0007) [2023-10-08 03:43:30,657][52060] Updated weights for policy 0, policy_version 93170 (0.0008) [2023-10-08 03:43:30,700][52059] Updated weights for policy 1, policy_version 94362 (0.0007) [2023-10-08 03:43:31,024][52060] Updated weights for policy 0, policy_version 93180 (0.0008) [2023-10-08 03:43:31,210][50642] Fps is (10 sec: 19661.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 192053248. Throughput: 0: 1698.1, 1: 1718.1. Samples: 48017086. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:43:31,211][50642] Avg episode reward: [(0, '15.350'), (1, '21.770')] [2023-10-08 03:43:34,587][52059] Updated weights for policy 1, policy_version 94372 (0.0008) [2023-10-08 03:43:34,950][52059] Updated weights for policy 1, policy_version 94382 (0.0008) [2023-10-08 03:43:35,032][52060] Updated weights for policy 0, policy_version 93190 (0.0008) [2023-10-08 03:43:35,311][52059] Updated weights for policy 1, policy_version 94392 (0.0008) [2023-10-08 03:43:35,398][52060] Updated weights for policy 0, policy_version 93200 (0.0008) [2023-10-08 03:43:35,765][52060] Updated weights for policy 0, policy_version 93210 (0.0007) [2023-10-08 03:43:36,210][50642] Fps is (10 sec: 19660.6, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 192118784. Throughput: 0: 1717.2, 1: 1749.2. Samples: 48028594. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:43:36,211][50642] Avg episode reward: [(0, '14.500'), (1, '25.210')] [2023-10-08 03:43:39,280][52059] Updated weights for policy 1, policy_version 94402 (0.0009) [2023-10-08 03:43:39,643][52059] Updated weights for policy 1, policy_version 94412 (0.0009) [2023-10-08 03:43:39,644][52060] Updated weights for policy 0, policy_version 93220 (0.0007) [2023-10-08 03:43:40,008][52060] Updated weights for policy 0, policy_version 93230 (0.0007) [2023-10-08 03:43:40,010][52059] Updated weights for policy 1, policy_version 94422 (0.0007) [2023-10-08 03:43:40,371][52059] Updated weights for policy 1, policy_version 94432 (0.0007) [2023-10-08 03:43:40,378][52060] Updated weights for policy 0, policy_version 93240 (0.0008) [2023-10-08 03:43:41,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 192184320. Throughput: 0: 1705.1, 1: 1733.6. Samples: 48048828. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:43:41,211][50642] Avg episode reward: [(0, '14.440'), (1, '24.440')] [2023-10-08 03:43:44,363][52059] Updated weights for policy 1, policy_version 94442 (0.0009) [2023-10-08 03:43:44,377][52060] Updated weights for policy 0, policy_version 93250 (0.0009) [2023-10-08 03:43:44,729][52059] Updated weights for policy 1, policy_version 94452 (0.0008) [2023-10-08 03:43:44,745][52060] Updated weights for policy 0, policy_version 93260 (0.0008) [2023-10-08 03:43:45,102][52059] Updated weights for policy 1, policy_version 94462 (0.0008) [2023-10-08 03:43:45,115][52060] Updated weights for policy 0, policy_version 93270 (0.0008) [2023-10-08 03:43:45,479][52060] Updated weights for policy 0, policy_version 93280 (0.0009) [2023-10-08 03:43:46,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 192249856. Throughput: 0: 1677.9, 1: 1719.7. Samples: 48068496. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:43:46,211][50642] Avg episode reward: [(0, '15.420'), (1, '21.130')] [2023-10-08 03:43:48,941][52059] Updated weights for policy 1, policy_version 94472 (0.0010) [2023-10-08 03:43:49,305][52059] Updated weights for policy 1, policy_version 94482 (0.0007) [2023-10-08 03:43:49,442][52060] Updated weights for policy 0, policy_version 93290 (0.0009) [2023-10-08 03:43:49,669][52059] Updated weights for policy 1, policy_version 94492 (0.0008) [2023-10-08 03:43:49,802][52060] Updated weights for policy 0, policy_version 93300 (0.0007) [2023-10-08 03:43:50,175][52060] Updated weights for policy 0, policy_version 93310 (0.0007) [2023-10-08 03:43:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 192315392. Throughput: 0: 1705.8, 1: 1744.4. Samples: 48080290. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:43:51,211][50642] Avg episode reward: [(0, '13.320'), (1, '21.090')] [2023-10-08 03:43:53,640][52059] Updated weights for policy 1, policy_version 94502 (0.0007) [2023-10-08 03:43:54,005][52059] Updated weights for policy 1, policy_version 94512 (0.0009) [2023-10-08 03:43:54,272][52060] Updated weights for policy 0, policy_version 93320 (0.0008) [2023-10-08 03:43:54,371][52059] Updated weights for policy 1, policy_version 94522 (0.0007) [2023-10-08 03:43:54,637][52060] Updated weights for policy 0, policy_version 93330 (0.0008) [2023-10-08 03:43:55,019][52060] Updated weights for policy 0, policy_version 93340 (0.0007) [2023-10-08 03:43:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 192380928. Throughput: 0: 1679.8, 1: 1725.5. Samples: 48099256. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:43:56,211][50642] Avg episode reward: [(0, '16.040'), (1, '23.610')] [2023-10-08 03:43:58,358][52059] Updated weights for policy 1, policy_version 94532 (0.0007) [2023-10-08 03:43:58,720][52059] Updated weights for policy 1, policy_version 94542 (0.0008) [2023-10-08 03:43:58,804][52060] Updated weights for policy 0, policy_version 93350 (0.0008) [2023-10-08 03:43:59,078][52059] Updated weights for policy 1, policy_version 94552 (0.0008) [2023-10-08 03:43:59,173][52060] Updated weights for policy 0, policy_version 93360 (0.0009) [2023-10-08 03:43:59,546][52060] Updated weights for policy 0, policy_version 93370 (0.0008) [2023-10-08 03:44:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 192446464. Throughput: 0: 1691.4, 1: 1728.6. Samples: 48120588. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:44:01,211][50642] Avg episode reward: [(0, '13.760'), (1, '23.190')] [2023-10-08 03:44:02,931][52059] Updated weights for policy 1, policy_version 94562 (0.0008) [2023-10-08 03:44:03,301][52059] Updated weights for policy 1, policy_version 94572 (0.0008) [2023-10-08 03:44:03,378][52060] Updated weights for policy 0, policy_version 93380 (0.0009) [2023-10-08 03:44:03,661][52059] Updated weights for policy 1, policy_version 94582 (0.0007) [2023-10-08 03:44:03,749][52060] Updated weights for policy 0, policy_version 93390 (0.0008) [2023-10-08 03:44:04,035][52059] Updated weights for policy 1, policy_version 94592 (0.0008) [2023-10-08 03:44:04,103][52060] Updated weights for policy 0, policy_version 93400 (0.0009) [2023-10-08 03:44:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 192512000. Throughput: 0: 1702.0, 1: 1728.8. Samples: 48131024. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:44:06,211][50642] Avg episode reward: [(0, '15.170'), (1, '21.530')] [2023-10-08 03:44:08,099][52059] Updated weights for policy 1, policy_version 94602 (0.0008) [2023-10-08 03:44:08,169][52060] Updated weights for policy 0, policy_version 93410 (0.0009) [2023-10-08 03:44:08,465][52059] Updated weights for policy 1, policy_version 94612 (0.0007) [2023-10-08 03:44:08,577][52060] Updated weights for policy 0, policy_version 93420 (0.0008) [2023-10-08 03:44:08,822][52059] Updated weights for policy 1, policy_version 94622 (0.0008) [2023-10-08 03:44:08,945][52060] Updated weights for policy 0, policy_version 93430 (0.0009) [2023-10-08 03:44:09,316][52060] Updated weights for policy 0, policy_version 93440 (0.0009) [2023-10-08 03:44:11,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 192577536. Throughput: 0: 1688.7, 1: 1721.6. Samples: 48151114. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:44:11,211][50642] Avg episode reward: [(0, '14.420'), (1, '20.510')] [2023-10-08 03:44:12,594][52059] Updated weights for policy 1, policy_version 94632 (0.0007) [2023-10-08 03:44:12,961][52059] Updated weights for policy 1, policy_version 94642 (0.0008) [2023-10-08 03:44:13,319][52059] Updated weights for policy 1, policy_version 94652 (0.0008) [2023-10-08 03:44:13,383][52060] Updated weights for policy 0, policy_version 93450 (0.0008) [2023-10-08 03:44:13,748][52060] Updated weights for policy 0, policy_version 93460 (0.0008) [2023-10-08 03:44:14,121][52060] Updated weights for policy 0, policy_version 93470 (0.0009) [2023-10-08 03:44:16,210][50642] Fps is (10 sec: 13106.8, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 192643072. Throughput: 0: 1708.2, 1: 1741.5. Samples: 48172322. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:44:16,211][50642] Avg episode reward: [(0, '14.830'), (1, '21.480')] [2023-10-08 03:44:17,307][52059] Updated weights for policy 1, policy_version 94662 (0.0008) [2023-10-08 03:44:17,671][52059] Updated weights for policy 1, policy_version 94672 (0.0008) [2023-10-08 03:44:18,031][52059] Updated weights for policy 1, policy_version 94682 (0.0008) [2023-10-08 03:44:18,136][52060] Updated weights for policy 0, policy_version 93480 (0.0009) [2023-10-08 03:44:18,506][52060] Updated weights for policy 0, policy_version 93490 (0.0009) [2023-10-08 03:44:18,872][52060] Updated weights for policy 0, policy_version 93500 (0.0007) [2023-10-08 03:44:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 192708608. Throughput: 0: 1697.2, 1: 1713.9. Samples: 48182094. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:44:21,211][50642] Avg episode reward: [(0, '14.610'), (1, '22.260')] [2023-10-08 03:44:21,953][52059] Updated weights for policy 1, policy_version 94692 (0.0010) [2023-10-08 03:44:22,328][52059] Updated weights for policy 1, policy_version 94702 (0.0011) [2023-10-08 03:44:22,685][52059] Updated weights for policy 1, policy_version 94712 (0.0010) [2023-10-08 03:44:23,084][52060] Updated weights for policy 0, policy_version 93510 (0.0008) [2023-10-08 03:44:23,457][52060] Updated weights for policy 0, policy_version 93520 (0.0007) [2023-10-08 03:44:23,820][52060] Updated weights for policy 0, policy_version 93530 (0.0009) [2023-10-08 03:44:26,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 192774144. Throughput: 0: 1696.6, 1: 1729.7. Samples: 48203014. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:44:26,211][50642] Avg episode reward: [(0, '14.370'), (1, '24.450')] [2023-10-08 03:44:26,642][52059] Updated weights for policy 1, policy_version 94722 (0.0008) [2023-10-08 03:44:27,005][52059] Updated weights for policy 1, policy_version 94732 (0.0008) [2023-10-08 03:44:27,361][52059] Updated weights for policy 1, policy_version 94742 (0.0008) [2023-10-08 03:44:27,724][52059] Updated weights for policy 1, policy_version 94752 (0.0007) [2023-10-08 03:44:27,889][52060] Updated weights for policy 0, policy_version 93540 (0.0010) [2023-10-08 03:44:28,244][52060] Updated weights for policy 0, policy_version 93550 (0.0010) [2023-10-08 03:44:28,606][52060] Updated weights for policy 0, policy_version 93560 (0.0007) [2023-10-08 03:44:31,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.1, 300 sec: 13773.7). Total num frames: 192839680. Throughput: 0: 1720.8, 1: 1747.2. Samples: 48224556. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:44:31,211][50642] Avg episode reward: [(0, '15.580'), (1, '22.770')] [2023-10-08 03:44:31,646][52059] Updated weights for policy 1, policy_version 94762 (0.0007) [2023-10-08 03:44:32,013][52059] Updated weights for policy 1, policy_version 94772 (0.0007) [2023-10-08 03:44:32,380][52059] Updated weights for policy 1, policy_version 94782 (0.0007) [2023-10-08 03:44:32,632][52060] Updated weights for policy 0, policy_version 93570 (0.0008) [2023-10-08 03:44:32,995][52060] Updated weights for policy 0, policy_version 93580 (0.0007) [2023-10-08 03:44:33,368][52060] Updated weights for policy 0, policy_version 93590 (0.0008) [2023-10-08 03:44:33,735][52060] Updated weights for policy 0, policy_version 93600 (0.0010) [2023-10-08 03:44:36,115][52059] Updated weights for policy 1, policy_version 94792 (0.0008) [2023-10-08 03:44:36,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13773.7). Total num frames: 192905216. Throughput: 0: 1696.1, 1: 1724.4. Samples: 48234210. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:44:36,211][50642] Avg episode reward: [(0, '14.370'), (1, '23.000')] [2023-10-08 03:44:36,485][52059] Updated weights for policy 1, policy_version 94802 (0.0009) [2023-10-08 03:44:36,854][52059] Updated weights for policy 1, policy_version 94812 (0.0009) [2023-10-08 03:44:37,496][52060] Updated weights for policy 0, policy_version 93610 (0.0011) [2023-10-08 03:44:37,858][52060] Updated weights for policy 0, policy_version 93620 (0.0009) [2023-10-08 03:44:38,238][52060] Updated weights for policy 0, policy_version 93630 (0.0009) [2023-10-08 03:44:41,000][52059] Updated weights for policy 1, policy_version 94822 (0.0008) [2023-10-08 03:44:41,211][50642] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 192970752. Throughput: 0: 1723.1, 1: 1752.0. Samples: 48255640. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:44:41,212][50642] Avg episode reward: [(0, '14.730'), (1, '23.400')] [2023-10-08 03:44:41,377][52059] Updated weights for policy 1, policy_version 94832 (0.0010) [2023-10-08 03:44:41,741][52059] Updated weights for policy 1, policy_version 94842 (0.0008) [2023-10-08 03:44:42,165][52060] Updated weights for policy 0, policy_version 93640 (0.0008) [2023-10-08 03:44:42,531][52060] Updated weights for policy 0, policy_version 93650 (0.0010) [2023-10-08 03:44:42,911][52060] Updated weights for policy 0, policy_version 93660 (0.0009) [2023-10-08 03:44:45,411][52059] Updated weights for policy 1, policy_version 94852 (0.0008) [2023-10-08 03:44:45,777][52059] Updated weights for policy 1, policy_version 94862 (0.0007) [2023-10-08 03:44:46,134][52059] Updated weights for policy 1, policy_version 94872 (0.0007) [2023-10-08 03:44:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 193036288. Throughput: 0: 1727.6, 1: 1735.2. Samples: 48276410. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:44:46,211][50642] Avg episode reward: [(0, '12.460'), (1, '24.340')] [2023-10-08 03:44:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000093664_95911936.pth... [2023-10-08 03:44:46,254][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000092064_94273536.pth [2023-10-08 03:44:46,424][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000094880_97157120.pth... [2023-10-08 03:44:46,452][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000093248_95485952.pth [2023-10-08 03:44:46,688][52060] Updated weights for policy 0, policy_version 93670 (0.0008) [2023-10-08 03:44:47,057][52060] Updated weights for policy 0, policy_version 93680 (0.0007) [2023-10-08 03:44:47,417][52060] Updated weights for policy 0, policy_version 93690 (0.0008) [2023-10-08 03:44:50,091][52059] Updated weights for policy 1, policy_version 94882 (0.0007) [2023-10-08 03:44:50,453][52059] Updated weights for policy 1, policy_version 94892 (0.0010) [2023-10-08 03:44:50,810][52059] Updated weights for policy 1, policy_version 94902 (0.0009) [2023-10-08 03:44:51,182][52059] Updated weights for policy 1, policy_version 94912 (0.0007) [2023-10-08 03:44:51,210][50642] Fps is (10 sec: 16384.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 193134592. Throughput: 0: 1707.0, 1: 1742.3. Samples: 48286240. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:44:51,211][50642] Avg episode reward: [(0, '13.020'), (1, '25.450')] [2023-10-08 03:44:51,520][52060] Updated weights for policy 0, policy_version 93700 (0.0009) [2023-10-08 03:44:51,887][52060] Updated weights for policy 0, policy_version 93710 (0.0008) [2023-10-08 03:44:52,245][52060] Updated weights for policy 0, policy_version 93720 (0.0010) [2023-10-08 03:44:55,173][52059] Updated weights for policy 1, policy_version 94922 (0.0008) [2023-10-08 03:44:55,537][52059] Updated weights for policy 1, policy_version 94932 (0.0009) [2023-10-08 03:44:55,900][52059] Updated weights for policy 1, policy_version 94942 (0.0007) [2023-10-08 03:44:56,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 193200128. Throughput: 0: 1726.1, 1: 1747.7. Samples: 48307430. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:44:56,211][50642] Avg episode reward: [(0, '11.010'), (1, '25.940')] [2023-10-08 03:44:56,263][52060] Updated weights for policy 0, policy_version 93730 (0.0011) [2023-10-08 03:44:56,671][52060] Updated weights for policy 0, policy_version 93740 (0.0011) [2023-10-08 03:44:57,044][52060] Updated weights for policy 0, policy_version 93750 (0.0011) [2023-10-08 03:44:57,412][52060] Updated weights for policy 0, policy_version 93760 (0.0010) [2023-10-08 03:44:59,827][52059] Updated weights for policy 1, policy_version 94952 (0.0009) [2023-10-08 03:45:00,189][52059] Updated weights for policy 1, policy_version 94962 (0.0008) [2023-10-08 03:45:00,558][52059] Updated weights for policy 1, policy_version 94972 (0.0009) [2023-10-08 03:45:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 193265664. Throughput: 0: 1727.2, 1: 1726.1. Samples: 48327722. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:45:01,211][50642] Avg episode reward: [(0, '12.860'), (1, '23.640')] [2023-10-08 03:45:01,461][52060] Updated weights for policy 0, policy_version 93770 (0.0008) [2023-10-08 03:45:01,825][52060] Updated weights for policy 0, policy_version 93780 (0.0010) [2023-10-08 03:45:02,202][52060] Updated weights for policy 0, policy_version 93790 (0.0011) [2023-10-08 03:45:04,556][52059] Updated weights for policy 1, policy_version 94982 (0.0009) [2023-10-08 03:45:04,908][52059] Updated weights for policy 1, policy_version 94992 (0.0008) [2023-10-08 03:45:05,289][52059] Updated weights for policy 1, policy_version 95002 (0.0008) [2023-10-08 03:45:06,131][52060] Updated weights for policy 0, policy_version 93800 (0.0007) [2023-10-08 03:45:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 193331200. Throughput: 0: 1720.5, 1: 1753.6. Samples: 48338430. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:45:06,211][50642] Avg episode reward: [(0, '11.120'), (1, '25.150')] [2023-10-08 03:45:06,500][52060] Updated weights for policy 0, policy_version 93810 (0.0008) [2023-10-08 03:45:06,875][52060] Updated weights for policy 0, policy_version 93820 (0.0009) [2023-10-08 03:45:09,128][52059] Updated weights for policy 1, policy_version 95012 (0.0010) [2023-10-08 03:45:09,486][52059] Updated weights for policy 1, policy_version 95022 (0.0010) [2023-10-08 03:45:09,856][52059] Updated weights for policy 1, policy_version 95032 (0.0010) [2023-10-08 03:45:11,015][52060] Updated weights for policy 0, policy_version 93830 (0.0009) [2023-10-08 03:45:11,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 193396736. Throughput: 0: 1726.7, 1: 1729.2. Samples: 48358530. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:45:11,211][50642] Avg episode reward: [(0, '13.880'), (1, '25.440')] [2023-10-08 03:45:11,385][52060] Updated weights for policy 0, policy_version 93840 (0.0009) [2023-10-08 03:45:11,753][52060] Updated weights for policy 0, policy_version 93850 (0.0009) [2023-10-08 03:45:13,801][52059] Updated weights for policy 1, policy_version 95042 (0.0010) [2023-10-08 03:45:14,164][52059] Updated weights for policy 1, policy_version 95052 (0.0008) [2023-10-08 03:45:14,537][52059] Updated weights for policy 1, policy_version 95062 (0.0009) [2023-10-08 03:45:14,903][52059] Updated weights for policy 1, policy_version 95072 (0.0008) [2023-10-08 03:45:15,677][52060] Updated weights for policy 0, policy_version 93860 (0.0010) [2023-10-08 03:45:16,044][52060] Updated weights for policy 0, policy_version 93870 (0.0008) [2023-10-08 03:45:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 193462272. Throughput: 0: 1721.3, 1: 1713.8. Samples: 48379134. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:45:16,211][50642] Avg episode reward: [(0, '12.450'), (1, '26.660')] [2023-10-08 03:45:16,411][52060] Updated weights for policy 0, policy_version 93880 (0.0008) [2023-10-08 03:45:18,913][52059] Updated weights for policy 1, policy_version 95082 (0.0007) [2023-10-08 03:45:19,280][52059] Updated weights for policy 1, policy_version 95092 (0.0008) [2023-10-08 03:45:19,643][52059] Updated weights for policy 1, policy_version 95102 (0.0009) [2023-10-08 03:45:20,382][52060] Updated weights for policy 0, policy_version 93890 (0.0009) [2023-10-08 03:45:20,751][52060] Updated weights for policy 0, policy_version 93900 (0.0007) [2023-10-08 03:45:21,124][52060] Updated weights for policy 0, policy_version 93910 (0.0007) [2023-10-08 03:45:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 193527808. Throughput: 0: 1725.2, 1: 1733.7. Samples: 48389860. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:45:21,211][50642] Avg episode reward: [(0, '14.340'), (1, '24.740')] [2023-10-08 03:45:21,489][52060] Updated weights for policy 0, policy_version 93920 (0.0007) [2023-10-08 03:45:23,474][52059] Updated weights for policy 1, policy_version 95112 (0.0010) [2023-10-08 03:45:23,832][52059] Updated weights for policy 1, policy_version 95122 (0.0008) [2023-10-08 03:45:24,197][52059] Updated weights for policy 1, policy_version 95132 (0.0007) [2023-10-08 03:45:25,338][52060] Updated weights for policy 0, policy_version 93930 (0.0009) [2023-10-08 03:45:25,707][52060] Updated weights for policy 0, policy_version 93940 (0.0009) [2023-10-08 03:45:26,073][52060] Updated weights for policy 0, policy_version 93950 (0.0007) [2023-10-08 03:45:26,210][50642] Fps is (10 sec: 16384.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 193626112. Throughput: 0: 1722.5, 1: 1714.6. Samples: 48410310. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:45:26,211][50642] Avg episode reward: [(0, '10.940'), (1, '24.640')] [2023-10-08 03:45:28,301][52059] Updated weights for policy 1, policy_version 95142 (0.0008) [2023-10-08 03:45:28,689][52059] Updated weights for policy 1, policy_version 95152 (0.0007) [2023-10-08 03:45:29,051][52059] Updated weights for policy 1, policy_version 95162 (0.0010) [2023-10-08 03:45:29,881][52060] Updated weights for policy 0, policy_version 93960 (0.0009) [2023-10-08 03:45:30,253][52060] Updated weights for policy 0, policy_version 93970 (0.0008) [2023-10-08 03:45:30,613][52060] Updated weights for policy 0, policy_version 93980 (0.0009) [2023-10-08 03:45:31,210][50642] Fps is (10 sec: 16383.6, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 193691648. Throughput: 0: 1691.6, 1: 1724.6. Samples: 48430140. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:45:31,211][50642] Avg episode reward: [(0, '14.230'), (1, '25.140')] [2023-10-08 03:45:32,923][52059] Updated weights for policy 1, policy_version 95172 (0.0009) [2023-10-08 03:45:33,295][52059] Updated weights for policy 1, policy_version 95182 (0.0007) [2023-10-08 03:45:33,657][52059] Updated weights for policy 1, policy_version 95192 (0.0007) [2023-10-08 03:45:34,735][52060] Updated weights for policy 0, policy_version 93990 (0.0010) [2023-10-08 03:45:35,103][52060] Updated weights for policy 0, policy_version 94000 (0.0008) [2023-10-08 03:45:35,483][52060] Updated weights for policy 0, policy_version 94010 (0.0007) [2023-10-08 03:45:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 193757184. Throughput: 0: 1720.2, 1: 1718.5. Samples: 48440984. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:45:36,211][50642] Avg episode reward: [(0, '11.990'), (1, '25.510')] [2023-10-08 03:45:37,506][52059] Updated weights for policy 1, policy_version 95202 (0.0011) [2023-10-08 03:45:37,871][52059] Updated weights for policy 1, policy_version 95212 (0.0007) [2023-10-08 03:45:38,228][52059] Updated weights for policy 1, policy_version 95222 (0.0007) [2023-10-08 03:45:38,590][52059] Updated weights for policy 1, policy_version 95232 (0.0010) [2023-10-08 03:45:39,341][52060] Updated weights for policy 0, policy_version 94020 (0.0009) [2023-10-08 03:45:39,699][52060] Updated weights for policy 0, policy_version 94030 (0.0008) [2023-10-08 03:45:40,061][52060] Updated weights for policy 0, policy_version 94040 (0.0007) [2023-10-08 03:45:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 193822720. Throughput: 0: 1709.3, 1: 1721.6. Samples: 48461822. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:45:41,211][50642] Avg episode reward: [(0, '13.350'), (1, '23.520')] [2023-10-08 03:45:42,448][52059] Updated weights for policy 1, policy_version 95242 (0.0007) [2023-10-08 03:45:42,815][52059] Updated weights for policy 1, policy_version 95252 (0.0007) [2023-10-08 03:45:43,185][52059] Updated weights for policy 1, policy_version 95262 (0.0007) [2023-10-08 03:45:44,117][52060] Updated weights for policy 0, policy_version 94050 (0.0008) [2023-10-08 03:45:44,525][52060] Updated weights for policy 0, policy_version 94060 (0.0008) [2023-10-08 03:45:44,895][52060] Updated weights for policy 0, policy_version 94070 (0.0009) [2023-10-08 03:45:45,257][52060] Updated weights for policy 0, policy_version 94080 (0.0007) [2023-10-08 03:45:46,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 193888256. Throughput: 0: 1693.7, 1: 1744.8. Samples: 48482456. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:45:46,211][50642] Avg episode reward: [(0, '14.220'), (1, '22.710')] [2023-10-08 03:45:47,019][52059] Updated weights for policy 1, policy_version 95272 (0.0008) [2023-10-08 03:45:47,379][52059] Updated weights for policy 1, policy_version 95282 (0.0007) [2023-10-08 03:45:47,742][52059] Updated weights for policy 1, policy_version 95292 (0.0010) [2023-10-08 03:45:49,195][52060] Updated weights for policy 0, policy_version 94090 (0.0010) [2023-10-08 03:45:49,563][52060] Updated weights for policy 0, policy_version 94100 (0.0009) [2023-10-08 03:45:49,929][52060] Updated weights for policy 0, policy_version 94110 (0.0007) [2023-10-08 03:45:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 193953792. Throughput: 0: 1720.6, 1: 1714.0. Samples: 48492988. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:45:51,211][50642] Avg episode reward: [(0, '12.860'), (1, '23.740')] [2023-10-08 03:45:51,820][52059] Updated weights for policy 1, policy_version 95302 (0.0007) [2023-10-08 03:45:52,191][52059] Updated weights for policy 1, policy_version 95312 (0.0008) [2023-10-08 03:45:52,556][52059] Updated weights for policy 1, policy_version 95322 (0.0007) [2023-10-08 03:45:53,980][52060] Updated weights for policy 0, policy_version 94120 (0.0010) [2023-10-08 03:45:54,348][52060] Updated weights for policy 0, policy_version 94130 (0.0011) [2023-10-08 03:45:54,724][52060] Updated weights for policy 0, policy_version 94140 (0.0010) [2023-10-08 03:45:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 194019328. Throughput: 0: 1695.0, 1: 1737.1. Samples: 48512972. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:45:56,211][50642] Avg episode reward: [(0, '14.390'), (1, '24.260')] [2023-10-08 03:45:56,635][52059] Updated weights for policy 1, policy_version 95332 (0.0008) [2023-10-08 03:45:56,985][52059] Updated weights for policy 1, policy_version 95342 (0.0009) [2023-10-08 03:45:57,354][52059] Updated weights for policy 1, policy_version 95352 (0.0009) [2023-10-08 03:45:58,654][52060] Updated weights for policy 0, policy_version 94150 (0.0008) [2023-10-08 03:45:59,018][52060] Updated weights for policy 0, policy_version 94160 (0.0008) [2023-10-08 03:45:59,391][52060] Updated weights for policy 0, policy_version 94170 (0.0007) [2023-10-08 03:46:01,197][52059] Updated weights for policy 1, policy_version 95362 (0.0008) [2023-10-08 03:46:01,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 194084864. Throughput: 0: 1697.1, 1: 1746.4. Samples: 48534092. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:46:01,211][50642] Avg episode reward: [(0, '12.130'), (1, '25.560')] [2023-10-08 03:46:01,564][52059] Updated weights for policy 1, policy_version 95372 (0.0009) [2023-10-08 03:46:01,928][52059] Updated weights for policy 1, policy_version 95382 (0.0008) [2023-10-08 03:46:02,289][52059] Updated weights for policy 1, policy_version 95392 (0.0008) [2023-10-08 03:46:03,319][52060] Updated weights for policy 0, policy_version 94180 (0.0008) [2023-10-08 03:46:03,697][52060] Updated weights for policy 0, policy_version 94190 (0.0009) [2023-10-08 03:46:04,075][52060] Updated weights for policy 0, policy_version 94200 (0.0009) [2023-10-08 03:46:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 194150400. Throughput: 0: 1705.4, 1: 1723.3. Samples: 48544154. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:46:06,211][50642] Avg episode reward: [(0, '14.840'), (1, '23.890')] [2023-10-08 03:46:06,246][52059] Updated weights for policy 1, policy_version 95402 (0.0007) [2023-10-08 03:46:06,618][52059] Updated weights for policy 1, policy_version 95412 (0.0008) [2023-10-08 03:46:06,983][52059] Updated weights for policy 1, policy_version 95422 (0.0009) [2023-10-08 03:46:08,011][52060] Updated weights for policy 0, policy_version 94210 (0.0008) [2023-10-08 03:46:08,380][52060] Updated weights for policy 0, policy_version 94220 (0.0011) [2023-10-08 03:46:08,757][52060] Updated weights for policy 0, policy_version 94230 (0.0008) [2023-10-08 03:46:09,127][52060] Updated weights for policy 0, policy_version 94240 (0.0007) [2023-10-08 03:46:11,049][52059] Updated weights for policy 1, policy_version 95432 (0.0008) [2023-10-08 03:46:11,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 194215936. Throughput: 0: 1693.4, 1: 1742.1. Samples: 48564910. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:46:11,211][50642] Avg episode reward: [(0, '14.420'), (1, '25.940')] [2023-10-08 03:46:11,411][52059] Updated weights for policy 1, policy_version 95442 (0.0007) [2023-10-08 03:46:11,771][52059] Updated weights for policy 1, policy_version 95452 (0.0009) [2023-10-08 03:46:13,070][52060] Updated weights for policy 0, policy_version 94250 (0.0011) [2023-10-08 03:46:13,447][52060] Updated weights for policy 0, policy_version 94260 (0.0008) [2023-10-08 03:46:13,822][52060] Updated weights for policy 0, policy_version 94270 (0.0007) [2023-10-08 03:46:15,511][52059] Updated weights for policy 1, policy_version 95462 (0.0010) [2023-10-08 03:46:15,908][52059] Updated weights for policy 1, policy_version 95472 (0.0007) [2023-10-08 03:46:16,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 194281472. Throughput: 0: 1724.4, 1: 1740.4. Samples: 48586052. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:46:16,211][50642] Avg episode reward: [(0, '14.160'), (1, '24.650')] [2023-10-08 03:46:16,276][52059] Updated weights for policy 1, policy_version 95482 (0.0009) [2023-10-08 03:46:17,696][52060] Updated weights for policy 0, policy_version 94280 (0.0009) [2023-10-08 03:46:18,066][52060] Updated weights for policy 0, policy_version 94290 (0.0009) [2023-10-08 03:46:18,439][52060] Updated weights for policy 0, policy_version 94300 (0.0008) [2023-10-08 03:46:20,082][52059] Updated weights for policy 1, policy_version 95492 (0.0008) [2023-10-08 03:46:20,449][52059] Updated weights for policy 1, policy_version 95502 (0.0007) [2023-10-08 03:46:20,808][52059] Updated weights for policy 1, policy_version 95512 (0.0009) [2023-10-08 03:46:21,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 194379776. Throughput: 0: 1697.5, 1: 1749.5. Samples: 48596100. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 03:46:21,211][50642] Avg episode reward: [(0, '13.000'), (1, '27.260')] [2023-10-08 03:46:22,447][52060] Updated weights for policy 0, policy_version 94310 (0.0009) [2023-10-08 03:46:22,803][52060] Updated weights for policy 0, policy_version 94320 (0.0010) [2023-10-08 03:46:23,167][52060] Updated weights for policy 0, policy_version 94330 (0.0008) [2023-10-08 03:46:24,621][52059] Updated weights for policy 1, policy_version 95522 (0.0011) [2023-10-08 03:46:24,978][52059] Updated weights for policy 1, policy_version 95532 (0.0010) [2023-10-08 03:46:25,347][52059] Updated weights for policy 1, policy_version 95542 (0.0009) [2023-10-08 03:46:25,705][52059] Updated weights for policy 1, policy_version 95552 (0.0009) [2023-10-08 03:46:26,210][50642] Fps is (10 sec: 16383.7, 60 sec: 13653.3, 300 sec: 13884.7). Total num frames: 194445312. Throughput: 0: 1706.5, 1: 1746.0. Samples: 48617184. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 03:46:26,211][50642] Avg episode reward: [(0, '14.890'), (1, '25.260')] [2023-10-08 03:46:27,168][52060] Updated weights for policy 0, policy_version 94340 (0.0009) [2023-10-08 03:46:27,541][52060] Updated weights for policy 0, policy_version 94350 (0.0008) [2023-10-08 03:46:27,914][52060] Updated weights for policy 0, policy_version 94360 (0.0008) [2023-10-08 03:46:29,627][52059] Updated weights for policy 1, policy_version 95562 (0.0008) [2023-10-08 03:46:29,998][52059] Updated weights for policy 1, policy_version 95572 (0.0007) [2023-10-08 03:46:30,359][52059] Updated weights for policy 1, policy_version 95582 (0.0009) [2023-10-08 03:46:31,210][50642] Fps is (10 sec: 13106.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 194510848. Throughput: 0: 1722.7, 1: 1724.6. Samples: 48637586. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 03:46:31,211][50642] Avg episode reward: [(0, '13.810'), (1, '25.120')] [2023-10-08 03:46:32,050][52060] Updated weights for policy 0, policy_version 94370 (0.0008) [2023-10-08 03:46:32,439][52060] Updated weights for policy 0, policy_version 94380 (0.0007) [2023-10-08 03:46:32,802][52060] Updated weights for policy 0, policy_version 94390 (0.0007) [2023-10-08 03:46:33,163][52060] Updated weights for policy 0, policy_version 94400 (0.0007) [2023-10-08 03:46:34,185][52059] Updated weights for policy 1, policy_version 95592 (0.0009) [2023-10-08 03:46:34,553][52059] Updated weights for policy 1, policy_version 95602 (0.0008) [2023-10-08 03:46:34,911][52059] Updated weights for policy 1, policy_version 95612 (0.0008) [2023-10-08 03:46:36,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 194576384. Throughput: 0: 1694.2, 1: 1759.5. Samples: 48648406. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 03:46:36,211][50642] Avg episode reward: [(0, '13.520'), (1, '25.930')] [2023-10-08 03:46:37,055][52060] Updated weights for policy 0, policy_version 94410 (0.0010) [2023-10-08 03:46:37,431][52060] Updated weights for policy 0, policy_version 94420 (0.0009) [2023-10-08 03:46:37,798][52060] Updated weights for policy 0, policy_version 94430 (0.0009) [2023-10-08 03:46:38,885][52059] Updated weights for policy 1, policy_version 95622 (0.0007) [2023-10-08 03:46:39,248][52059] Updated weights for policy 1, policy_version 95632 (0.0007) [2023-10-08 03:46:39,613][52059] Updated weights for policy 1, policy_version 95642 (0.0007) [2023-10-08 03:46:41,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 194641920. Throughput: 0: 1722.8, 1: 1729.7. Samples: 48668334. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 03:46:41,211][50642] Avg episode reward: [(0, '14.440'), (1, '27.550')] [2023-10-08 03:46:41,949][52060] Updated weights for policy 0, policy_version 94440 (0.0008) [2023-10-08 03:46:42,319][52060] Updated weights for policy 0, policy_version 94450 (0.0010) [2023-10-08 03:46:42,683][52060] Updated weights for policy 0, policy_version 94460 (0.0010) [2023-10-08 03:46:43,553][52059] Updated weights for policy 1, policy_version 95652 (0.0008) [2023-10-08 03:46:43,924][52059] Updated weights for policy 1, policy_version 95662 (0.0008) [2023-10-08 03:46:44,288][52059] Updated weights for policy 1, policy_version 95672 (0.0008) [2023-10-08 03:46:46,211][50642] Fps is (10 sec: 13106.7, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 194707456. Throughput: 0: 1724.3, 1: 1728.0. Samples: 48689450. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 03:46:46,212][50642] Avg episode reward: [(0, '12.270'), (1, '25.950')] [2023-10-08 03:46:46,224][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000095680_97976320.pth... [2023-10-08 03:46:46,224][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000094464_96731136.pth... [2023-10-08 03:46:46,253][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000094048_96305152.pth [2023-10-08 03:46:46,259][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000092864_95092736.pth [2023-10-08 03:46:46,712][52060] Updated weights for policy 0, policy_version 94470 (0.0010) [2023-10-08 03:46:47,087][52060] Updated weights for policy 0, policy_version 94480 (0.0008) [2023-10-08 03:46:47,449][52060] Updated weights for policy 0, policy_version 94490 (0.0009) [2023-10-08 03:46:48,232][52059] Updated weights for policy 1, policy_version 95682 (0.0008) [2023-10-08 03:46:48,594][52059] Updated weights for policy 1, policy_version 95692 (0.0007) [2023-10-08 03:46:48,961][52059] Updated weights for policy 1, policy_version 95702 (0.0007) [2023-10-08 03:46:49,323][52059] Updated weights for policy 1, policy_version 95712 (0.0007) [2023-10-08 03:46:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 194772992. Throughput: 0: 1706.0, 1: 1743.0. Samples: 48699362. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 03:46:51,211][50642] Avg episode reward: [(0, '15.690'), (1, '25.110')] [2023-10-08 03:46:51,420][52060] Updated weights for policy 0, policy_version 94500 (0.0010) [2023-10-08 03:46:51,783][52060] Updated weights for policy 0, policy_version 94510 (0.0010) [2023-10-08 03:46:52,156][52060] Updated weights for policy 0, policy_version 94520 (0.0011) [2023-10-08 03:46:53,106][52059] Updated weights for policy 1, policy_version 95722 (0.0010) [2023-10-08 03:46:53,478][52059] Updated weights for policy 1, policy_version 95732 (0.0011) [2023-10-08 03:46:53,847][52059] Updated weights for policy 1, policy_version 95742 (0.0007) [2023-10-08 03:46:56,166][52060] Updated weights for policy 0, policy_version 94530 (0.0009) [2023-10-08 03:46:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 194838528. Throughput: 0: 1717.3, 1: 1740.3. Samples: 48720504. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 03:46:56,211][50642] Avg episode reward: [(0, '11.750'), (1, '25.140')] [2023-10-08 03:46:56,529][52060] Updated weights for policy 0, policy_version 94540 (0.0010) [2023-10-08 03:46:56,891][52060] Updated weights for policy 0, policy_version 94550 (0.0008) [2023-10-08 03:46:57,266][52060] Updated weights for policy 0, policy_version 94560 (0.0009) [2023-10-08 03:46:57,770][52059] Updated weights for policy 1, policy_version 95752 (0.0011) [2023-10-08 03:46:58,136][52059] Updated weights for policy 1, policy_version 95762 (0.0009) [2023-10-08 03:46:58,495][52059] Updated weights for policy 1, policy_version 95772 (0.0007) [2023-10-08 03:47:01,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 194904064. Throughput: 0: 1716.3, 1: 1743.3. Samples: 48741734. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 03:47:01,211][50642] Avg episode reward: [(0, '14.900'), (1, '25.860')] [2023-10-08 03:47:01,274][52060] Updated weights for policy 0, policy_version 94570 (0.0010) [2023-10-08 03:47:01,641][52060] Updated weights for policy 0, policy_version 94580 (0.0010) [2023-10-08 03:47:02,010][52060] Updated weights for policy 0, policy_version 94590 (0.0008) [2023-10-08 03:47:02,453][52059] Updated weights for policy 1, policy_version 95782 (0.0008) [2023-10-08 03:47:02,843][52059] Updated weights for policy 1, policy_version 95792 (0.0010) [2023-10-08 03:47:03,197][52059] Updated weights for policy 1, policy_version 95802 (0.0010) [2023-10-08 03:47:05,947][52060] Updated weights for policy 0, policy_version 94600 (0.0010) [2023-10-08 03:47:06,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13662.6). Total num frames: 194969600. Throughput: 0: 1716.2, 1: 1727.1. Samples: 48751046. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 03:47:06,211][50642] Avg episode reward: [(0, '13.540'), (1, '26.980')] [2023-10-08 03:47:06,309][52060] Updated weights for policy 0, policy_version 94610 (0.0010) [2023-10-08 03:47:06,683][52060] Updated weights for policy 0, policy_version 94620 (0.0009) [2023-10-08 03:47:06,952][52059] Updated weights for policy 1, policy_version 95812 (0.0011) [2023-10-08 03:47:07,308][52059] Updated weights for policy 1, policy_version 95822 (0.0007) [2023-10-08 03:47:07,667][52059] Updated weights for policy 1, policy_version 95832 (0.0008) [2023-10-08 03:47:10,644][52060] Updated weights for policy 0, policy_version 94630 (0.0009) [2023-10-08 03:47:11,022][52060] Updated weights for policy 0, policy_version 94640 (0.0007) [2023-10-08 03:47:11,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 195035136. Throughput: 0: 1713.5, 1: 1737.2. Samples: 48772464. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 03:47:11,211][50642] Avg episode reward: [(0, '13.230'), (1, '24.940')] [2023-10-08 03:47:11,396][52060] Updated weights for policy 0, policy_version 94650 (0.0007) [2023-10-08 03:47:11,540][52059] Updated weights for policy 1, policy_version 95842 (0.0009) [2023-10-08 03:47:11,901][52059] Updated weights for policy 1, policy_version 95852 (0.0010) [2023-10-08 03:47:12,278][52059] Updated weights for policy 1, policy_version 95862 (0.0009) [2023-10-08 03:47:12,640][52059] Updated weights for policy 1, policy_version 95872 (0.0007) [2023-10-08 03:47:15,379][52060] Updated weights for policy 0, policy_version 94660 (0.0008) [2023-10-08 03:47:15,735][52060] Updated weights for policy 0, policy_version 94670 (0.0007) [2023-10-08 03:47:16,112][52060] Updated weights for policy 0, policy_version 94680 (0.0007) [2023-10-08 03:47:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 195100672. Throughput: 0: 1694.2, 1: 1762.0. Samples: 48793116. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 03:47:16,211][50642] Avg episode reward: [(0, '14.380'), (1, '25.040')] [2023-10-08 03:47:16,548][52059] Updated weights for policy 1, policy_version 95882 (0.0008) [2023-10-08 03:47:16,910][52059] Updated weights for policy 1, policy_version 95892 (0.0007) [2023-10-08 03:47:17,277][52059] Updated weights for policy 1, policy_version 95902 (0.0009) [2023-10-08 03:47:20,252][52060] Updated weights for policy 0, policy_version 94690 (0.0008) [2023-10-08 03:47:20,647][52060] Updated weights for policy 0, policy_version 94700 (0.0008) [2023-10-08 03:47:21,010][52060] Updated weights for policy 0, policy_version 94710 (0.0007) [2023-10-08 03:47:21,199][52059] Updated weights for policy 1, policy_version 95912 (0.0008) [2023-10-08 03:47:21,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 195166208. Throughput: 0: 1710.9, 1: 1731.1. Samples: 48803296. Policy #0 lag: (min: 31.0, avg: 34.0, max: 63.0) [2023-10-08 03:47:21,211][50642] Avg episode reward: [(0, '13.350'), (1, '25.930')] [2023-10-08 03:47:21,374][52060] Updated weights for policy 0, policy_version 94720 (0.0008) [2023-10-08 03:47:21,560][52059] Updated weights for policy 1, policy_version 95922 (0.0009) [2023-10-08 03:47:21,926][52059] Updated weights for policy 1, policy_version 95932 (0.0009) [2023-10-08 03:47:25,318][52060] Updated weights for policy 0, policy_version 94730 (0.0010) [2023-10-08 03:47:25,687][52060] Updated weights for policy 0, policy_version 94740 (0.0009) [2023-10-08 03:47:25,791][52059] Updated weights for policy 1, policy_version 95942 (0.0008) [2023-10-08 03:47:26,054][52060] Updated weights for policy 0, policy_version 94750 (0.0009) [2023-10-08 03:47:26,156][52059] Updated weights for policy 1, policy_version 95952 (0.0007) [2023-10-08 03:47:26,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 195264512. Throughput: 0: 1710.9, 1: 1763.3. Samples: 48824668. Policy #0 lag: (min: 8.0, avg: 24.5, max: 40.0) [2023-10-08 03:47:26,211][50642] Avg episode reward: [(0, '15.580'), (1, '26.790')] [2023-10-08 03:47:26,529][52059] Updated weights for policy 1, policy_version 95962 (0.0009) [2023-10-08 03:47:29,990][52060] Updated weights for policy 0, policy_version 94760 (0.0010) [2023-10-08 03:47:30,360][52060] Updated weights for policy 0, policy_version 94770 (0.0011) [2023-10-08 03:47:30,503][52059] Updated weights for policy 1, policy_version 95972 (0.0009) [2023-10-08 03:47:30,725][52060] Updated weights for policy 0, policy_version 94780 (0.0008) [2023-10-08 03:47:30,866][52059] Updated weights for policy 1, policy_version 95982 (0.0008) [2023-10-08 03:47:31,210][50642] Fps is (10 sec: 16383.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 195330048. Throughput: 0: 1689.2, 1: 1755.3. Samples: 48844450. Policy #0 lag: (min: 8.0, avg: 24.5, max: 40.0) [2023-10-08 03:47:31,211][50642] Avg episode reward: [(0, '13.350'), (1, '25.640')] [2023-10-08 03:47:31,233][52059] Updated weights for policy 1, policy_version 95992 (0.0007) [2023-10-08 03:47:34,629][52060] Updated weights for policy 0, policy_version 94790 (0.0009) [2023-10-08 03:47:35,006][52060] Updated weights for policy 0, policy_version 94800 (0.0009) [2023-10-08 03:47:35,106][52059] Updated weights for policy 1, policy_version 96002 (0.0008) [2023-10-08 03:47:35,365][52060] Updated weights for policy 0, policy_version 94810 (0.0007) [2023-10-08 03:47:35,473][52059] Updated weights for policy 1, policy_version 96012 (0.0009) [2023-10-08 03:47:35,830][52059] Updated weights for policy 1, policy_version 96022 (0.0007) [2023-10-08 03:47:36,199][52059] Updated weights for policy 1, policy_version 96032 (0.0008) [2023-10-08 03:47:36,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 195428352. Throughput: 0: 1721.4, 1: 1753.0. Samples: 48855712. Policy #0 lag: (min: 8.0, avg: 24.5, max: 40.0) [2023-10-08 03:47:36,211][50642] Avg episode reward: [(0, '14.840'), (1, '23.450')] [2023-10-08 03:47:39,327][52060] Updated weights for policy 0, policy_version 94820 (0.0009) [2023-10-08 03:47:39,704][52060] Updated weights for policy 0, policy_version 94830 (0.0009) [2023-10-08 03:47:39,994][52059] Updated weights for policy 1, policy_version 96042 (0.0007) [2023-10-08 03:47:40,067][52060] Updated weights for policy 0, policy_version 94840 (0.0008) [2023-10-08 03:47:40,358][52059] Updated weights for policy 1, policy_version 96052 (0.0008) [2023-10-08 03:47:40,723][52059] Updated weights for policy 1, policy_version 96062 (0.0008) [2023-10-08 03:47:41,210][50642] Fps is (10 sec: 16384.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 195493888. Throughput: 0: 1706.3, 1: 1757.8. Samples: 48876388. Policy #0 lag: (min: 8.0, avg: 24.5, max: 40.0) [2023-10-08 03:47:41,211][50642] Avg episode reward: [(0, '13.920'), (1, '24.120')] [2023-10-08 03:47:44,148][52060] Updated weights for policy 0, policy_version 94850 (0.0008) [2023-10-08 03:47:44,514][52060] Updated weights for policy 0, policy_version 94860 (0.0009) [2023-10-08 03:47:44,550][52059] Updated weights for policy 1, policy_version 96072 (0.0009) [2023-10-08 03:47:44,887][52060] Updated weights for policy 0, policy_version 94870 (0.0008) [2023-10-08 03:47:44,909][52059] Updated weights for policy 1, policy_version 96082 (0.0007) [2023-10-08 03:47:45,248][52060] Updated weights for policy 0, policy_version 94880 (0.0007) [2023-10-08 03:47:45,276][52059] Updated weights for policy 1, policy_version 96092 (0.0007) [2023-10-08 03:47:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.6, 300 sec: 13884.7). Total num frames: 195559424. Throughput: 0: 1686.3, 1: 1740.8. Samples: 48895956. Policy #0 lag: (min: 8.0, avg: 24.5, max: 40.0) [2023-10-08 03:47:46,211][50642] Avg episode reward: [(0, '15.010'), (1, '24.680')] [2023-10-08 03:47:49,307][52059] Updated weights for policy 1, policy_version 96102 (0.0008) [2023-10-08 03:47:49,356][52060] Updated weights for policy 0, policy_version 94890 (0.0008) [2023-10-08 03:47:49,696][52059] Updated weights for policy 1, policy_version 96112 (0.0010) [2023-10-08 03:47:49,723][52060] Updated weights for policy 0, policy_version 94900 (0.0008) [2023-10-08 03:47:50,055][52059] Updated weights for policy 1, policy_version 96122 (0.0009) [2023-10-08 03:47:50,094][52060] Updated weights for policy 0, policy_version 94910 (0.0008) [2023-10-08 03:47:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 195624960. Throughput: 0: 1712.3, 1: 1774.3. Samples: 48907946. Policy #0 lag: (min: 8.0, avg: 24.5, max: 40.0) [2023-10-08 03:47:51,211][50642] Avg episode reward: [(0, '16.110'), (1, '24.090')] [2023-10-08 03:47:54,094][52059] Updated weights for policy 1, policy_version 96132 (0.0010) [2023-10-08 03:47:54,131][52060] Updated weights for policy 0, policy_version 94920 (0.0009) [2023-10-08 03:47:54,452][52059] Updated weights for policy 1, policy_version 96142 (0.0007) [2023-10-08 03:47:54,499][52060] Updated weights for policy 0, policy_version 94930 (0.0008) [2023-10-08 03:47:54,808][52059] Updated weights for policy 1, policy_version 96152 (0.0008) [2023-10-08 03:47:54,867][52060] Updated weights for policy 0, policy_version 94940 (0.0008) [2023-10-08 03:47:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 195690496. Throughput: 0: 1686.9, 1: 1737.5. Samples: 48926564. Policy #0 lag: (min: 8.0, avg: 24.5, max: 40.0) [2023-10-08 03:47:56,211][50642] Avg episode reward: [(0, '13.280'), (1, '24.910')] [2023-10-08 03:47:58,610][52059] Updated weights for policy 1, policy_version 96162 (0.0007) [2023-10-08 03:47:58,926][52060] Updated weights for policy 0, policy_version 94950 (0.0008) [2023-10-08 03:47:58,960][52059] Updated weights for policy 1, policy_version 96172 (0.0008) [2023-10-08 03:47:59,292][52060] Updated weights for policy 0, policy_version 94960 (0.0007) [2023-10-08 03:47:59,322][52059] Updated weights for policy 1, policy_version 96182 (0.0007) [2023-10-08 03:47:59,654][52060] Updated weights for policy 0, policy_version 94970 (0.0008) [2023-10-08 03:47:59,688][52059] Updated weights for policy 1, policy_version 96192 (0.0007) [2023-10-08 03:48:01,211][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 195756032. Throughput: 0: 1695.5, 1: 1732.5. Samples: 48947376. Policy #0 lag: (min: 8.0, avg: 24.5, max: 40.0) [2023-10-08 03:48:01,212][50642] Avg episode reward: [(0, '14.010'), (1, '21.540')] [2023-10-08 03:48:03,503][52059] Updated weights for policy 1, policy_version 96202 (0.0007) [2023-10-08 03:48:03,665][52060] Updated weights for policy 0, policy_version 94980 (0.0008) [2023-10-08 03:48:03,869][52059] Updated weights for policy 1, policy_version 96212 (0.0009) [2023-10-08 03:48:04,032][52060] Updated weights for policy 0, policy_version 94990 (0.0008) [2023-10-08 03:48:04,234][52059] Updated weights for policy 1, policy_version 96222 (0.0008) [2023-10-08 03:48:04,406][52060] Updated weights for policy 0, policy_version 95000 (0.0009) [2023-10-08 03:48:06,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 195821568. Throughput: 0: 1701.1, 1: 1745.2. Samples: 48958376. Policy #0 lag: (min: 8.0, avg: 24.5, max: 40.0) [2023-10-08 03:48:06,211][50642] Avg episode reward: [(0, '15.020'), (1, '23.470')] [2023-10-08 03:48:08,145][52059] Updated weights for policy 1, policy_version 96232 (0.0008) [2023-10-08 03:48:08,385][52060] Updated weights for policy 0, policy_version 95010 (0.0007) [2023-10-08 03:48:08,506][52059] Updated weights for policy 1, policy_version 96242 (0.0008) [2023-10-08 03:48:08,800][52060] Updated weights for policy 0, policy_version 95020 (0.0007) [2023-10-08 03:48:08,875][52059] Updated weights for policy 1, policy_version 96252 (0.0007) [2023-10-08 03:48:09,168][52060] Updated weights for policy 0, policy_version 95030 (0.0008) [2023-10-08 03:48:09,520][52060] Updated weights for policy 0, policy_version 95040 (0.0008) [2023-10-08 03:48:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 195887104. Throughput: 0: 1679.5, 1: 1734.8. Samples: 48978314. Policy #0 lag: (min: 8.0, avg: 24.5, max: 40.0) [2023-10-08 03:48:11,211][50642] Avg episode reward: [(0, '14.130'), (1, '26.600')] [2023-10-08 03:48:12,696][52059] Updated weights for policy 1, policy_version 96262 (0.0008) [2023-10-08 03:48:13,062][52059] Updated weights for policy 1, policy_version 96272 (0.0007) [2023-10-08 03:48:13,421][52059] Updated weights for policy 1, policy_version 96282 (0.0008) [2023-10-08 03:48:13,569][52060] Updated weights for policy 0, policy_version 95050 (0.0009) [2023-10-08 03:48:13,935][52060] Updated weights for policy 0, policy_version 95060 (0.0008) [2023-10-08 03:48:14,303][52060] Updated weights for policy 0, policy_version 95070 (0.0007) [2023-10-08 03:48:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 195952640. Throughput: 0: 1700.4, 1: 1747.3. Samples: 48999594. Policy #0 lag: (min: 8.0, avg: 24.5, max: 40.0) [2023-10-08 03:48:16,211][50642] Avg episode reward: [(0, '17.130'), (1, '27.250')] [2023-10-08 03:48:17,333][52059] Updated weights for policy 1, policy_version 96292 (0.0010) [2023-10-08 03:48:17,692][52059] Updated weights for policy 1, policy_version 96302 (0.0010) [2023-10-08 03:48:18,051][52059] Updated weights for policy 1, policy_version 96312 (0.0010) [2023-10-08 03:48:18,246][52060] Updated weights for policy 0, policy_version 95080 (0.0008) [2023-10-08 03:48:18,619][52060] Updated weights for policy 0, policy_version 95090 (0.0008) [2023-10-08 03:48:18,981][52060] Updated weights for policy 0, policy_version 95100 (0.0009) [2023-10-08 03:48:21,210][50642] Fps is (10 sec: 13107.6, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 196018176. Throughput: 0: 1680.4, 1: 1732.4. Samples: 49009292. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:48:21,211][50642] Avg episode reward: [(0, '14.540'), (1, '24.560')] [2023-10-08 03:48:22,065][52059] Updated weights for policy 1, policy_version 96322 (0.0009) [2023-10-08 03:48:22,433][52059] Updated weights for policy 1, policy_version 96332 (0.0009) [2023-10-08 03:48:22,798][52059] Updated weights for policy 1, policy_version 96342 (0.0009) [2023-10-08 03:48:22,892][52060] Updated weights for policy 0, policy_version 95110 (0.0008) [2023-10-08 03:48:23,170][52059] Updated weights for policy 1, policy_version 96352 (0.0009) [2023-10-08 03:48:23,259][52060] Updated weights for policy 0, policy_version 95120 (0.0007) [2023-10-08 03:48:23,629][52060] Updated weights for policy 0, policy_version 95130 (0.0009) [2023-10-08 03:48:26,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 196083712. Throughput: 0: 1688.3, 1: 1729.6. Samples: 49030190. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:48:26,211][50642] Avg episode reward: [(0, '15.630'), (1, '23.450')] [2023-10-08 03:48:27,163][52059] Updated weights for policy 1, policy_version 96362 (0.0009) [2023-10-08 03:48:27,528][52059] Updated weights for policy 1, policy_version 96372 (0.0009) [2023-10-08 03:48:27,549][52060] Updated weights for policy 0, policy_version 95140 (0.0008) [2023-10-08 03:48:27,889][52059] Updated weights for policy 1, policy_version 96382 (0.0008) [2023-10-08 03:48:27,929][52060] Updated weights for policy 0, policy_version 95150 (0.0008) [2023-10-08 03:48:28,293][52060] Updated weights for policy 0, policy_version 95160 (0.0010) [2023-10-08 03:48:31,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 196149248. Throughput: 0: 1706.4, 1: 1750.7. Samples: 49051524. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:48:31,211][50642] Avg episode reward: [(0, '15.830'), (1, '24.260')] [2023-10-08 03:48:31,807][52059] Updated weights for policy 1, policy_version 96392 (0.0008) [2023-10-08 03:48:32,177][52059] Updated weights for policy 1, policy_version 96402 (0.0009) [2023-10-08 03:48:32,262][52060] Updated weights for policy 0, policy_version 95170 (0.0007) [2023-10-08 03:48:32,536][52059] Updated weights for policy 1, policy_version 96412 (0.0008) [2023-10-08 03:48:32,632][52060] Updated weights for policy 0, policy_version 95180 (0.0007) [2023-10-08 03:48:32,991][52060] Updated weights for policy 0, policy_version 95190 (0.0007) [2023-10-08 03:48:33,361][52060] Updated weights for policy 0, policy_version 95200 (0.0009) [2023-10-08 03:48:36,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.1, 300 sec: 13662.6). Total num frames: 196214784. Throughput: 0: 1680.4, 1: 1720.4. Samples: 49060980. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:48:36,211][50642] Avg episode reward: [(0, '16.110'), (1, '27.030')] [2023-10-08 03:48:36,521][52059] Updated weights for policy 1, policy_version 96422 (0.0008) [2023-10-08 03:48:36,917][52059] Updated weights for policy 1, policy_version 96432 (0.0009) [2023-10-08 03:48:37,289][52059] Updated weights for policy 1, policy_version 96442 (0.0007) [2023-10-08 03:48:37,447][52060] Updated weights for policy 0, policy_version 95210 (0.0009) [2023-10-08 03:48:37,815][52060] Updated weights for policy 0, policy_version 95220 (0.0011) [2023-10-08 03:48:38,177][52060] Updated weights for policy 0, policy_version 95230 (0.0009) [2023-10-08 03:48:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 196280320. Throughput: 0: 1709.0, 1: 1744.4. Samples: 49081966. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:48:41,211][50642] Avg episode reward: [(0, '14.850'), (1, '27.840')] [2023-10-08 03:48:41,314][52059] Updated weights for policy 1, policy_version 96452 (0.0010) [2023-10-08 03:48:41,672][52059] Updated weights for policy 1, policy_version 96462 (0.0009) [2023-10-08 03:48:42,035][52059] Updated weights for policy 1, policy_version 96472 (0.0009) [2023-10-08 03:48:42,193][52060] Updated weights for policy 0, policy_version 95240 (0.0007) [2023-10-08 03:48:42,555][52060] Updated weights for policy 0, policy_version 95250 (0.0007) [2023-10-08 03:48:42,920][52060] Updated weights for policy 0, policy_version 95260 (0.0008) [2023-10-08 03:48:45,897][52059] Updated weights for policy 1, policy_version 96482 (0.0008) [2023-10-08 03:48:46,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 196345856. Throughput: 0: 1719.7, 1: 1742.7. Samples: 49103184. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:48:46,211][50642] Avg episode reward: [(0, '15.360'), (1, '24.210')] [2023-10-08 03:48:46,218][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000095264_97550336.pth... [2023-10-08 03:48:46,254][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000093664_95911936.pth [2023-10-08 03:48:46,259][52059] Updated weights for policy 1, policy_version 96492 (0.0010) [2023-10-08 03:48:46,631][52059] Updated weights for policy 1, policy_version 96502 (0.0008) [2023-10-08 03:48:46,889][52060] Updated weights for policy 0, policy_version 95270 (0.0009) [2023-10-08 03:48:46,984][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000096512_98828288.pth... [2023-10-08 03:48:46,987][52059] Updated weights for policy 1, policy_version 96512 (0.0009) [2023-10-08 03:48:47,013][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000094880_97157120.pth [2023-10-08 03:48:47,262][52060] Updated weights for policy 0, policy_version 95280 (0.0010) [2023-10-08 03:48:47,631][52060] Updated weights for policy 0, policy_version 95290 (0.0008) [2023-10-08 03:48:50,996][52059] Updated weights for policy 1, policy_version 96522 (0.0008) [2023-10-08 03:48:51,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.3, 300 sec: 13662.6). Total num frames: 196411392. Throughput: 0: 1698.8, 1: 1727.3. Samples: 49112554. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:48:51,211][50642] Avg episode reward: [(0, '14.550'), (1, '23.610')] [2023-10-08 03:48:51,361][52059] Updated weights for policy 1, policy_version 96532 (0.0007) [2023-10-08 03:48:51,724][52060] Updated weights for policy 0, policy_version 95300 (0.0008) [2023-10-08 03:48:51,725][52059] Updated weights for policy 1, policy_version 96542 (0.0007) [2023-10-08 03:48:52,094][52060] Updated weights for policy 0, policy_version 95310 (0.0008) [2023-10-08 03:48:52,458][52060] Updated weights for policy 0, policy_version 95320 (0.0007) [2023-10-08 03:48:55,569][52059] Updated weights for policy 1, policy_version 96552 (0.0011) [2023-10-08 03:48:55,929][52059] Updated weights for policy 1, policy_version 96562 (0.0009) [2023-10-08 03:48:56,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 196476928. Throughput: 0: 1719.3, 1: 1740.1. Samples: 49133988. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:48:56,211][50642] Avg episode reward: [(0, '14.560'), (1, '25.010')] [2023-10-08 03:48:56,294][52059] Updated weights for policy 1, policy_version 96572 (0.0007) [2023-10-08 03:48:56,477][52060] Updated weights for policy 0, policy_version 95330 (0.0007) [2023-10-08 03:48:56,873][52060] Updated weights for policy 0, policy_version 95340 (0.0009) [2023-10-08 03:48:57,249][52060] Updated weights for policy 0, policy_version 95350 (0.0009) [2023-10-08 03:48:57,614][52060] Updated weights for policy 0, policy_version 95360 (0.0008) [2023-10-08 03:49:00,222][52059] Updated weights for policy 1, policy_version 96582 (0.0009) [2023-10-08 03:49:00,586][52059] Updated weights for policy 1, policy_version 96592 (0.0010) [2023-10-08 03:49:00,949][52059] Updated weights for policy 1, policy_version 96602 (0.0009) [2023-10-08 03:49:01,210][50642] Fps is (10 sec: 16383.7, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 196575232. Throughput: 0: 1719.1, 1: 1716.1. Samples: 49154180. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:49:01,211][50642] Avg episode reward: [(0, '16.140'), (1, '28.550')] [2023-10-08 03:49:01,600][52060] Updated weights for policy 0, policy_version 95370 (0.0009) [2023-10-08 03:49:01,971][52060] Updated weights for policy 0, policy_version 95380 (0.0010) [2023-10-08 03:49:02,341][52060] Updated weights for policy 0, policy_version 95390 (0.0009) [2023-10-08 03:49:04,891][52059] Updated weights for policy 1, policy_version 96612 (0.0009) [2023-10-08 03:49:05,251][52059] Updated weights for policy 1, policy_version 96622 (0.0009) [2023-10-08 03:49:05,620][52059] Updated weights for policy 1, policy_version 96632 (0.0008) [2023-10-08 03:49:06,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 196640768. Throughput: 0: 1710.2, 1: 1738.7. Samples: 49164490. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:49:06,211][50642] Avg episode reward: [(0, '16.120'), (1, '23.970')] [2023-10-08 03:49:06,444][52060] Updated weights for policy 0, policy_version 95400 (0.0008) [2023-10-08 03:49:06,816][52060] Updated weights for policy 0, policy_version 95410 (0.0007) [2023-10-08 03:49:07,182][52060] Updated weights for policy 0, policy_version 95420 (0.0007) [2023-10-08 03:49:09,570][52059] Updated weights for policy 1, policy_version 96642 (0.0009) [2023-10-08 03:49:09,927][52059] Updated weights for policy 1, policy_version 96652 (0.0007) [2023-10-08 03:49:10,285][52059] Updated weights for policy 1, policy_version 96662 (0.0008) [2023-10-08 03:49:10,657][52059] Updated weights for policy 1, policy_version 96672 (0.0009) [2023-10-08 03:49:11,152][52060] Updated weights for policy 0, policy_version 95430 (0.0009) [2023-10-08 03:49:11,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 196706304. Throughput: 0: 1718.0, 1: 1730.4. Samples: 49185368. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:49:11,211][50642] Avg episode reward: [(0, '16.650'), (1, '20.660')] [2023-10-08 03:49:11,516][52060] Updated weights for policy 0, policy_version 95440 (0.0007) [2023-10-08 03:49:11,886][52060] Updated weights for policy 0, policy_version 95450 (0.0008) [2023-10-08 03:49:14,492][52059] Updated weights for policy 1, policy_version 96682 (0.0009) [2023-10-08 03:49:14,848][52059] Updated weights for policy 1, policy_version 96692 (0.0011) [2023-10-08 03:49:15,209][52059] Updated weights for policy 1, policy_version 96702 (0.0008) [2023-10-08 03:49:15,744][52060] Updated weights for policy 0, policy_version 95460 (0.0008) [2023-10-08 03:49:16,115][52060] Updated weights for policy 0, policy_version 95470 (0.0008) [2023-10-08 03:49:16,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 196771840. Throughput: 0: 1715.6, 1: 1708.6. Samples: 49205610. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:49:16,211][50642] Avg episode reward: [(0, '15.400'), (1, '24.630')] [2023-10-08 03:49:16,483][52060] Updated weights for policy 0, policy_version 95480 (0.0007) [2023-10-08 03:49:19,421][52059] Updated weights for policy 1, policy_version 96712 (0.0008) [2023-10-08 03:49:19,781][52059] Updated weights for policy 1, policy_version 96722 (0.0008) [2023-10-08 03:49:20,148][52059] Updated weights for policy 1, policy_version 96732 (0.0010) [2023-10-08 03:49:20,425][52060] Updated weights for policy 0, policy_version 95490 (0.0007) [2023-10-08 03:49:20,783][52060] Updated weights for policy 0, policy_version 95500 (0.0009) [2023-10-08 03:49:21,154][52060] Updated weights for policy 0, policy_version 95510 (0.0008) [2023-10-08 03:49:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 196837376. Throughput: 0: 1718.6, 1: 1737.0. Samples: 49216482. Policy #0 lag: (min: 31.0, avg: 31.0, max: 31.0) [2023-10-08 03:49:21,211][50642] Avg episode reward: [(0, '16.200'), (1, '25.580')] [2023-10-08 03:49:21,515][52060] Updated weights for policy 0, policy_version 95520 (0.0009) [2023-10-08 03:49:24,088][52059] Updated weights for policy 1, policy_version 96742 (0.0009) [2023-10-08 03:49:24,448][52059] Updated weights for policy 1, policy_version 96752 (0.0010) [2023-10-08 03:49:24,811][52059] Updated weights for policy 1, policy_version 96762 (0.0007) [2023-10-08 03:49:25,480][52060] Updated weights for policy 0, policy_version 95530 (0.0009) [2023-10-08 03:49:25,846][52060] Updated weights for policy 0, policy_version 95540 (0.0007) [2023-10-08 03:49:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 196902912. Throughput: 0: 1721.6, 1: 1719.4. Samples: 49236810. Policy #0 lag: (min: 31.0, avg: 36.3, max: 63.0) [2023-10-08 03:49:26,211][52060] Updated weights for policy 0, policy_version 95550 (0.0009) [2023-10-08 03:49:26,211][50642] Avg episode reward: [(0, '15.450'), (1, '23.880')] [2023-10-08 03:49:28,801][52059] Updated weights for policy 1, policy_version 96772 (0.0009) [2023-10-08 03:49:29,175][52059] Updated weights for policy 1, policy_version 96782 (0.0009) [2023-10-08 03:49:29,542][52059] Updated weights for policy 1, policy_version 96792 (0.0008) [2023-10-08 03:49:30,179][52060] Updated weights for policy 0, policy_version 95560 (0.0007) [2023-10-08 03:49:30,541][52060] Updated weights for policy 0, policy_version 95570 (0.0008) [2023-10-08 03:49:30,912][52060] Updated weights for policy 0, policy_version 95580 (0.0010) [2023-10-08 03:49:31,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 197001216. Throughput: 0: 1702.2, 1: 1716.8. Samples: 49257042. Policy #0 lag: (min: 31.0, avg: 36.3, max: 63.0) [2023-10-08 03:49:31,211][50642] Avg episode reward: [(0, '16.270'), (1, '20.720')] [2023-10-08 03:49:33,359][52059] Updated weights for policy 1, policy_version 96802 (0.0008) [2023-10-08 03:49:33,724][52059] Updated weights for policy 1, policy_version 96812 (0.0009) [2023-10-08 03:49:34,092][52059] Updated weights for policy 1, policy_version 96822 (0.0010) [2023-10-08 03:49:34,458][52059] Updated weights for policy 1, policy_version 96832 (0.0010) [2023-10-08 03:49:34,909][52060] Updated weights for policy 0, policy_version 95590 (0.0010) [2023-10-08 03:49:35,271][52060] Updated weights for policy 0, policy_version 95600 (0.0009) [2023-10-08 03:49:35,636][52060] Updated weights for policy 0, policy_version 95610 (0.0009) [2023-10-08 03:49:36,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 197066752. Throughput: 0: 1723.4, 1: 1734.4. Samples: 49268156. Policy #0 lag: (min: 31.0, avg: 36.3, max: 63.0) [2023-10-08 03:49:36,211][50642] Avg episode reward: [(0, '16.890'), (1, '21.570')] [2023-10-08 03:49:38,271][52059] Updated weights for policy 1, policy_version 96842 (0.0009) [2023-10-08 03:49:38,637][52059] Updated weights for policy 1, policy_version 96852 (0.0008) [2023-10-08 03:49:39,010][52059] Updated weights for policy 1, policy_version 96862 (0.0008) [2023-10-08 03:49:39,668][52060] Updated weights for policy 0, policy_version 95620 (0.0008) [2023-10-08 03:49:40,023][52060] Updated weights for policy 0, policy_version 95630 (0.0007) [2023-10-08 03:49:40,388][52060] Updated weights for policy 0, policy_version 95640 (0.0007) [2023-10-08 03:49:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 197132288. Throughput: 0: 1713.5, 1: 1722.4. Samples: 49288602. Policy #0 lag: (min: 31.0, avg: 36.3, max: 63.0) [2023-10-08 03:49:41,211][50642] Avg episode reward: [(0, '15.650'), (1, '24.870')] [2023-10-08 03:49:42,844][52059] Updated weights for policy 1, policy_version 96872 (0.0008) [2023-10-08 03:49:43,215][52059] Updated weights for policy 1, policy_version 96882 (0.0009) [2023-10-08 03:49:43,581][52059] Updated weights for policy 1, policy_version 96892 (0.0011) [2023-10-08 03:49:44,488][52060] Updated weights for policy 0, policy_version 95650 (0.0008) [2023-10-08 03:49:44,902][52060] Updated weights for policy 0, policy_version 95660 (0.0010) [2023-10-08 03:49:45,278][52060] Updated weights for policy 0, policy_version 95670 (0.0008) [2023-10-08 03:49:45,645][52060] Updated weights for policy 0, policy_version 95680 (0.0008) [2023-10-08 03:49:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 197197824. Throughput: 0: 1693.1, 1: 1745.9. Samples: 49308936. Policy #0 lag: (min: 31.0, avg: 36.3, max: 63.0) [2023-10-08 03:49:46,211][50642] Avg episode reward: [(0, '14.730'), (1, '25.880')] [2023-10-08 03:49:47,357][52059] Updated weights for policy 1, policy_version 96902 (0.0009) [2023-10-08 03:49:47,718][52059] Updated weights for policy 1, policy_version 96912 (0.0008) [2023-10-08 03:49:48,091][52059] Updated weights for policy 1, policy_version 96922 (0.0008) [2023-10-08 03:49:49,519][52060] Updated weights for policy 0, policy_version 95690 (0.0007) [2023-10-08 03:49:49,886][52060] Updated weights for policy 0, policy_version 95700 (0.0007) [2023-10-08 03:49:50,253][52060] Updated weights for policy 0, policy_version 95710 (0.0008) [2023-10-08 03:49:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 197263360. Throughput: 0: 1719.2, 1: 1730.7. Samples: 49319736. Policy #0 lag: (min: 31.0, avg: 36.3, max: 63.0) [2023-10-08 03:49:51,211][50642] Avg episode reward: [(0, '10.020'), (1, '23.480')] [2023-10-08 03:49:52,000][52059] Updated weights for policy 1, policy_version 96932 (0.0008) [2023-10-08 03:49:52,364][52059] Updated weights for policy 1, policy_version 96942 (0.0007) [2023-10-08 03:49:52,713][52059] Updated weights for policy 1, policy_version 96952 (0.0008) [2023-10-08 03:49:54,216][52060] Updated weights for policy 0, policy_version 95720 (0.0008) [2023-10-08 03:49:54,590][52060] Updated weights for policy 0, policy_version 95730 (0.0009) [2023-10-08 03:49:54,951][52060] Updated weights for policy 0, policy_version 95740 (0.0009) [2023-10-08 03:49:56,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 197328896. Throughput: 0: 1694.3, 1: 1743.5. Samples: 49340066. Policy #0 lag: (min: 31.0, avg: 36.3, max: 63.0) [2023-10-08 03:49:56,211][50642] Avg episode reward: [(0, '11.070'), (1, '24.250')] [2023-10-08 03:49:56,614][52059] Updated weights for policy 1, policy_version 96962 (0.0007) [2023-10-08 03:49:56,975][52059] Updated weights for policy 1, policy_version 96972 (0.0010) [2023-10-08 03:49:57,343][52059] Updated weights for policy 1, policy_version 96982 (0.0008) [2023-10-08 03:49:57,708][52059] Updated weights for policy 1, policy_version 96992 (0.0007) [2023-10-08 03:49:58,943][52060] Updated weights for policy 0, policy_version 95750 (0.0009) [2023-10-08 03:49:59,326][52060] Updated weights for policy 0, policy_version 95760 (0.0007) [2023-10-08 03:49:59,694][52060] Updated weights for policy 0, policy_version 95770 (0.0007) [2023-10-08 03:50:01,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 197394432. Throughput: 0: 1689.4, 1: 1762.0. Samples: 49360922. Policy #0 lag: (min: 31.0, avg: 36.3, max: 63.0) [2023-10-08 03:50:01,211][50642] Avg episode reward: [(0, '10.870'), (1, '27.850')] [2023-10-08 03:50:01,773][52059] Updated weights for policy 1, policy_version 97002 (0.0009) [2023-10-08 03:50:02,136][52059] Updated weights for policy 1, policy_version 97012 (0.0008) [2023-10-08 03:50:02,502][52059] Updated weights for policy 1, policy_version 97022 (0.0007) [2023-10-08 03:50:03,735][52060] Updated weights for policy 0, policy_version 95780 (0.0008) [2023-10-08 03:50:04,113][52060] Updated weights for policy 0, policy_version 95790 (0.0009) [2023-10-08 03:50:04,485][52060] Updated weights for policy 0, policy_version 95800 (0.0009) [2023-10-08 03:50:06,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 197459968. Throughput: 0: 1708.5, 1: 1733.6. Samples: 49371380. Policy #0 lag: (min: 31.0, avg: 36.3, max: 63.0) [2023-10-08 03:50:06,211][50642] Avg episode reward: [(0, '12.240'), (1, '27.820')] [2023-10-08 03:50:06,342][52059] Updated weights for policy 1, policy_version 97032 (0.0007) [2023-10-08 03:50:06,712][52059] Updated weights for policy 1, policy_version 97042 (0.0009) [2023-10-08 03:50:07,072][52059] Updated weights for policy 1, policy_version 97052 (0.0009) [2023-10-08 03:50:08,398][52060] Updated weights for policy 0, policy_version 95810 (0.0010) [2023-10-08 03:50:08,764][52060] Updated weights for policy 0, policy_version 95820 (0.0007) [2023-10-08 03:50:09,140][52060] Updated weights for policy 0, policy_version 95830 (0.0010) [2023-10-08 03:50:09,490][52060] Updated weights for policy 0, policy_version 95840 (0.0007) [2023-10-08 03:50:10,954][52059] Updated weights for policy 1, policy_version 97062 (0.0008) [2023-10-08 03:50:11,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 197525504. Throughput: 0: 1683.9, 1: 1761.8. Samples: 49391866. Policy #0 lag: (min: 31.0, avg: 36.3, max: 63.0) [2023-10-08 03:50:11,211][50642] Avg episode reward: [(0, '12.230'), (1, '24.850')] [2023-10-08 03:50:11,341][52059] Updated weights for policy 1, policy_version 97072 (0.0010) [2023-10-08 03:50:11,704][52059] Updated weights for policy 1, policy_version 97082 (0.0007) [2023-10-08 03:50:13,573][52060] Updated weights for policy 0, policy_version 95850 (0.0009) [2023-10-08 03:50:13,937][52060] Updated weights for policy 0, policy_version 95860 (0.0009) [2023-10-08 03:50:14,301][52060] Updated weights for policy 0, policy_version 95870 (0.0007) [2023-10-08 03:50:15,539][52059] Updated weights for policy 1, policy_version 97092 (0.0010) [2023-10-08 03:50:15,903][52059] Updated weights for policy 1, policy_version 97102 (0.0010) [2023-10-08 03:50:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 197591040. Throughput: 0: 1701.3, 1: 1754.5. Samples: 49412554. Policy #0 lag: (min: 31.0, avg: 36.3, max: 63.0) [2023-10-08 03:50:16,211][50642] Avg episode reward: [(0, '11.470'), (1, '23.910')] [2023-10-08 03:50:16,263][52059] Updated weights for policy 1, policy_version 97112 (0.0011) [2023-10-08 03:50:18,286][52060] Updated weights for policy 0, policy_version 95880 (0.0008) [2023-10-08 03:50:18,652][52060] Updated weights for policy 0, policy_version 95890 (0.0008) [2023-10-08 03:50:19,017][52060] Updated weights for policy 0, policy_version 95900 (0.0007) [2023-10-08 03:50:20,073][52059] Updated weights for policy 1, policy_version 97122 (0.0009) [2023-10-08 03:50:20,439][52059] Updated weights for policy 1, policy_version 97132 (0.0009) [2023-10-08 03:50:20,798][52059] Updated weights for policy 1, policy_version 97142 (0.0009) [2023-10-08 03:50:21,161][52059] Updated weights for policy 1, policy_version 97152 (0.0009) [2023-10-08 03:50:21,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 197689344. Throughput: 0: 1689.8, 1: 1751.4. Samples: 49423010. Policy #0 lag: (min: 31.0, avg: 36.3, max: 63.0) [2023-10-08 03:50:21,211][50642] Avg episode reward: [(0, '12.140'), (1, '24.710')] [2023-10-08 03:50:23,038][52060] Updated weights for policy 0, policy_version 95910 (0.0009) [2023-10-08 03:50:23,405][52060] Updated weights for policy 0, policy_version 95920 (0.0007) [2023-10-08 03:50:23,780][52060] Updated weights for policy 0, policy_version 95930 (0.0008) [2023-10-08 03:50:25,071][52059] Updated weights for policy 1, policy_version 97162 (0.0009) [2023-10-08 03:50:25,424][52059] Updated weights for policy 1, policy_version 97172 (0.0008) [2023-10-08 03:50:25,793][52059] Updated weights for policy 1, policy_version 97182 (0.0009) [2023-10-08 03:50:26,210][50642] Fps is (10 sec: 16384.2, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 197754880. Throughput: 0: 1688.7, 1: 1759.3. Samples: 49443762. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 03:50:26,211][50642] Avg episode reward: [(0, '11.930'), (1, '26.230')] [2023-10-08 03:50:27,650][52060] Updated weights for policy 0, policy_version 95940 (0.0009) [2023-10-08 03:50:28,017][52060] Updated weights for policy 0, policy_version 95950 (0.0010) [2023-10-08 03:50:28,383][52060] Updated weights for policy 0, policy_version 95960 (0.0011) [2023-10-08 03:50:29,630][52059] Updated weights for policy 1, policy_version 97192 (0.0010) [2023-10-08 03:50:30,000][52059] Updated weights for policy 1, policy_version 97202 (0.0008) [2023-10-08 03:50:30,360][52059] Updated weights for policy 1, policy_version 97212 (0.0007) [2023-10-08 03:50:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 197820416. Throughput: 0: 1713.8, 1: 1732.2. Samples: 49464004. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 03:50:31,211][50642] Avg episode reward: [(0, '12.620'), (1, '23.360')] [2023-10-08 03:50:32,666][52060] Updated weights for policy 0, policy_version 95970 (0.0010) [2023-10-08 03:50:33,040][52060] Updated weights for policy 0, policy_version 95980 (0.0007) [2023-10-08 03:50:33,405][52060] Updated weights for policy 0, policy_version 95990 (0.0009) [2023-10-08 03:50:33,779][52060] Updated weights for policy 0, policy_version 96000 (0.0010) [2023-10-08 03:50:34,317][52059] Updated weights for policy 1, policy_version 97222 (0.0011) [2023-10-08 03:50:34,681][52059] Updated weights for policy 1, policy_version 97232 (0.0009) [2023-10-08 03:50:35,052][52059] Updated weights for policy 1, policy_version 97242 (0.0009) [2023-10-08 03:50:36,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 197885952. Throughput: 0: 1681.6, 1: 1758.8. Samples: 49474554. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 03:50:36,211][50642] Avg episode reward: [(0, '12.800'), (1, '23.600')] [2023-10-08 03:50:37,640][52060] Updated weights for policy 0, policy_version 96010 (0.0008) [2023-10-08 03:50:38,015][52060] Updated weights for policy 0, policy_version 96020 (0.0007) [2023-10-08 03:50:38,390][52060] Updated weights for policy 0, policy_version 96030 (0.0009) [2023-10-08 03:50:39,007][52059] Updated weights for policy 1, policy_version 97252 (0.0009) [2023-10-08 03:50:39,374][52059] Updated weights for policy 1, policy_version 97262 (0.0007) [2023-10-08 03:50:39,731][52059] Updated weights for policy 1, policy_version 97272 (0.0009) [2023-10-08 03:50:41,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 197951488. Throughput: 0: 1707.6, 1: 1731.1. Samples: 49494806. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 03:50:41,211][50642] Avg episode reward: [(0, '13.550'), (1, '23.500')] [2023-10-08 03:50:42,307][52060] Updated weights for policy 0, policy_version 96040 (0.0007) [2023-10-08 03:50:42,667][52060] Updated weights for policy 0, policy_version 96050 (0.0010) [2023-10-08 03:50:43,036][52060] Updated weights for policy 0, policy_version 96060 (0.0010) [2023-10-08 03:50:43,714][52059] Updated weights for policy 1, policy_version 97282 (0.0009) [2023-10-08 03:50:44,076][52059] Updated weights for policy 1, policy_version 97292 (0.0008) [2023-10-08 03:50:44,441][52059] Updated weights for policy 1, policy_version 97302 (0.0008) [2023-10-08 03:50:44,810][52059] Updated weights for policy 1, policy_version 97312 (0.0009) [2023-10-08 03:50:46,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 198017024. Throughput: 0: 1718.9, 1: 1725.0. Samples: 49515898. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 03:50:46,211][50642] Avg episode reward: [(0, '13.750'), (1, '24.190')] [2023-10-08 03:50:46,220][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000097312_99647488.pth... [2023-10-08 03:50:46,220][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000096064_98369536.pth... [2023-10-08 03:50:46,256][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000095680_97976320.pth [2023-10-08 03:50:46,257][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000094464_96731136.pth [2023-10-08 03:50:46,260][51710] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p1/milestones/checkpoint_000097312_99647488.pth [2023-10-08 03:50:46,261][51605] Saving a milestone ./train_atari/atari_amidar_APPO/checkpoint_p0/milestones/checkpoint_000096064_98369536.pth [2023-10-08 03:50:46,842][52060] Updated weights for policy 0, policy_version 96070 (0.0009) [2023-10-08 03:50:47,217][52060] Updated weights for policy 0, policy_version 96080 (0.0008) [2023-10-08 03:50:47,599][52060] Updated weights for policy 0, policy_version 96090 (0.0011) [2023-10-08 03:50:48,737][52059] Updated weights for policy 1, policy_version 97322 (0.0011) [2023-10-08 03:50:49,105][52059] Updated weights for policy 1, policy_version 97332 (0.0009) [2023-10-08 03:50:49,470][52059] Updated weights for policy 1, policy_version 97342 (0.0007) [2023-10-08 03:50:51,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 198082560. Throughput: 0: 1694.9, 1: 1740.9. Samples: 49525990. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 03:50:51,211][50642] Avg episode reward: [(0, '13.360'), (1, '25.930')] [2023-10-08 03:50:51,794][52060] Updated weights for policy 0, policy_version 96100 (0.0010) [2023-10-08 03:50:52,164][52060] Updated weights for policy 0, policy_version 96110 (0.0011) [2023-10-08 03:50:52,528][52060] Updated weights for policy 0, policy_version 96120 (0.0011) [2023-10-08 03:50:53,466][52059] Updated weights for policy 1, policy_version 97352 (0.0007) [2023-10-08 03:50:53,827][52059] Updated weights for policy 1, policy_version 97362 (0.0009) [2023-10-08 03:50:54,188][52059] Updated weights for policy 1, policy_version 97372 (0.0009) [2023-10-08 03:50:56,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 198148096. Throughput: 0: 1713.4, 1: 1719.7. Samples: 49546354. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 03:50:56,211][50642] Avg episode reward: [(0, '14.070'), (1, '25.190')] [2023-10-08 03:50:56,488][52060] Updated weights for policy 0, policy_version 96130 (0.0009) [2023-10-08 03:50:56,857][52060] Updated weights for policy 0, policy_version 96140 (0.0007) [2023-10-08 03:50:57,231][52060] Updated weights for policy 0, policy_version 96150 (0.0007) [2023-10-08 03:50:57,607][52060] Updated weights for policy 0, policy_version 96160 (0.0008) [2023-10-08 03:50:58,180][52059] Updated weights for policy 1, policy_version 97382 (0.0009) [2023-10-08 03:50:58,565][52059] Updated weights for policy 1, policy_version 97392 (0.0008) [2023-10-08 03:50:58,924][52059] Updated weights for policy 1, policy_version 97402 (0.0007) [2023-10-08 03:51:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 198213632. Throughput: 0: 1712.9, 1: 1726.3. Samples: 49567320. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 03:51:01,211][50642] Avg episode reward: [(0, '14.090'), (1, '24.530')] [2023-10-08 03:51:01,750][52060] Updated weights for policy 0, policy_version 96170 (0.0008) [2023-10-08 03:51:02,126][52060] Updated weights for policy 0, policy_version 96180 (0.0008) [2023-10-08 03:51:02,501][52060] Updated weights for policy 0, policy_version 96190 (0.0010) [2023-10-08 03:51:02,766][52059] Updated weights for policy 1, policy_version 97412 (0.0008) [2023-10-08 03:51:03,129][52059] Updated weights for policy 1, policy_version 97422 (0.0008) [2023-10-08 03:51:03,492][52059] Updated weights for policy 1, policy_version 97432 (0.0008) [2023-10-08 03:51:06,124][52060] Updated weights for policy 0, policy_version 96200 (0.0008) [2023-10-08 03:51:06,210][50642] Fps is (10 sec: 13107.1, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 198279168. Throughput: 0: 1705.4, 1: 1715.7. Samples: 49576960. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 03:51:06,211][50642] Avg episode reward: [(0, '14.230'), (1, '25.450')] [2023-10-08 03:51:06,490][52060] Updated weights for policy 0, policy_version 96210 (0.0007) [2023-10-08 03:51:06,847][52060] Updated weights for policy 0, policy_version 96220 (0.0010) [2023-10-08 03:51:07,272][52059] Updated weights for policy 1, policy_version 97442 (0.0008) [2023-10-08 03:51:07,639][52059] Updated weights for policy 1, policy_version 97452 (0.0009) [2023-10-08 03:51:08,002][52059] Updated weights for policy 1, policy_version 97462 (0.0009) [2023-10-08 03:51:08,375][52059] Updated weights for policy 1, policy_version 97472 (0.0011) [2023-10-08 03:51:10,914][52060] Updated weights for policy 0, policy_version 96230 (0.0010) [2023-10-08 03:51:11,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 198344704. Throughput: 0: 1720.8, 1: 1720.2. Samples: 49598606. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 03:51:11,211][50642] Avg episode reward: [(0, '14.980'), (1, '24.730')] [2023-10-08 03:51:11,287][52060] Updated weights for policy 0, policy_version 96240 (0.0010) [2023-10-08 03:51:11,649][52060] Updated weights for policy 0, policy_version 96250 (0.0010) [2023-10-08 03:51:12,380][52059] Updated weights for policy 1, policy_version 97482 (0.0008) [2023-10-08 03:51:12,744][52059] Updated weights for policy 1, policy_version 97492 (0.0007) [2023-10-08 03:51:13,107][52059] Updated weights for policy 1, policy_version 97502 (0.0007) [2023-10-08 03:51:15,755][52060] Updated weights for policy 0, policy_version 96260 (0.0009) [2023-10-08 03:51:16,115][52060] Updated weights for policy 0, policy_version 96270 (0.0009) [2023-10-08 03:51:16,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 198410240. Throughput: 0: 1713.1, 1: 1745.3. Samples: 49619636. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 03:51:16,211][50642] Avg episode reward: [(0, '13.860'), (1, '25.190')] [2023-10-08 03:51:16,479][52060] Updated weights for policy 0, policy_version 96280 (0.0007) [2023-10-08 03:51:17,022][52059] Updated weights for policy 1, policy_version 97512 (0.0010) [2023-10-08 03:51:17,384][52059] Updated weights for policy 1, policy_version 97522 (0.0008) [2023-10-08 03:51:17,749][52059] Updated weights for policy 1, policy_version 97532 (0.0010) [2023-10-08 03:51:20,332][52060] Updated weights for policy 0, policy_version 96290 (0.0007) [2023-10-08 03:51:20,725][52060] Updated weights for policy 0, policy_version 96300 (0.0007) [2023-10-08 03:51:21,091][52060] Updated weights for policy 0, policy_version 96310 (0.0008) [2023-10-08 03:51:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 198475776. Throughput: 0: 1727.3, 1: 1711.9. Samples: 49629316. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 03:51:21,211][50642] Avg episode reward: [(0, '15.210'), (1, '25.810')] [2023-10-08 03:51:21,462][52060] Updated weights for policy 0, policy_version 96320 (0.0007) [2023-10-08 03:51:21,724][52059] Updated weights for policy 1, policy_version 97542 (0.0011) [2023-10-08 03:51:22,082][52059] Updated weights for policy 1, policy_version 97552 (0.0009) [2023-10-08 03:51:22,449][52059] Updated weights for policy 1, policy_version 97562 (0.0009) [2023-10-08 03:51:25,432][52060] Updated weights for policy 0, policy_version 96330 (0.0009) [2023-10-08 03:51:25,799][52060] Updated weights for policy 0, policy_version 96340 (0.0009) [2023-10-08 03:51:26,172][52060] Updated weights for policy 0, policy_version 96350 (0.0008) [2023-10-08 03:51:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 198541312. Throughput: 0: 1726.5, 1: 1735.1. Samples: 49650576. Policy #0 lag: (min: 31.0, avg: 38.9, max: 63.0) [2023-10-08 03:51:26,211][50642] Avg episode reward: [(0, '14.300'), (1, '24.190')] [2023-10-08 03:51:26,292][52059] Updated weights for policy 1, policy_version 97572 (0.0008) [2023-10-08 03:51:26,657][52059] Updated weights for policy 1, policy_version 97582 (0.0007) [2023-10-08 03:51:27,035][52059] Updated weights for policy 1, policy_version 97592 (0.0009) [2023-10-08 03:51:30,158][52060] Updated weights for policy 0, policy_version 96360 (0.0009) [2023-10-08 03:51:30,526][52060] Updated weights for policy 0, policy_version 96370 (0.0010) [2023-10-08 03:51:30,893][52060] Updated weights for policy 0, policy_version 96380 (0.0009) [2023-10-08 03:51:30,968][52059] Updated weights for policy 1, policy_version 97602 (0.0008) [2023-10-08 03:51:31,210][50642] Fps is (10 sec: 16384.1, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 198639616. Throughput: 0: 1699.1, 1: 1744.3. Samples: 49670850. Policy #0 lag: (min: 31.0, avg: 31.9, max: 51.0) [2023-10-08 03:51:31,211][50642] Avg episode reward: [(0, '15.020'), (1, '24.370')] [2023-10-08 03:51:31,332][52059] Updated weights for policy 1, policy_version 97612 (0.0007) [2023-10-08 03:51:31,686][52059] Updated weights for policy 1, policy_version 97622 (0.0007) [2023-10-08 03:51:32,058][52059] Updated weights for policy 1, policy_version 97632 (0.0007) [2023-10-08 03:51:34,914][52060] Updated weights for policy 0, policy_version 96390 (0.0007) [2023-10-08 03:51:35,277][52060] Updated weights for policy 0, policy_version 96400 (0.0007) [2023-10-08 03:51:35,646][52060] Updated weights for policy 0, policy_version 96410 (0.0008) [2023-10-08 03:51:36,002][52059] Updated weights for policy 1, policy_version 97642 (0.0007) [2023-10-08 03:51:36,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 198705152. Throughput: 0: 1721.1, 1: 1725.6. Samples: 49681088. Policy #0 lag: (min: 31.0, avg: 31.9, max: 51.0) [2023-10-08 03:51:36,211][50642] Avg episode reward: [(0, '15.850'), (1, '23.290')] [2023-10-08 03:51:36,369][52059] Updated weights for policy 1, policy_version 97652 (0.0009) [2023-10-08 03:51:36,734][52059] Updated weights for policy 1, policy_version 97662 (0.0007) [2023-10-08 03:51:39,471][52060] Updated weights for policy 0, policy_version 96420 (0.0008) [2023-10-08 03:51:39,836][52060] Updated weights for policy 0, policy_version 96430 (0.0010) [2023-10-08 03:51:40,210][52060] Updated weights for policy 0, policy_version 96440 (0.0009) [2023-10-08 03:51:40,569][52059] Updated weights for policy 1, policy_version 97672 (0.0009) [2023-10-08 03:51:40,929][52059] Updated weights for policy 1, policy_version 97682 (0.0008) [2023-10-08 03:51:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 198770688. Throughput: 0: 1712.7, 1: 1745.3. Samples: 49701966. Policy #0 lag: (min: 31.0, avg: 31.9, max: 51.0) [2023-10-08 03:51:41,211][50642] Avg episode reward: [(0, '14.520'), (1, '24.900')] [2023-10-08 03:51:41,295][52059] Updated weights for policy 1, policy_version 97692 (0.0007) [2023-10-08 03:51:44,199][52060] Updated weights for policy 0, policy_version 96450 (0.0008) [2023-10-08 03:51:44,565][52060] Updated weights for policy 0, policy_version 96460 (0.0010) [2023-10-08 03:51:44,941][52060] Updated weights for policy 0, policy_version 96470 (0.0010) [2023-10-08 03:51:45,234][52059] Updated weights for policy 1, policy_version 97702 (0.0007) [2023-10-08 03:51:45,305][52060] Updated weights for policy 0, policy_version 96480 (0.0009) [2023-10-08 03:51:45,612][52059] Updated weights for policy 1, policy_version 97712 (0.0007) [2023-10-08 03:51:45,978][52059] Updated weights for policy 1, policy_version 97722 (0.0007) [2023-10-08 03:51:46,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 198868992. Throughput: 0: 1700.0, 1: 1730.7. Samples: 49721704. Policy #0 lag: (min: 31.0, avg: 31.9, max: 51.0) [2023-10-08 03:51:46,211][50642] Avg episode reward: [(0, '16.590'), (1, '24.320')] [2023-10-08 03:51:49,249][52060] Updated weights for policy 0, policy_version 96490 (0.0007) [2023-10-08 03:51:49,623][52060] Updated weights for policy 0, policy_version 96500 (0.0007) [2023-10-08 03:51:49,897][52059] Updated weights for policy 1, policy_version 97732 (0.0008) [2023-10-08 03:51:49,989][52060] Updated weights for policy 0, policy_version 96510 (0.0007) [2023-10-08 03:51:50,261][52059] Updated weights for policy 1, policy_version 97742 (0.0009) [2023-10-08 03:51:50,623][52059] Updated weights for policy 1, policy_version 97752 (0.0010) [2023-10-08 03:51:51,210][50642] Fps is (10 sec: 16383.9, 60 sec: 14199.4, 300 sec: 13884.8). Total num frames: 198934528. Throughput: 0: 1731.6, 1: 1746.4. Samples: 49733466. Policy #0 lag: (min: 31.0, avg: 31.9, max: 51.0) [2023-10-08 03:51:51,211][50642] Avg episode reward: [(0, '14.080'), (1, '27.020')] [2023-10-08 03:51:54,108][52060] Updated weights for policy 0, policy_version 96520 (0.0009) [2023-10-08 03:51:54,474][52060] Updated weights for policy 0, policy_version 96530 (0.0008) [2023-10-08 03:51:54,668][52059] Updated weights for policy 1, policy_version 97762 (0.0009) [2023-10-08 03:51:54,854][52060] Updated weights for policy 0, policy_version 96540 (0.0007) [2023-10-08 03:51:55,037][52059] Updated weights for policy 1, policy_version 97772 (0.0009) [2023-10-08 03:51:55,401][52059] Updated weights for policy 1, policy_version 97782 (0.0009) [2023-10-08 03:51:55,758][52059] Updated weights for policy 1, policy_version 97792 (0.0011) [2023-10-08 03:51:56,210][50642] Fps is (10 sec: 13107.0, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 199000064. Throughput: 0: 1705.3, 1: 1738.1. Samples: 49753560. Policy #0 lag: (min: 31.0, avg: 31.9, max: 51.0) [2023-10-08 03:51:56,211][50642] Avg episode reward: [(0, '14.680'), (1, '28.090')] [2023-10-08 03:51:58,753][52060] Updated weights for policy 0, policy_version 96550 (0.0008) [2023-10-08 03:51:59,122][52060] Updated weights for policy 0, policy_version 96560 (0.0008) [2023-10-08 03:51:59,489][52060] Updated weights for policy 0, policy_version 96570 (0.0008) [2023-10-08 03:51:59,571][52059] Updated weights for policy 1, policy_version 97802 (0.0009) [2023-10-08 03:51:59,925][52059] Updated weights for policy 1, policy_version 97812 (0.0009) [2023-10-08 03:52:00,299][52059] Updated weights for policy 1, policy_version 97822 (0.0009) [2023-10-08 03:52:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 199065600. Throughput: 0: 1709.2, 1: 1716.1. Samples: 49773776. Policy #0 lag: (min: 31.0, avg: 31.9, max: 51.0) [2023-10-08 03:52:01,211][50642] Avg episode reward: [(0, '16.120'), (1, '25.700')] [2023-10-08 03:52:03,496][52060] Updated weights for policy 0, policy_version 96580 (0.0008) [2023-10-08 03:52:03,858][52060] Updated weights for policy 0, policy_version 96590 (0.0008) [2023-10-08 03:52:04,232][52060] Updated weights for policy 0, policy_version 96600 (0.0007) [2023-10-08 03:52:04,269][52059] Updated weights for policy 1, policy_version 97832 (0.0008) [2023-10-08 03:52:04,641][52059] Updated weights for policy 1, policy_version 97842 (0.0008) [2023-10-08 03:52:05,006][52059] Updated weights for policy 1, policy_version 97852 (0.0008) [2023-10-08 03:52:06,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 199131136. Throughput: 0: 1718.5, 1: 1746.9. Samples: 49785260. Policy #0 lag: (min: 31.0, avg: 31.9, max: 51.0) [2023-10-08 03:52:06,211][50642] Avg episode reward: [(0, '15.180'), (1, '22.470')] [2023-10-08 03:52:08,293][52060] Updated weights for policy 0, policy_version 96610 (0.0009) [2023-10-08 03:52:08,716][52060] Updated weights for policy 0, policy_version 96620 (0.0009) [2023-10-08 03:52:09,057][52059] Updated weights for policy 1, policy_version 97862 (0.0007) [2023-10-08 03:52:09,075][52060] Updated weights for policy 0, policy_version 96630 (0.0007) [2023-10-08 03:52:09,423][52059] Updated weights for policy 1, policy_version 97872 (0.0008) [2023-10-08 03:52:09,437][52060] Updated weights for policy 0, policy_version 96640 (0.0008) [2023-10-08 03:52:09,786][52059] Updated weights for policy 1, policy_version 97882 (0.0009) [2023-10-08 03:52:11,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 199196672. Throughput: 0: 1693.3, 1: 1719.1. Samples: 49804132. Policy #0 lag: (min: 31.0, avg: 31.9, max: 51.0) [2023-10-08 03:52:11,211][50642] Avg episode reward: [(0, '16.340'), (1, '24.390')] [2023-10-08 03:52:13,548][52060] Updated weights for policy 0, policy_version 96650 (0.0007) [2023-10-08 03:52:13,581][52059] Updated weights for policy 1, policy_version 97892 (0.0009) [2023-10-08 03:52:13,921][52060] Updated weights for policy 0, policy_version 96660 (0.0008) [2023-10-08 03:52:13,951][52059] Updated weights for policy 1, policy_version 97902 (0.0008) [2023-10-08 03:52:14,285][52060] Updated weights for policy 0, policy_version 96670 (0.0008) [2023-10-08 03:52:14,311][52059] Updated weights for policy 1, policy_version 97912 (0.0008) [2023-10-08 03:52:16,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 199262208. Throughput: 0: 1710.2, 1: 1719.6. Samples: 49825190. Policy #0 lag: (min: 31.0, avg: 31.9, max: 51.0) [2023-10-08 03:52:16,211][50642] Avg episode reward: [(0, '16.100'), (1, '25.690')] [2023-10-08 03:52:18,071][52059] Updated weights for policy 1, policy_version 97922 (0.0007) [2023-10-08 03:52:18,308][52060] Updated weights for policy 0, policy_version 96680 (0.0007) [2023-10-08 03:52:18,427][52059] Updated weights for policy 1, policy_version 97932 (0.0008) [2023-10-08 03:52:18,673][52060] Updated weights for policy 0, policy_version 96690 (0.0007) [2023-10-08 03:52:18,801][52059] Updated weights for policy 1, policy_version 97942 (0.0007) [2023-10-08 03:52:19,043][52060] Updated weights for policy 0, policy_version 96700 (0.0008) [2023-10-08 03:52:19,159][52059] Updated weights for policy 1, policy_version 97952 (0.0009) [2023-10-08 03:52:21,210][50642] Fps is (10 sec: 13107.4, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 199327744. Throughput: 0: 1699.3, 1: 1737.6. Samples: 49835748. Policy #0 lag: (min: 31.0, avg: 31.9, max: 51.0) [2023-10-08 03:52:21,211][50642] Avg episode reward: [(0, '16.160'), (1, '29.180')] [2023-10-08 03:52:23,085][52060] Updated weights for policy 0, policy_version 96710 (0.0008) [2023-10-08 03:52:23,241][52059] Updated weights for policy 1, policy_version 97962 (0.0008) [2023-10-08 03:52:23,455][52060] Updated weights for policy 0, policy_version 96720 (0.0009) [2023-10-08 03:52:23,615][52059] Updated weights for policy 1, policy_version 97972 (0.0008) [2023-10-08 03:52:23,833][52060] Updated weights for policy 0, policy_version 96730 (0.0009) [2023-10-08 03:52:23,971][52059] Updated weights for policy 1, policy_version 97982 (0.0009) [2023-10-08 03:52:26,210][50642] Fps is (10 sec: 13106.9, 60 sec: 14199.4, 300 sec: 13773.7). Total num frames: 199393280. Throughput: 0: 1695.7, 1: 1728.0. Samples: 49856034. Policy #0 lag: (min: 31.0, avg: 31.9, max: 51.0) [2023-10-08 03:52:26,211][50642] Avg episode reward: [(0, '15.800'), (1, '25.570')] [2023-10-08 03:52:27,778][52060] Updated weights for policy 0, policy_version 96740 (0.0009) [2023-10-08 03:52:27,962][52059] Updated weights for policy 1, policy_version 97992 (0.0008) [2023-10-08 03:52:28,149][52060] Updated weights for policy 0, policy_version 96750 (0.0008) [2023-10-08 03:52:28,323][52059] Updated weights for policy 1, policy_version 98002 (0.0008) [2023-10-08 03:52:28,512][52060] Updated weights for policy 0, policy_version 96760 (0.0008) [2023-10-08 03:52:28,682][52059] Updated weights for policy 1, policy_version 98012 (0.0009) [2023-10-08 03:52:31,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 199458816. Throughput: 0: 1712.8, 1: 1750.9. Samples: 49877568. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) [2023-10-08 03:52:31,211][50642] Avg episode reward: [(0, '15.610'), (1, '23.760')] [2023-10-08 03:52:32,521][52060] Updated weights for policy 0, policy_version 96770 (0.0008) [2023-10-08 03:52:32,655][52059] Updated weights for policy 1, policy_version 98022 (0.0009) [2023-10-08 03:52:32,900][52060] Updated weights for policy 0, policy_version 96780 (0.0007) [2023-10-08 03:52:33,041][52059] Updated weights for policy 1, policy_version 98032 (0.0009) [2023-10-08 03:52:33,257][52060] Updated weights for policy 0, policy_version 96790 (0.0007) [2023-10-08 03:52:33,405][52059] Updated weights for policy 1, policy_version 98042 (0.0008) [2023-10-08 03:52:33,632][52060] Updated weights for policy 0, policy_version 96800 (0.0007) [2023-10-08 03:52:36,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 199524352. Throughput: 0: 1678.0, 1: 1726.7. Samples: 49886676. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) [2023-10-08 03:52:36,211][50642] Avg episode reward: [(0, '15.080'), (1, '24.320')] [2023-10-08 03:52:37,099][52059] Updated weights for policy 1, policy_version 98052 (0.0009) [2023-10-08 03:52:37,396][52060] Updated weights for policy 0, policy_version 96810 (0.0008) [2023-10-08 03:52:37,465][52059] Updated weights for policy 1, policy_version 98062 (0.0008) [2023-10-08 03:52:37,769][52060] Updated weights for policy 0, policy_version 96820 (0.0009) [2023-10-08 03:52:37,831][52059] Updated weights for policy 1, policy_version 98072 (0.0007) [2023-10-08 03:52:38,142][52060] Updated weights for policy 0, policy_version 96830 (0.0008) [2023-10-08 03:52:41,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 199589888. Throughput: 0: 1701.1, 1: 1734.5. Samples: 49908160. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) [2023-10-08 03:52:41,211][50642] Avg episode reward: [(0, '16.070'), (1, '28.650')] [2023-10-08 03:52:41,843][52059] Updated weights for policy 1, policy_version 98082 (0.0009) [2023-10-08 03:52:42,207][52060] Updated weights for policy 0, policy_version 96840 (0.0009) [2023-10-08 03:52:42,213][52059] Updated weights for policy 1, policy_version 98092 (0.0008) [2023-10-08 03:52:42,575][52060] Updated weights for policy 0, policy_version 96850 (0.0008) [2023-10-08 03:52:42,579][52059] Updated weights for policy 1, policy_version 98102 (0.0008) [2023-10-08 03:52:42,934][52060] Updated weights for policy 0, policy_version 96860 (0.0007) [2023-10-08 03:52:42,942][52059] Updated weights for policy 1, policy_version 98112 (0.0007) [2023-10-08 03:52:46,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 199655424. Throughput: 0: 1702.5, 1: 1757.4. Samples: 49929470. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) [2023-10-08 03:52:46,211][50642] Avg episode reward: [(0, '16.240'), (1, '28.220')] [2023-10-08 03:52:46,222][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000096864_99188736.pth... [2023-10-08 03:52:46,222][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000098112_100466688.pth... [2023-10-08 03:52:46,257][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000096512_98828288.pth [2023-10-08 03:52:46,259][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000095264_97550336.pth [2023-10-08 03:52:46,879][52059] Updated weights for policy 1, policy_version 98122 (0.0007) [2023-10-08 03:52:46,987][52060] Updated weights for policy 0, policy_version 96870 (0.0008) [2023-10-08 03:52:47,246][52059] Updated weights for policy 1, policy_version 98132 (0.0007) [2023-10-08 03:52:47,346][52060] Updated weights for policy 0, policy_version 96880 (0.0008) [2023-10-08 03:52:47,609][52059] Updated weights for policy 1, policy_version 98142 (0.0009) [2023-10-08 03:52:47,720][52060] Updated weights for policy 0, policy_version 96890 (0.0009) [2023-10-08 03:52:51,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 199720960. Throughput: 0: 1684.6, 1: 1732.8. Samples: 49939044. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) [2023-10-08 03:52:51,211][50642] Avg episode reward: [(0, '15.640'), (1, '24.190')] [2023-10-08 03:52:51,578][52059] Updated weights for policy 1, policy_version 98152 (0.0008) [2023-10-08 03:52:51,641][52060] Updated weights for policy 0, policy_version 96900 (0.0010) [2023-10-08 03:52:51,930][52059] Updated weights for policy 1, policy_version 98162 (0.0008) [2023-10-08 03:52:52,005][52060] Updated weights for policy 0, policy_version 96910 (0.0009) [2023-10-08 03:52:52,294][52059] Updated weights for policy 1, policy_version 98172 (0.0007) [2023-10-08 03:52:52,380][52060] Updated weights for policy 0, policy_version 96920 (0.0008) [2023-10-08 03:52:55,947][52059] Updated weights for policy 1, policy_version 98182 (0.0007) [2023-10-08 03:52:56,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 199786496. Throughput: 0: 1712.0, 1: 1765.1. Samples: 49960600. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) [2023-10-08 03:52:56,211][50642] Avg episode reward: [(0, '16.330'), (1, '24.980')] [2023-10-08 03:52:56,310][52059] Updated weights for policy 1, policy_version 98192 (0.0007) [2023-10-08 03:52:56,393][52060] Updated weights for policy 0, policy_version 96930 (0.0009) [2023-10-08 03:52:56,675][52059] Updated weights for policy 1, policy_version 98202 (0.0007) [2023-10-08 03:52:56,799][52060] Updated weights for policy 0, policy_version 96940 (0.0008) [2023-10-08 03:52:57,156][52060] Updated weights for policy 0, policy_version 96950 (0.0007) [2023-10-08 03:52:57,534][52060] Updated weights for policy 0, policy_version 96960 (0.0007) [2023-10-08 03:53:00,562][52059] Updated weights for policy 1, policy_version 98212 (0.0007) [2023-10-08 03:53:00,918][52059] Updated weights for policy 1, policy_version 98222 (0.0008) [2023-10-08 03:53:01,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 199852032. Throughput: 0: 1715.0, 1: 1758.0. Samples: 49981476. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) [2023-10-08 03:53:01,211][50642] Avg episode reward: [(0, '16.090'), (1, '25.350')] [2023-10-08 03:53:01,278][52059] Updated weights for policy 1, policy_version 98232 (0.0009) [2023-10-08 03:53:01,562][52060] Updated weights for policy 0, policy_version 96970 (0.0007) [2023-10-08 03:53:01,927][52060] Updated weights for policy 0, policy_version 96980 (0.0011) [2023-10-08 03:53:02,308][52060] Updated weights for policy 0, policy_version 96990 (0.0009) [2023-10-08 03:53:05,178][52059] Updated weights for policy 1, policy_version 98242 (0.0008) [2023-10-08 03:53:05,532][52059] Updated weights for policy 1, policy_version 98252 (0.0009) [2023-10-08 03:53:05,900][52059] Updated weights for policy 1, policy_version 98262 (0.0009) [2023-10-08 03:53:06,210][50642] Fps is (10 sec: 13106.9, 60 sec: 13107.2, 300 sec: 13662.6). Total num frames: 199917568. Throughput: 0: 1705.7, 1: 1754.7. Samples: 49991470. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) [2023-10-08 03:53:06,211][50642] Avg episode reward: [(0, '15.220'), (1, '27.410')] [2023-10-08 03:53:06,256][52059] Updated weights for policy 1, policy_version 98272 (0.0007) [2023-10-08 03:53:06,269][52060] Updated weights for policy 0, policy_version 97000 (0.0008) [2023-10-08 03:53:06,642][52060] Updated weights for policy 0, policy_version 97010 (0.0007) [2023-10-08 03:53:07,010][52060] Updated weights for policy 0, policy_version 97020 (0.0007) [2023-10-08 03:53:10,133][52059] Updated weights for policy 1, policy_version 98282 (0.0011) [2023-10-08 03:53:10,507][52059] Updated weights for policy 1, policy_version 98292 (0.0010) [2023-10-08 03:53:10,868][52059] Updated weights for policy 1, policy_version 98302 (0.0008) [2023-10-08 03:53:11,110][52060] Updated weights for policy 0, policy_version 97030 (0.0007) [2023-10-08 03:53:11,210][50642] Fps is (10 sec: 16383.8, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 200015872. Throughput: 0: 1717.7, 1: 1759.6. Samples: 50012512. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) [2023-10-08 03:53:11,211][50642] Avg episode reward: [(0, '15.320'), (1, '28.730')] [2023-10-08 03:53:11,482][52060] Updated weights for policy 0, policy_version 97040 (0.0009) [2023-10-08 03:53:11,847][52060] Updated weights for policy 0, policy_version 97050 (0.0009) [2023-10-08 03:53:14,802][52059] Updated weights for policy 1, policy_version 98312 (0.0008) [2023-10-08 03:53:15,174][52059] Updated weights for policy 1, policy_version 98322 (0.0007) [2023-10-08 03:53:15,539][52059] Updated weights for policy 1, policy_version 98332 (0.0008) [2023-10-08 03:53:15,671][52060] Updated weights for policy 0, policy_version 97060 (0.0011) [2023-10-08 03:53:16,034][52060] Updated weights for policy 0, policy_version 97070 (0.0009) [2023-10-08 03:53:16,210][50642] Fps is (10 sec: 16384.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 200081408. Throughput: 0: 1709.2, 1: 1730.8. Samples: 50032372. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) [2023-10-08 03:53:16,211][50642] Avg episode reward: [(0, '17.210'), (1, '24.410')] [2023-10-08 03:53:16,406][52060] Updated weights for policy 0, policy_version 97080 (0.0009) [2023-10-08 03:53:19,609][52059] Updated weights for policy 1, policy_version 98342 (0.0007) [2023-10-08 03:53:20,008][52059] Updated weights for policy 1, policy_version 98352 (0.0008) [2023-10-08 03:53:20,365][52059] Updated weights for policy 1, policy_version 98362 (0.0008) [2023-10-08 03:53:20,377][52060] Updated weights for policy 0, policy_version 97090 (0.0009) [2023-10-08 03:53:20,734][52060] Updated weights for policy 0, policy_version 97100 (0.0010) [2023-10-08 03:53:21,107][52060] Updated weights for policy 0, policy_version 97110 (0.0008) [2023-10-08 03:53:21,210][50642] Fps is (10 sec: 13107.3, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 200146944. Throughput: 0: 1718.4, 1: 1765.2. Samples: 50043440. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) [2023-10-08 03:53:21,211][50642] Avg episode reward: [(0, '15.690'), (1, '25.020')] [2023-10-08 03:53:21,476][52060] Updated weights for policy 0, policy_version 97120 (0.0007) [2023-10-08 03:53:24,305][52059] Updated weights for policy 1, policy_version 98372 (0.0009) [2023-10-08 03:53:24,668][52059] Updated weights for policy 1, policy_version 98382 (0.0010) [2023-10-08 03:53:25,028][52059] Updated weights for policy 1, policy_version 98392 (0.0008) [2023-10-08 03:53:25,480][52060] Updated weights for policy 0, policy_version 97130 (0.0008) [2023-10-08 03:53:25,852][52060] Updated weights for policy 0, policy_version 97140 (0.0009) [2023-10-08 03:53:26,210][50642] Fps is (10 sec: 13107.5, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 200212480. Throughput: 0: 1717.6, 1: 1740.8. Samples: 50063788. Policy #0 lag: (min: 31.0, avg: 37.3, max: 63.0) [2023-10-08 03:53:26,211][50642] Avg episode reward: [(0, '15.400'), (1, '24.650')] [2023-10-08 03:53:26,219][52060] Updated weights for policy 0, policy_version 97150 (0.0009) [2023-10-08 03:53:28,899][52059] Updated weights for policy 1, policy_version 98402 (0.0008) [2023-10-08 03:53:29,258][52059] Updated weights for policy 1, policy_version 98412 (0.0008) [2023-10-08 03:53:29,618][52059] Updated weights for policy 1, policy_version 98422 (0.0008) [2023-10-08 03:53:29,986][52059] Updated weights for policy 1, policy_version 98432 (0.0007) [2023-10-08 03:53:30,238][52060] Updated weights for policy 0, policy_version 97160 (0.0008) [2023-10-08 03:53:30,603][52060] Updated weights for policy 0, policy_version 97170 (0.0008) [2023-10-08 03:53:30,981][52060] Updated weights for policy 0, policy_version 97180 (0.0010) [2023-10-08 03:53:31,210][50642] Fps is (10 sec: 16383.7, 60 sec: 14199.4, 300 sec: 13884.7). Total num frames: 200310784. Throughput: 0: 1696.7, 1: 1730.4. Samples: 50083688. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 03:53:31,211][50642] Avg episode reward: [(0, '17.150'), (1, '25.130')] [2023-10-08 03:53:33,759][52059] Updated weights for policy 1, policy_version 98442 (0.0009) [2023-10-08 03:53:34,135][52059] Updated weights for policy 1, policy_version 98452 (0.0010) [2023-10-08 03:53:34,493][52059] Updated weights for policy 1, policy_version 98462 (0.0007) [2023-10-08 03:53:34,905][52060] Updated weights for policy 0, policy_version 97190 (0.0010) [2023-10-08 03:53:35,271][52060] Updated weights for policy 0, policy_version 97200 (0.0008) [2023-10-08 03:53:35,645][52060] Updated weights for policy 0, policy_version 97210 (0.0012) [2023-10-08 03:53:36,210][50642] Fps is (10 sec: 16384.0, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 200376320. Throughput: 0: 1714.3, 1: 1745.9. Samples: 50094750. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 03:53:36,211][50642] Avg episode reward: [(0, '14.990'), (1, '27.620')] [2023-10-08 03:53:38,491][52059] Updated weights for policy 1, policy_version 98472 (0.0008) [2023-10-08 03:53:38,864][52059] Updated weights for policy 1, policy_version 98482 (0.0007) [2023-10-08 03:53:39,221][52059] Updated weights for policy 1, policy_version 98492 (0.0009) [2023-10-08 03:53:39,722][52060] Updated weights for policy 0, policy_version 97220 (0.0009) [2023-10-08 03:53:40,087][52060] Updated weights for policy 0, policy_version 97230 (0.0007) [2023-10-08 03:53:40,448][52060] Updated weights for policy 0, policy_version 97240 (0.0007) [2023-10-08 03:53:41,210][50642] Fps is (10 sec: 13107.5, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 200441856. Throughput: 0: 1705.2, 1: 1723.7. Samples: 50114900. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 03:53:41,211][50642] Avg episode reward: [(0, '16.090'), (1, '26.410')] [2023-10-08 03:53:43,265][52059] Updated weights for policy 1, policy_version 98502 (0.0008) [2023-10-08 03:53:43,631][52059] Updated weights for policy 1, policy_version 98512 (0.0008) [2023-10-08 03:53:43,992][52059] Updated weights for policy 1, policy_version 98522 (0.0008) [2023-10-08 03:53:44,540][52060] Updated weights for policy 0, policy_version 97250 (0.0009) [2023-10-08 03:53:44,945][52060] Updated weights for policy 0, policy_version 97260 (0.0009) [2023-10-08 03:53:45,320][52060] Updated weights for policy 0, policy_version 97270 (0.0007) [2023-10-08 03:53:45,683][52060] Updated weights for policy 0, policy_version 97280 (0.0008) [2023-10-08 03:53:46,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.7). Total num frames: 200507392. Throughput: 0: 1682.8, 1: 1733.5. Samples: 50135210. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 03:53:46,211][50642] Avg episode reward: [(0, '17.760'), (1, '24.970')] [2023-10-08 03:53:47,895][52059] Updated weights for policy 1, policy_version 98532 (0.0008) [2023-10-08 03:53:48,254][52059] Updated weights for policy 1, policy_version 98542 (0.0009) [2023-10-08 03:53:48,619][52059] Updated weights for policy 1, policy_version 98552 (0.0008) [2023-10-08 03:53:49,564][52060] Updated weights for policy 0, policy_version 97290 (0.0008) [2023-10-08 03:53:49,930][52060] Updated weights for policy 0, policy_version 97300 (0.0008) [2023-10-08 03:53:50,299][52060] Updated weights for policy 0, policy_version 97310 (0.0008) [2023-10-08 03:53:51,210][50642] Fps is (10 sec: 13107.2, 60 sec: 14199.5, 300 sec: 13884.8). Total num frames: 200572928. Throughput: 0: 1712.5, 1: 1719.5. Samples: 50145908. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 03:53:51,211][50642] Avg episode reward: [(0, '16.610'), (1, '23.810')] [2023-10-08 03:53:52,442][52059] Updated weights for policy 1, policy_version 98562 (0.0008) [2023-10-08 03:53:52,796][52059] Updated weights for policy 1, policy_version 98572 (0.0009) [2023-10-08 03:53:53,159][52059] Updated weights for policy 1, policy_version 98582 (0.0010) [2023-10-08 03:53:53,521][52059] Updated weights for policy 1, policy_version 98592 (0.0008) [2023-10-08 03:53:54,334][52060] Updated weights for policy 0, policy_version 97320 (0.0009) [2023-10-08 03:53:54,699][52060] Updated weights for policy 0, policy_version 97330 (0.0009) [2023-10-08 03:53:55,068][52060] Updated weights for policy 0, policy_version 97340 (0.0008) [2023-10-08 03:53:56,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 200638464. Throughput: 0: 1696.8, 1: 1723.2. Samples: 50166408. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 03:53:56,211][50642] Avg episode reward: [(0, '17.530'), (1, '25.040')] [2023-10-08 03:53:57,442][52059] Updated weights for policy 1, policy_version 98602 (0.0008) [2023-10-08 03:53:57,805][52059] Updated weights for policy 1, policy_version 98612 (0.0007) [2023-10-08 03:53:58,161][52059] Updated weights for policy 1, policy_version 98622 (0.0009) [2023-10-08 03:53:59,175][52060] Updated weights for policy 0, policy_version 97350 (0.0008) [2023-10-08 03:53:59,550][52060] Updated weights for policy 0, policy_version 97360 (0.0007) [2023-10-08 03:53:59,906][52060] Updated weights for policy 0, policy_version 97370 (0.0010) [2023-10-08 03:54:01,210][50642] Fps is (10 sec: 13107.1, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 200704000. Throughput: 0: 1686.2, 1: 1755.7. Samples: 50187256. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 03:54:01,211][50642] Avg episode reward: [(0, '17.270'), (1, '27.650')] [2023-10-08 03:54:01,960][52059] Updated weights for policy 1, policy_version 98632 (0.0009) [2023-10-08 03:54:02,330][52059] Updated weights for policy 1, policy_version 98642 (0.0007) [2023-10-08 03:54:02,695][52059] Updated weights for policy 1, policy_version 98652 (0.0008) [2023-10-08 03:54:03,840][52060] Updated weights for policy 0, policy_version 97380 (0.0010) [2023-10-08 03:54:04,213][52060] Updated weights for policy 0, policy_version 97390 (0.0007) [2023-10-08 03:54:04,585][52060] Updated weights for policy 0, policy_version 97400 (0.0007) [2023-10-08 03:54:06,210][50642] Fps is (10 sec: 13107.3, 60 sec: 14199.5, 300 sec: 13773.7). Total num frames: 200769536. Throughput: 0: 1705.3, 1: 1724.9. Samples: 50197802. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 03:54:06,211][50642] Avg episode reward: [(0, '16.520'), (1, '26.470')] [2023-10-08 03:54:06,692][52059] Updated weights for policy 1, policy_version 98662 (0.0007) [2023-10-08 03:54:07,074][52059] Updated weights for policy 1, policy_version 98672 (0.0008) [2023-10-08 03:54:07,441][52059] Updated weights for policy 1, policy_version 98682 (0.0008) [2023-10-08 03:54:08,467][52060] Updated weights for policy 0, policy_version 97410 (0.0009) [2023-10-08 03:54:08,841][52060] Updated weights for policy 0, policy_version 97420 (0.0008) [2023-10-08 03:54:09,208][52060] Updated weights for policy 0, policy_version 97430 (0.0008) [2023-10-08 03:54:09,580][52060] Updated weights for policy 0, policy_version 97440 (0.0010) [2023-10-08 03:54:11,210][50642] Fps is (10 sec: 13107.0, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 200835072. Throughput: 0: 1682.1, 1: 1749.3. Samples: 50218202. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 03:54:11,211][50642] Avg episode reward: [(0, '18.150'), (1, '22.790')] [2023-10-08 03:54:11,390][52059] Updated weights for policy 1, policy_version 98692 (0.0007) [2023-10-08 03:54:11,760][52059] Updated weights for policy 1, policy_version 98702 (0.0008) [2023-10-08 03:54:12,116][52059] Updated weights for policy 1, policy_version 98712 (0.0007) [2023-10-08 03:54:13,566][52060] Updated weights for policy 0, policy_version 97450 (0.0008) [2023-10-08 03:54:13,939][52060] Updated weights for policy 0, policy_version 97460 (0.0009) [2023-10-08 03:54:14,302][52060] Updated weights for policy 0, policy_version 97470 (0.0008) [2023-10-08 03:54:15,973][52059] Updated weights for policy 1, policy_version 98722 (0.0009) [2023-10-08 03:54:16,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 200900608. Throughput: 0: 1700.2, 1: 1763.5. Samples: 50239556. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 03:54:16,211][50642] Avg episode reward: [(0, '15.690'), (1, '24.140')] [2023-10-08 03:54:16,334][52059] Updated weights for policy 1, policy_version 98732 (0.0008) [2023-10-08 03:54:16,704][52059] Updated weights for policy 1, policy_version 98742 (0.0008) [2023-10-08 03:54:17,074][52059] Updated weights for policy 1, policy_version 98752 (0.0008) [2023-10-08 03:54:18,384][52060] Updated weights for policy 0, policy_version 97480 (0.0009) [2023-10-08 03:54:18,750][52060] Updated weights for policy 0, policy_version 97490 (0.0008) [2023-10-08 03:54:19,126][52060] Updated weights for policy 0, policy_version 97500 (0.0011) [2023-10-08 03:54:20,930][52059] Updated weights for policy 1, policy_version 98762 (0.0008) [2023-10-08 03:54:21,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 200966144. Throughput: 0: 1696.3, 1: 1743.5. Samples: 50249546. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 03:54:21,211][50642] Avg episode reward: [(0, '17.250'), (1, '24.220')] [2023-10-08 03:54:21,302][52059] Updated weights for policy 1, policy_version 98772 (0.0009) [2023-10-08 03:54:21,668][52059] Updated weights for policy 1, policy_version 98782 (0.0007) [2023-10-08 03:54:23,067][52060] Updated weights for policy 0, policy_version 97510 (0.0008) [2023-10-08 03:54:23,431][52060] Updated weights for policy 0, policy_version 97520 (0.0009) [2023-10-08 03:54:23,791][52060] Updated weights for policy 0, policy_version 97530 (0.0008) [2023-10-08 03:54:25,560][52059] Updated weights for policy 1, policy_version 98792 (0.0007) [2023-10-08 03:54:25,930][52059] Updated weights for policy 1, policy_version 98802 (0.0007) [2023-10-08 03:54:26,210][50642] Fps is (10 sec: 13107.2, 60 sec: 13653.3, 300 sec: 13662.6). Total num frames: 201031680. Throughput: 0: 1693.7, 1: 1763.4. Samples: 50270468. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 03:54:26,211][50642] Avg episode reward: [(0, '18.210'), (1, '27.710')] [2023-10-08 03:54:26,288][52059] Updated weights for policy 1, policy_version 98812 (0.0009) [2023-10-08 03:54:27,760][52060] Updated weights for policy 0, policy_version 97540 (0.0009) [2023-10-08 03:54:28,124][52060] Updated weights for policy 0, policy_version 97550 (0.0008) [2023-10-08 03:54:28,494][52060] Updated weights for policy 0, policy_version 97560 (0.0009) [2023-10-08 03:54:29,971][52059] Updated weights for policy 1, policy_version 98822 (0.0008) [2023-10-08 03:54:30,327][52059] Updated weights for policy 1, policy_version 98832 (0.0009) [2023-10-08 03:54:30,691][52059] Updated weights for policy 1, policy_version 98842 (0.0008) [2023-10-08 03:54:31,210][50642] Fps is (10 sec: 16384.2, 60 sec: 13653.4, 300 sec: 13773.7). Total num frames: 201129984. Throughput: 0: 1718.1, 1: 1737.6. Samples: 50290718. Policy #0 lag: (min: 13.0, avg: 13.0, max: 13.0) [2023-10-08 03:54:31,211][50642] Avg episode reward: [(0, '15.240'), (1, '25.750')] [2023-10-08 03:54:32,481][52060] Updated weights for policy 0, policy_version 97570 (0.0009) [2023-10-08 03:54:32,881][52060] Updated weights for policy 0, policy_version 97580 (0.0008) [2023-10-08 03:54:33,251][52060] Updated weights for policy 0, policy_version 97590 (0.0008) [2023-10-08 03:54:33,626][52060] Updated weights for policy 0, policy_version 97600 (0.0011) [2023-10-08 03:54:34,649][52059] Updated weights for policy 1, policy_version 98852 (0.0009) [2023-10-08 03:54:35,025][52059] Updated weights for policy 1, policy_version 98862 (0.0009) [2023-10-08 03:54:35,392][52059] Updated weights for policy 1, policy_version 98872 (0.0010) [2023-10-08 03:54:36,210][50642] Fps is (10 sec: 16383.9, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 201195520. Throughput: 0: 1688.3, 1: 1762.7. Samples: 50301202. Policy #0 lag: (min: 31.0, avg: 42.4, max: 63.0) [2023-10-08 03:54:36,211][50642] Avg episode reward: [(0, '17.970'), (1, '23.860')] [2023-10-08 03:54:37,493][52060] Updated weights for policy 0, policy_version 97610 (0.0008) [2023-10-08 03:54:37,862][52060] Updated weights for policy 0, policy_version 97620 (0.0008) [2023-10-08 03:54:38,231][52060] Updated weights for policy 0, policy_version 97630 (0.0009) [2023-10-08 03:54:39,217][52059] Updated weights for policy 1, policy_version 98882 (0.0009) [2023-10-08 03:54:39,582][52059] Updated weights for policy 1, policy_version 98892 (0.0009) [2023-10-08 03:54:39,950][52059] Updated weights for policy 1, policy_version 98902 (0.0010) [2023-10-08 03:54:40,311][52059] Updated weights for policy 1, policy_version 98912 (0.0008) [2023-10-08 03:54:41,210][50642] Fps is (10 sec: 13107.4, 60 sec: 13653.3, 300 sec: 13773.7). Total num frames: 201261056. Throughput: 0: 1714.9, 1: 1743.2. Samples: 50322026. Policy #0 lag: (min: 31.0, avg: 42.4, max: 63.0) [2023-10-08 03:54:41,211][50642] Avg episode reward: [(0, '16.280'), (1, '24.760')] [2023-10-08 03:54:42,262][52060] Updated weights for policy 0, policy_version 97640 (0.0009) [2023-10-08 03:54:42,623][52060] Updated weights for policy 0, policy_version 97650 (0.0009) [2023-10-08 03:54:42,992][52060] Updated weights for policy 0, policy_version 97660 (0.0007) [2023-10-08 03:54:44,217][52059] Updated weights for policy 1, policy_version 98922 (0.0007) [2023-10-08 03:54:44,574][52059] Updated weights for policy 1, policy_version 98932 (0.0008) [2023-10-08 03:54:44,941][52059] Updated weights for policy 1, policy_version 98942 (0.0008) [2023-10-08 03:54:45,013][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000098944_101318656.pth... [2023-10-08 03:54:45,013][52106] Stopping RolloutWorker_w11... [2023-10-08 03:54:45,013][51605] Stopping Batcher_0... [2023-10-08 03:54:45,013][52101] Stopping RolloutWorker_w6... [2023-10-08 03:54:45,013][52106] Loop rollout_proc11_evt_loop terminating... [2023-10-08 03:54:45,013][50642] Component RolloutWorker_w11 stopped! [2023-10-08 03:54:45,013][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000097664_100007936.pth... [2023-10-08 03:54:45,014][52107] Stopping RolloutWorker_w12... [2023-10-08 03:54:45,014][52101] Loop rollout_proc6_evt_loop terminating... [2023-10-08 03:54:45,014][50642] Component Batcher_0 stopped! [2023-10-08 03:54:45,014][52107] Loop rollout_proc12_evt_loop terminating... [2023-10-08 03:54:45,014][52096] Stopping RolloutWorker_w2... [2023-10-08 03:54:45,014][52102] Stopping RolloutWorker_w8... [2023-10-08 03:54:45,014][52105] Stopping RolloutWorker_w10... [2023-10-08 03:54:45,014][50642] Component RolloutWorker_w6 stopped! [2023-10-08 03:54:45,015][52105] Loop rollout_proc10_evt_loop terminating... [2023-10-08 03:54:45,015][52096] Loop rollout_proc2_evt_loop terminating... [2023-10-08 03:54:45,015][52102] Loop rollout_proc8_evt_loop terminating... [2023-10-08 03:54:45,015][50642] Component RolloutWorker_w12 stopped! [2023-10-08 03:54:45,015][52099] Stopping RolloutWorker_w3... [2023-10-08 03:54:45,015][50642] Component RolloutWorker_w8 stopped! [2023-10-08 03:54:45,016][52099] Loop rollout_proc3_evt_loop terminating... [2023-10-08 03:54:45,016][50642] Component RolloutWorker_w2 stopped! [2023-10-08 03:54:45,016][50642] Component RolloutWorker_w10 stopped! [2023-10-08 03:54:45,016][52100] Stopping RolloutWorker_w5... [2023-10-08 03:54:45,016][52098] Stopping RolloutWorker_w4... [2023-10-08 03:54:45,016][52100] Loop rollout_proc5_evt_loop terminating... [2023-10-08 03:54:45,016][50642] Component RolloutWorker_w3 stopped! [2023-10-08 03:54:45,017][52098] Loop rollout_proc4_evt_loop terminating... [2023-10-08 03:54:45,017][50642] Component RolloutWorker_w5 stopped! [2023-10-08 03:54:45,017][52095] Stopping RolloutWorker_w1... [2023-10-08 03:54:45,017][50642] Component RolloutWorker_w4 stopped! [2023-10-08 03:54:45,017][52103] Stopping RolloutWorker_w7... [2023-10-08 03:54:45,018][52095] Loop rollout_proc1_evt_loop terminating... [2023-10-08 03:54:45,018][50642] Component RolloutWorker_w1 stopped! [2023-10-08 03:54:45,018][52103] Loop rollout_proc7_evt_loop terminating... [2023-10-08 03:54:45,018][52108] Stopping RolloutWorker_w13... [2023-10-08 03:54:45,018][50642] Component RolloutWorker_w7 stopped! [2023-10-08 03:54:45,018][52108] Loop rollout_proc13_evt_loop terminating... [2023-10-08 03:54:45,018][50642] Component RolloutWorker_w13 stopped! [2023-10-08 03:54:45,019][52796] Stopping RolloutWorker_w15... [2023-10-08 03:54:45,019][50642] Component RolloutWorker_w15 stopped! [2023-10-08 03:54:45,019][52061] Stopping RolloutWorker_w0... [2023-10-08 03:54:45,020][52728] Stopping RolloutWorker_w14... [2023-10-08 03:54:45,020][52104] Stopping RolloutWorker_w9... [2023-10-08 03:54:45,020][52796] Loop rollout_proc15_evt_loop terminating... [2023-10-08 03:54:45,020][52061] Loop rollout_proc0_evt_loop terminating... [2023-10-08 03:54:45,020][52728] Loop rollout_proc14_evt_loop terminating... [2023-10-08 03:54:45,020][50642] Component RolloutWorker_w0 stopped! [2023-10-08 03:54:45,021][52104] Loop rollout_proc9_evt_loop terminating... [2023-10-08 03:54:45,021][50642] Component RolloutWorker_w14 stopped! [2023-10-08 03:54:45,021][50642] Component RolloutWorker_w9 stopped! [2023-10-08 03:54:45,026][50642] Component Batcher_1 stopped! [2023-10-08 03:54:45,014][51605] Loop batcher_evt_loop terminating... [2023-10-08 03:54:45,035][52060] Weights refcount: 2 0 [2023-10-08 03:54:45,036][52060] Stopping InferenceWorker_p0-w0... [2023-10-08 03:54:45,037][52060] Loop inference_proc0-0_evt_loop terminating... [2023-10-08 03:54:45,036][50642] Component InferenceWorker_p0-w0 stopped! [2023-10-08 03:54:45,037][52059] Weights refcount: 2 0 [2023-10-08 03:54:45,039][52059] Stopping InferenceWorker_p1-w0... [2023-10-08 03:54:45,039][52059] Loop inference_proc1-0_evt_loop terminating... [2023-10-08 03:54:45,039][50642] Component InferenceWorker_p1-w0 stopped! [2023-10-08 03:54:45,036][51710] Stopping Batcher_1... [2023-10-08 03:54:45,047][51710] Loop batcher_evt_loop terminating... [2023-10-08 03:54:45,048][51710] Removing ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000097312_99647488.pth [2023-10-08 03:54:45,048][51605] Removing ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000096064_98369536.pth [2023-10-08 03:54:45,052][51710] Saving ./train_atari/atari_amidar_APPO/checkpoint_p1/checkpoint_000098944_101318656.pth... [2023-10-08 03:54:45,053][51605] Saving ./train_atari/atari_amidar_APPO/checkpoint_p0/checkpoint_000097664_100007936.pth... [2023-10-08 03:54:45,094][51710] Stopping LearnerWorker_p1... [2023-10-08 03:54:45,094][51605] Stopping LearnerWorker_p0... [2023-10-08 03:54:45,094][51710] Loop learner_proc1_evt_loop terminating... [2023-10-08 03:54:45,094][51605] Loop learner_proc0_evt_loop terminating... [2023-10-08 03:54:45,094][50642] Component LearnerWorker_p1 stopped! [2023-10-08 03:54:45,095][50642] Component LearnerWorker_p0 stopped! [2023-10-08 03:54:45,095][50642] Waiting for process learner_proc0 to stop... [2023-10-08 03:54:45,905][50642] Waiting for process learner_proc1 to stop... [2023-10-08 03:54:45,906][50642] Waiting for process inference_proc0-0 to join... [2023-10-08 03:54:45,906][50642] Waiting for process inference_proc1-0 to join... [2023-10-08 03:54:45,907][50642] Waiting for process rollout_proc0 to join... [2023-10-08 03:54:45,908][50642] Waiting for process rollout_proc1 to join... [2023-10-08 03:54:45,908][50642] Waiting for process rollout_proc2 to join... [2023-10-08 03:54:45,909][50642] Waiting for process rollout_proc3 to join... [2023-10-08 03:54:45,910][50642] Waiting for process rollout_proc4 to join... [2023-10-08 03:54:45,910][50642] Waiting for process rollout_proc5 to join... [2023-10-08 03:54:45,911][50642] Waiting for process rollout_proc6 to join... [2023-10-08 03:54:45,911][50642] Waiting for process rollout_proc7 to join... [2023-10-08 03:54:45,912][50642] Waiting for process rollout_proc8 to join... [2023-10-08 03:54:45,913][50642] Waiting for process rollout_proc9 to join... [2023-10-08 03:54:45,913][50642] Waiting for process rollout_proc10 to join... [2023-10-08 03:54:45,914][50642] Waiting for process rollout_proc11 to join... [2023-10-08 03:54:45,915][50642] Waiting for process rollout_proc12 to join... [2023-10-08 03:54:45,915][50642] Waiting for process rollout_proc13 to join... [2023-10-08 03:54:45,916][50642] Waiting for process rollout_proc14 to join... [2023-10-08 03:54:45,917][50642] Waiting for process rollout_proc15 to join... [2023-10-08 03:54:45,917][50642] Batcher 0 profile tree view: batching: 169.1320, releasing_batches: 0.0934 [2023-10-08 03:54:45,917][50642] Batcher 1 profile tree view: batching: 171.5113, releasing_batches: 0.0926 [2023-10-08 03:54:45,917][50642] InferenceWorker_p0-w0 profile tree view: wait_policy: 0.0000 wait_policy_total: 2440.3741 update_model: 203.1704 weight_update: 0.0007 one_step: 0.0019 handle_policy_step: 11290.9860 deserialize: 64.3087, stack: 192.6779, obs_to_device_normalize: 2507.9955, forward: 5127.1664, prepare_outputs: 2434.9962, send_messages: 466.9913 [2023-10-08 03:54:45,918][50642] InferenceWorker_p1-w0 profile tree view: wait_policy: 0.0007 wait_policy_total: 2399.9655 update_model: 205.3717 weight_update: 0.0009 one_step: 0.0023 handle_policy_step: 11328.5911 deserialize: 64.7962, stack: 193.1677, obs_to_device_normalize: 2534.6771, forward: 5115.0962, prepare_outputs: 2452.7480, send_messages: 471.8355 [2023-10-08 03:54:45,918][50642] Learner 0 profile tree view: misc: 0.0198, prepare_batch: 269.2817 train: 3617.2671 epoch_init: 0.1887, minibatch_init: 13.1446, losses_postprocess: 887.6775, kl_divergence: 32.1125, update: 389.6401, after_optimizer: 2110.2863 calculate_losses: 167.1992 losses_init: 0.3982, forward_head: 56.2447, bptt_initial: 1.4586, bptt: 1.8073, tail: 38.3363, advantages_returns: 11.2136, losses: 43.8737 [2023-10-08 03:54:45,918][50642] Learner 1 profile tree view: misc: 0.0200, prepare_batch: 271.8025 train: 3626.1103 epoch_init: 0.1901, minibatch_init: 13.3127, losses_postprocess: 891.9607, kl_divergence: 31.9207, update: 388.0363, after_optimizer: 2115.6668 calculate_losses: 168.1105 losses_init: 0.4114, forward_head: 56.6119, bptt_initial: 1.4665, bptt: 1.8004, tail: 38.4240, advantages_returns: 11.2703, losses: 44.3805 [2023-10-08 03:54:45,918][50642] RolloutWorker_w0 profile tree view: wait_for_trajectories: 1.2305, enqueue_policy_requests: 406.8637, process_policy_outputs: 193.2737, env_step: 7357.0875, finalize_trajectories: 3.5332, complete_rollouts: 2.9252 post_env_step: 380.7238 process_env_step: 85.0578 [2023-10-08 03:54:45,918][50642] RolloutWorker_w15 profile tree view: wait_for_trajectories: 1.2303, enqueue_policy_requests: 409.4056, process_policy_outputs: 190.4964, env_step: 7351.2021, finalize_trajectories: 3.5309, complete_rollouts: 3.0003 post_env_step: 379.2536 process_env_step: 84.9854 [2023-10-08 03:54:45,919][50642] Loop Runner_EvtLoop terminating... [2023-10-08 03:54:45,919][50642] Runner profile tree view: main_loop: 14635.1143 [2023-10-08 03:54:45,919][50642] Collected {0: 100007936, 1: 101318656}, FPS: 13756.4